Abstract

While N6-methyladenosine (m6A) is a well-known epigenetic modification in bacterial DNA, it remained largely unstudied in eukaryotes. Recent studies have brought to the fore its potential epigenetic role across diverse eukaryotes, with biological consequences that are distinct from, and possibly even opposite to, those of the well-studied 5-methylcytosine mark. Adenine methyltransferases appear to have been independently acquired by eukaryotes on at least 13 occasions from prokaryotic restriction-modification and counter-restriction systems. In at least 4-5 instances these methyltransferases were recruited as RNA methylases. Thus, m6A marks in eukaryotic DNA and RNA might be more widespread and diversified than previously believed. Several m6A-binding protein domains from prokaryotes were also acquired by eukaryotes, facilitating the prediction of potential readers for these marks. Further, multiple lineages of the AlkB family of dioxygenases have been recruited as m6A demethylases. Although members of the TET/JBP family of dioxygenases have also been suggested to be m6A demethylases, this proposal needs more careful evaluation.
Detection of distant sequence similarities

Iterative sequence profile searches were done using PSI-BLAST [1] and the web version of the JACKHMMER (http://www.ebi.ac.uk/Tools/hmmer/search/jackhmmer) programs. For m6A DNA methylases, previously identified families were used as seeds in iterative profile searches against the non-redundant (NR) protein database of the National Center for Biotechnology Information (NCBI) and a local database that additionally contained sequences from completed eukaryotic genomes that have not been deposited in the NR database [2-5]. The HHpred program [6] was used for profile-profile comparisons; it compares HMMs derived from a given alignment, created using either PSI-BLAST or HHblits, against a library of HMMs created from Pfam and PDB. For previously known domains, the Pfam database [7] was used as a guide and augmented by the addition of newly detected divergent members using a local database of profiles.

Creation of multiple alignments

Similarity-based clustering for both classification and culling of nearly identical sequences was performed using the BLASTCLUST program (ftp://ftp.ncbi.nih.gov/blast/documents/blastclust.html). Multiple sequence alignments were built by the Kalign2 [8] and Muscle [9] programs, followed by manual adjustments on the basis of profile-profile and structural alignments.

Prediction of operons in prokaryotic genomes

Gene neighborhoods were obtained by isolating all conserved prokaryotic genes, in the neighborhood of the gene under consideration, which showed a separation of less than 70 nucleotides between their termini. Genes fulfilling this criterion and occurring in the same direction were considered likely to form operons. These were further filtered using BLASTCLUST (-L 0.3 -S 0.3) to cluster all those proteins that were encoded by the putative operons to determine conserved gene-neighborhoods.
If such conserved gene-neighborhoods were found across more than one major bacterial lineage (phylum), mapped using the NCBI taxonomy id, then they were seen as notable associations, which were further analyzed for functional significance.

Structural analysis

Structure similarity searches were performed using the DaliLite program [10], which orders alignments based on Z-scores derived from C-alpha matches. Secondary structures were predicted using the JPred program [11]. Structural visualization and manipulations were performed using the PyMol (http://pymol.org) program.

Phylogenetic analysis

Phylogenetic analysis was conducted using an approximately maximum-likelihood method implemented in the FastTree 2.1 [12] program under default parameters. Independent ML analysis was done using the MEGA5 program with the JTT substitution model and alpha parameter of 1. The in-house TASS package comprising Perl scripts was used to automate the analysis.

REFERENCES

1 Altschul SF, Madden TL, Schaffer AA, Zhang J, et al. 1997. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic acids research 25: 3389-402.
2 Iyer LM, Anantharaman V, Wolf MY, Aravind L. 2008. Comparative genomics of transcription factors and chromatin proteins in parasitic protists and other eukaryotes. International journal for parasitology 38: 1-31.
3 Iyer LM, Abhiman S, Aravind L. 2011. Natural history of eukaryotic DNA methylation systems. Progress in molecular biology and translational science 101: 25-104.
4 Iyer LM, Zhang D, Burroughs AM, Aravind L. 2013. Computational identification of novel biochemical systems involved in oxidation, glycosylation and other complex modifications of bases in DNA. Nucleic acids research 41: 7635-55.
5 Iyer LM, Zhang D, de Souza RF, Pukkila PJ, et al. 2014. Lineage-specific expansions of TET/JBP genes and a new class of DNA transposons shape fungal genomic and epigenetic landscapes. Proceedings of the National Academy of Sciences of the United States of America 111: 1676-83.
6 Soding J, Biegert A, Lupas AN. 2005. The HHpred interactive server for protein homology detection and structure prediction. Nucleic acids research 33: W244-8.
7 Finn RD, Bateman A, Clements J, Coggill P, et al. 2014. Pfam: the protein families database. Nucleic acids research 42: D222-30.
8 Lassmann T, Frings O, Sonnhammer EL. 2009. Kalign2: high-performance multiple alignment of protein and nucleotide sequences allowing external features. Nucleic acids research 37: 858-65.
9 Edgar RC. 2004. MUSCLE: a multiple sequence alignment method with reduced time and space complexity. BMC Bioinformatics 5: 113.
10 Holm L, Kaariainen S, Rosenstrom P, Schenkel A. 2008. Searching protein structure databases with DaliLite v.3. Bioinformatics 24: 2780-1.
11 Cole C, Barber JD, Barton GJ. 2008. The Jpred 3 secondary structure prediction server. Nucleic acids research 36: W197-201.
12 Price MN, Dehal PS, Arkin AP. 2010. FastTree 2--approximately maximum-likelihood trees for large alignments. PLoS One 5: e9490.
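
The operon criterion described in the methods (neighboring genes on the same strand whose termini are separated by fewer than 70 nucleotides) can be illustrated with a minimal Python sketch. This is not the in-house TASS code. The `Gene` record, the `putative_operon` function, and the use of 1-based inclusive coordinates are assumptions made here for illustration only, and the subsequent BLASTCLUST clustering step is not reproduced.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Gene:
    """Hypothetical gene record; coordinates assumed 1-based and inclusive."""
    name: str
    start: int
    end: int
    strand: str  # "+" or "-"


MAX_GAP = 70  # intergenic separation threshold stated in the methods


def putative_operon(genes, query_name, max_gap=MAX_GAP):
    """Return names of genes chained to the query gene by the operon criterion."""
    ordered = sorted(genes, key=lambda g: g.start)
    idx = next(i for i, g in enumerate(ordered) if g.name == query_name)

    def linked(upstream, downstream):
        # Same orientation and fewer than max_gap nucleotides between termini.
        # Overlapping genes yield a negative gap and are treated as linked.
        gap = downstream.start - upstream.end - 1
        return upstream.strand == downstream.strand and gap < max_gap

    # Extend the neighborhood to the left of the query gene.
    lo = idx
    while lo > 0 and linked(ordered[lo - 1], ordered[lo]):
        lo -= 1
    # Extend the neighborhood to the right of the query gene.
    hi = idx
    while hi < len(ordered) - 1 and linked(ordered[hi], ordered[hi + 1]):
        hi += 1
    return [g.name for g in ordered[lo:hi + 1]]


# Toy example with made-up coordinates.
example_genes = [
    Gene("a", 1, 300, "+"),
    Gene("b", 350, 900, "+"),
    Gene("mtase", 910, 1800, "+"),
    Gene("c", 1900, 2500, "+"),   # 99 nt downstream of mtase: excluded
    Gene("d", 2520, 3000, "-"),   # opposite strand: excluded
]
print(putative_operon(example_genes, "mtase"))
```

In this toy case the neighborhood extends leftward through two closely spaced same-strand genes and stops at the first gap of 70 nucleotides or more. Proteins from many such neighborhoods would then be clustered, and only associations recurring across more than one phylum retained, as the methods describe.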
Note: Many members, especially those of the METTL14 group, show disruptions of the canonical active site residues. Sequence features annotation Str-3 Str-4 Str-5 Str-6 Str-7 Synapomorphic strand conservedK Str-1 Str-2 RES PLPPQWINCDLRRFDYSVL-------------------------GKFHVIMADPPWDIHM------------------------SLP-------------YGTMTDDE-----MRAMPIPALQ-DE-GLLFLWVTGR--------AMEVGRE----------CLR----VWGYTRVD--EVVWVKTNQ-------------------------LQRVIRTGR----------------TGHWLNHTKEHMLVGIKNPPGVTQGSNT-------------------------------------GETPTLKFPSWI-----NRGLDT------------------DVIVSEV------------------------RETSRKPDE---------VYNMIERMCP------------------------------GGRKVEIF-GRKHN------VRPGWITLG-----------NQLGNVD ALIGN ------EE----E---------------------------------EEEEE------------------------------------------------------HHH-----HH---HH--------EEEEEE------------HHHHHH----------HHH----H---HHHH--HEEEEEE-------------------------------------------------------EE---HHHEEEH-------------------------------------------------------------------------------------------EEEEEE-------------------------------------------HHEHHHH------------------------------------HHEHH-H---------------EE-------------------- HMM ------E----------------------------------------EEEEEE---------------------------------------------------HHHH-----HHHHHHHHHH-----EEEEEE-----------HHHHHHH----------HHH----HHHHHEEE--EEEEEEEE----------------------------EEEEE--------------------EEEE---EEEEEEEE---------------------------------------------------------EEE-----EE----------------------EEEEEE-------------------------------HHH---------HHHHHHHH-----------------------------------EEEEE-E--------------EEEEE-----------E------ FREQ 
------EE--HHHHHHHH----------------------------EEEEE------------------------------------------------------HHH-----HHH-------------EEEEEHHH--------HHHH-HH----------HHH----HH---EEE--EEEEEHH----------------------------EEEEEE--------------------EEE----EEEEEEE-----------------------------------------------------------------------E------------------EEEEEEE------------------------E-----------------EEEEEEE------------------------------------EEEEE----------------EEE-------------------- PSSM -----------------------------------------------EEEEE---HHHH----------------------------------------------HHH-----HH-------------EEEEEE-----------HHHHHHH----------HHH----H---EEE----EEEEEE--------------------------------------------------------------EEEEEEE----------------------------------------------------------------------E------------------EEEEE----------------------------------H---------HHHHHHHH-----------------------------------EEEEE-----------------EEEE------------------ FINAL ----------HHHHH-------------------------------EEEEEE-----------------------------------------------------HHH-----HHH------------EEEEEE--H--------HHHHHHH----------HHH----H---EEEE--EEEEEEE----------------------------EEEEEE--------------------EEE----EEEEEEEE----------------------------------------------------------------------E------------------EEEEEEE------------------------E-------H---------HHHHHHH------------------------------------EEEEE----------------EEEEE------------------ CC1G_14583_Coprinopsis_cinerea_okayama7#130_299747281 PLPPQWINCDLRRFDYSVL-------------------------GKFHVIMADPPWDIHM------------------------SLP-------------YGTMTDDE-----MRAMPIPALQ-DE-GLLFLWVTGR--------AMEVGRE----------CLR----VWGYTRVD--EVVWVKTNQ-------------------------LQRVIRTGR----------------TGHWLNHTKEHMLVGIKNPPGVTQGSNT-------------------------------------GETPTLKFPSWI-----NRGLDT------------------DVIVSEV------------------------RETSRKPDE---------VYNMIERMCP------------------------------GGRKVEIF-GRKHN------VRPGWITLG-----------NQLGNVD 
NEMVEDRAFT_v1g33607_Nematostella_vectensis_156398086 LYPPQWISCDVRSLQMDVL-------------------------GKFSVIMADPPWDIHM------------------------ELP-------------YGTMSDDE-----MRNLSVPSLQ-DN-GYIFLWVTGR--------AMELGRE----------CLE----IWGYERCD--ELIWVKTNQ-------------------------LQRLIRTGR----------------TGHWINHGKEHCLIGVK------------------------------------------------GDTTGF-----------NRGMDC------------------DV------------------------------------------------------------------------------------------------------------------------------------LV PF07_0123_Plasmodium_falciparum_3D7_124512114 VYGPQWIRCDLRNFDLSIF-------------------------KYVSVVMADPPWDIHM------------------------DLP-------------YGTMTDNE-----MKLLPVQLIQ-DE-GMIFLWVTGR--------AMELARE----------CLQ----IWGYKRVE--EILWVKTNH-------------------------LQRIIRTGR----------------TGHWLNHSKEHCLVGIK------------------------------------------------GNPII------------NRNIDC------------------NVIVSEV------------------------RETSRKPDE---------IYSLIERLCP------------------------------QNLKIELF-GRPHN------CRSNWITLG-----------NQLNGVV TGFOU_217350_Toxoplasma_gondii_FOU_672280105 EYPAQWIRCDIRTFDFSIF-------------------------KLIRVVMADPPWDIHM------------------------DLP-------------YGTMTDQE-----MRSLRVDLIQ-EE-GLLFLWVTGR--------AMELARE----------CLQ----LWGYRRVE--EILWVKTNQ-------------------------LQRIIRTGR----------------TGHWLNHSKEHCLVAVK------------------------------------------------GNMAF------------NRNIDC------------------DVIVSEV------------------------RETSRKPDEIYRQGEAR-HRGMIERMAP------------------------------DSLKVELF-GRMHN------VRNNWITLG-----------NQLKGVK LOC100641238_Amphimedon_queenslandica_340369522 
LVPPQWLNCDLRNFDTSVL-------------------------GKFAVVMADPPWDIHM------------------------ELP-------------YGTMSDDE-----MRQLDIPSLQ-DD-GFIFLWVTGR--------AMELGRE----------CLT----LWGYERID--ELVWVKTNQ-------------------------LQRLIRTGR----------------TGHWINHGKEHCLVGAK------------------------------------------------GNLQGV-----------NRGIDT------------------DVIVAEV------------------------RATSRKPDE---------IYGVIERLSP------------------------------GTRKIELF-GRQHN------CQPNWLTLG-----------NQLEGDN EMIHUDRAFT_211665_Emiliania_huxleyi_CCMP1516_551562135 HYESQFVNCDIRTFPMQTL-------------------------GKFPVIMADPPWDIHM------------------------ELP-------------YGTMSDDE-----MRRMNVQVLQ-DD-GVLFLWVTGR--------AMELGRE----------CLE----IWGYRFVQ--ELLWVKTNQ-------------------------LQRIIRTGR----------------TGHWINHSKEHCLIGVK------------------------------------------------GDLDDRF----------NQNLDC------------------DVICAEV------------------------RETSRKPDE---------MYDLLERLAP------------------------------GQRKLELF-GRPHN------VHKGWTTLG-----------NQLGKTQ _Danio_rerio_597501008 LFPSQWICCDIRYLDVSIL-------------------------GKFAVVMADPPWDIHM------------------------ELP-------------YGTLTDDE-----MRKLNIPILQ-DD-GFLFLWVTGR--------AMELGRE----------CLS----LWGYDRVD--EIIWVKTNQ-------------------------LQRIIRTGR----------------TGHWLNHGKEHCLVGVK------------------------------------------------GNPQGF-----------NRGLDC------------------DVIVAEV------------------------RSTSHKPDE---------IYGMIERLSP------------------------------GTRKIELF-GRPHN------VQPNWITLG-----------NQLDGIH METTL3_Homo_sapiens_21361827 
LFPPQWICCDIRYLDVSIL-------------------------GKFAVVMADPPWDIHM------------------------ELP-------------YGTLTDDE-----MRRLNIPVLQ-DD-GFLFLWVTGR--------AMELGRE----------CLN----LWGYERVD--EIIWVKTNQ-------------------------LQRIIRTGR----------------TGHWLNHGKEHCLVGVK------------------------------------------------GNPQGF-----------NQGLDC------------------DVIVAEV------------------------RSTSHKPDE---------IYGMIERLSP------------------------------GTRKIELF-GRPHN------VQPNWITLG-----------NQLDGIH Dmel_CG5933_Drosophila_melanogaster_21355141 LYPPQWIQCDLRFLDMTVL-------------------------GKFAVVMADPPWDIHM------------------------ELP-------------YGTMSDDE-----MRALGVPALQ-DD-GLIFLWVTGR--------AMELGRD----------CLK----LWGYERVD--ELIWVKTNQ-------------------------LQRIIRTGR----------------TGHWLNHGKEHCLVGMK------------------------------------------------GNPTNL-----------NRGLDC------------------DVIVAEV------------------------RATSHKPDE---------IYGIIERLSP------------------------------GTRKIELF-GRPHN------IQPNWITLG-----------NQLDGIR _Saccharomyces_cerevisiae_S288c_1174426 ALPAQWIRCDVRKFDFRVL-------------------------GKFSVVIADPAWNIHM------------------------NLP-------------YGTCNDIE-----LLGLPLHELQ-DE-GIIFLWVTGR--------AIELGKE----------SLN----NWGYNVIN--EVSWIKTNQ-------------------------LGRTIVTGR----------------TGHWLNHSKEHLLVGLK------------------------------------------------GNPKWI-----------NKHIDV------------------DLIVSMT------------------------RETSRKPDE---------LYGIAERLAGT-----------------------------HARKLEIF-GRDHN------TRPGWFTIG-----------NQLTGNC AT4G10760_Arabidopsis_thaliana_15236910 
LGEAQWINCDIRSFRMDIL-------------------------GTFGVVMADPPWDIHM------------------------ELP-------------YGTMADDE-----MRTLNVPSLQ-TD-GLIFLWVTGR--------AMELGRE----------CLE----LWGYKRVE--EIIWVKTNQ-------------------------LQRIIRTGR----------------TGHWLNHSKEHCLVGIK------------------------------------------------GNPEV------------NRNIDT------------------DVIVAEV------------------------RETSRKPDE---------MYAMLERIMP------------------------------RARKLELF-ARMHN------AHAGWLSLG-----------NQLNGVR PTSG_03395_Salpingoeca_rosetta_514696822 TFPAQWIQCDVRYIDFSVL-------------------------GKFSVIMADPPWRINM------------------------ELP-------------YGTMSDEE-----MRQLPVQDLQ-DN-GVIFLWVTAR--------CVDLGRE----------LLK----RWGYNYAN--DLIWIKINQ-------------------------LQNLVRTGR----------------TGHWMNHAKEHCMIGVK------------------------------------------------GNLDGI-----------YPGIDC------------------DVLVSEV------------------------RDTSRKPDE---------IYGLIERLSP------------------------------GTRKIELF-GRPHN------VQSNWLTLG-----------DQLQGVQ TTHERM_00962190_Tetrahymena_thermophila_SB210_586734236 KLNPQWINCDLRQIDFNIL-------------------------GKFNCIMADPPWDIHM------------------------TLP-------------YGTLKDRE-----MKAMRVDLLQ-EE-GVIFLWVTGR--------AMELGRE----------CLT----NWGYRRVE--EIIWVKTNQ-------------------------LQRIIRTGR----------------TGHWLNHSKEHCLVGIK------------------------------------------------GNPKI------------NRKIDC------------------DVIVSEV------------------------RETSRKPDE---------IYNLIERMCP------------------------------GGKKIELF-GRPHN------TMPGWLTLG-----------NQLPGIY GSPATT00017263001_Paramecium_tetraurelia_strain_d4-2_145529029 
SMPPQWINCDLRIFDFRVL-------------------------GKFDVIMADPPWDIHM------------------------NLP-------------YGTLKDKE-----MKALRVDLLQ-ND-GIIFLWVTGR--------AMELGRE----------CLI----LWGYRRVE--ELVWIKVNQ-------------------------LHRIIRTGR----------------TGHWLNHSKEHCLIGIK------------------------------------------------GNPQL------------IKGLDC------------------DVIVSEV------------------------RETSRKPDE---------VYGIINRMCP------------------------------NGKKVELF-GRPHN------CRPNWITLG-----------NQLPGVY ACA1_074420_Acanthamoeba_castellanii_str_Neff_470518935 WADRTFINCDLRYYNLASL-------------------------GKFDAILIDPPWRIKGNQLISNEKTMFNNSKW--------GLS-------------YGTMSNDE-----IIDIDVGCLS-DK-GFIFLWVINS--------QIEFGFK----------CLQ----KWGYTYVD--RITWVKKTA-------------------------SGNIAIS------------------QGYYFLHSSEICLVGVKYDAK--------------------------------------------GKSLEF-----------ISKTSN------------------DLLFAEI------------------------REKSRKPDQ---------LYHIIERMVP------------------------------GGRKVEIF-ARNHN------MRPGWLSLG-----------NQLGEYY SPRG_03347_Saprolegnia_parasitica_CBS_22365_641538296 PTTAIAIACDVTTYDVARL-------------------------GTFDAIVMDPPWEINL------------------------QLP-------------YTTLSDEA-----IGALAIPALQ-TA-GWIFLWVATG--------KLVVGRQ----------LLR----QWGYTVVD--DIVWIKIDQ-------------------------LQHVAHQGR----------------TGHWLNHSQEHCLVGRK------------------------------------------------GLAPS------------AARLDC------------------DVIVAAP------------------------RENSRKPDE---------LYHLVERVVP------------------------------AGRKLELF-GRRHN------LRDGWTTLG----------------DQ CHLREDRAFT_128290_Chlamydomonas_reinhardtii_159466562 
ALDPQWINCDVRSFDMTVL-------------------------GKFGVIMADPPWEIHQ------------------------DLP-------------YGTMKDDE-----MVNLNVGCLQ-DN-GVLFLWVTGR--------AMELARE----------CMA----KWGYKRVD--ELIWVKTNQ-------------------------LQRLIRTGR----------------TGHWLNHSKEHCLVGIK------------------------------------------------GSPQL------------NRYVDT------------------DVVVAEV------------------------RETSRKPDE---------MYSLLERLSP------------------------------GTRKLEIF-ARVHN------CKPGWVGLG-----------NQLKNVN NEMVEDRAFT_v1g95490_Nematostella_vectensis_156393637 PSMSSFLNSDATKLQPVIEHGKVKI-------------------VPFDLIVIDPPWYNK-------------------------SAKRKRM---------YSFMSLWQ-----IKALPVPELIAPG-GLLAVWVTNKAK------YIRFTRSE---------LLP----SWGVDVIA--EWHWIKVTK-------------------------TGEYVVG------------------MESAHKKPYETLIIGRLPILPGASID---------------------------------------GGVKQ------------V--PEH------------------QVICSVPC-----------------------LKHSRKPPL---------GDVFKDFLPR------------------------------HPHCLEMF-AR--N------LTPGWTSWG-----------NQVLKFQ GSPATT00037207001_Paramecium_tetraurelia_strain_d4-2_145499669 NHPPNYIKADLRTFDLQQL-------------------------GKFDVILIDPPWAEYAKRLMQANMQV--------------KEH-------------QQSWTLEE-----LKQLHIDKIADIP-SFIFLWCGSE--------HLDDGRE----------LFK----TWGFKRCE--DIVWLKTNKDHSK---------------------QNQYVAGQDY---------------GDNLFRRVKEHCLVGLR------------------------------------------------GDVKRASDQHFI-----HANIDT------------------DVIITEEEV----------------------MGSTKKPEE---------LYEIIERFCL------------------------------GRKRIELF-GEIHN------IRDGWLTIG-----------TQLRDTR NEMVEDRAFT_v1g224635_Nematostella_vectensis_156328704 
ATPPMYLRCDLETFALHDLD------------------------NKFDVILVDPPLEEYQRRHAGV------------------SFN-------------FKPWTWDD-----IMKLDIEEVAAQR-SFIFLWCGSHE-------GLTEGRKVQHLKLKSDMCLR----KWGFRRCE--DICWIKTNKTNP----------------------GNTKYLE------------------PIAIFQHTKEHCLMGIR------------------------------------------------GTVRRSTDGDFI-----HANVDI------------------DLIITEECK----------------------GV--------------------------------------------------------------------------------------------------VTRDN CHLREDRAFT_174824_Chlamydomonas_reinhardtii_159474530 PPHCVPIHANVTTFDWPSLYSH----------------------AQFDVIMMDPPWQLA-------------------------TANPTRGVALG-----YSQLNDDH-----ISRLPVPQLQRQG-GYLFVWVINA--------KYKWTLD----------LFD----RWGYRLVD--EVVWVKMTV-------------------------NRRLAKS------------------HGYYLQHAKEVCLVAKR------------------------------------------------GNPPVPPGC--------EGGVGS------------------DIIFSER------------------------RGQSQKPEE---------IYHLIEQLVP------------------------------NGRYLEIF-ARKNN------LRNYWVSIG-----------NEVTGTG PHYPADRAFT_206270_Physcomitrella_patens_168011388 PKSCTFLISDISEVHRLI--------PGD--S-K----------DGFNLMVIDPPWENK-------------------------SVHRKSL---------YPTLPNKY-----LLSLPVKQLAHADGALVALWITNRE-------KLRHFAETE--------LFP----AWGVKMAA--VWYWLKVTV-------------------------EGTMVSP------------------LDLAHHKPYECLLLGYLPSKIGSTSEESVQFT---------------------------------GSREH------------ADLPDK------------------FVLISIP------------------------GDHSRKPPL---------KSLLSKHIPGQRH---------------------------AERGLELF-AR--E------LSAGWTSWG-----------NEPLRFQ CELE_C18A3.1_Caenorhabditis_elegans_17531953 
PPKSTFHVGDVKDIEQYS--------RAH--D------------LLFDLIIADPPWFSK-------------------------SVKRKR----------TYQMDEEV-----LDCLDIPVILTHD-ALIAFWITNRIG------IEEEMIE----------RFD----KWGMEVVA--TWKLLKITT-------------------------QGDPVYDF-----------------DNQKHKVPFESLMLAKK------------------------------------------------KDSMR------------KFELPE------------------NFVFASVPM----------------------SVHSHKPPLLDLLRHF--GIEFTEPL-------------------------------------ELF-AR--S------LLPSTHSVG-----------YEPFLLQ AT1G19340_Arabidopsis_thaliana_18394726 PRNSCFYMSDLHHIRNLV--------PAK--S-E----------EGYNLIVIDPPWENA-------------------------SAHQKSK---------YPTLPNQY-----FLSLPIKQLAHAEGALVALWVTNRE-------KLLSFVEKE--------LFP----AWGIKYVA--TMYWLKVKP-------------------------DGTLICD------------------LDLVHHKPYEYLLLGYHFTELA-------------------------------------------GSEKRSDF---------KLLDKN------------------QIIMSIP------------------------GDFSRKPPI---------GDILLKHTPGSQ----------------------------PARCLELF-AR--E------MAAGWTSWG-----------NEPLHFQ mettl4_Danio_rerio_189522093 PPRCRFLLSDVTRMDPLV--------NSG---------------DKFDLIVLDPPWENK-------------------------SVKRSNR---------YSSLPSSQ-----LKKLPVPALAAPG-GLVVTWVTNRAK------HRRFVREE---------LYP----HWAVEVLA--EWLWVKVTR-------------------------SGEFVFP------------------LDSQHKKPYEVLVLGRC------------------------------------------------RSTSD------------HTDRCSAVNELPDQ----------RLLVSVPS-----------------------TLHSHKPSL---------AAVLKPYIRR------------------------------EPRCLELF-AR--S------LQSDWSCWG-----------NEVLKFQ Dmel_CG7818_Drosophila_melanogaster_19920926 
ASAPMYLKADLKSLDVKT--L----------G------------AKFDVILIEPPLEEYARAAPSVATVG--------------GAP-------------RVFWNWDD-----ILNLDVGEIAAHR-SFVFLWCGSSE-------GLDMGRN----------CLK----KWGFRRCE--DICWIRTNI-------------------------NKPGHSKQLE---------------PKAVFQRTKEHCLMGIK------------------------------------------------GTVRRSTDGDFI-----HANVDI------------------DLIISEEEE----------------------FGSFEKPIE---------IFHIIEHFCL------------------------------GRRRLHLF-GRDSS------IRPGWLTVG-----------PELTNSN GSPATT00032234001_Paramecium_tetraurelia_strain_d4-2_145486788 --IKSYINCDIRYFNIDF--------LVE--K------V-----GGFDVVLMDPPWRIKGGQQNDSSFMFTNSKF---------SLD-------------YNTMSNQE-----IMDIKIEKLS-KK-GFLFLWILNT--------QLNIAYE----------MAS----KWGYEIVD--QIIWVKLNPQ------------------------GNNVYLS------------------TGYYFMHSFEICLVGYTN-----------------------------------------------KHVEY------------HSKISN------------------NIIFSPV------------------------RNKSQKPIE---------LYEIIELMMP------------------------------GSKKVEIF-ARNHN------LRHGWFSIG-----------NQLGETF GSPATT00027481001_Paramecium_tetraurelia_strain_d4-2_145473723 --VKSYINCDIRYFNLDF--------LVE--K------V-----GGFDVVLMDPPWRIKGGQQNDSSFMFTNSKF---------SLD-------------YNTMSNQE-----IMDIKIEKLS-KK-GFLFLWILNT--------QLNIAYE----------MAS----KWGYEIVD--QIIWVKLNPQ------------------------GNNVYLS------------------TGYYFMHSFEICLVGYK------------------------------------------------CPPGEHVEY--------HSKISN------------------NIIFSPV------------------------RNKSQKPIE---------MYEIIEIMMP------------------------------GAKKVEIF-ARNNN------LRHGWFSIG-----------NQLGETY METTL14_Homo_sapiens_24308265 
NTPPMYLQADIEAFDIRE--L----------T------------PKFDVILLEPPLEEYYRETGI-------------------TAN-------------EKCWTWDD-----IMKLEIDEIAAPR-SFIFLWCGSGE-------GLDLGRV----------CLR----KWGYRRCE--DICWIKTNKNNP----------------------GKTKTLD------------------PKAVFQRTKEHCLMGIK------------------------------------------------GTVKRSTDGDFI-----HANVDI------------------DLIITEEPE----------------------IGNIEKPVE---------IFHIIEHFCL------------------------------GRRRLHLF-GRDST------IRPGWLTVG-----------PTLTNSN Dmel_CG14906_Drosophila_melanogaster_24647514 PNQSRFFNHNVDNLPALL-----HQ-LL----------------PAYDLIVLDPPWRNKYIRRLKRAKP---------------ELG-------------YSMLSNEQ-----LSHIPLSKLTHPR-SLVAIWCTNST-------LHQLALEQQ--------LLP----SWNLRLLH--KLRWYKLST-------------------------DHELIAPPQ----------------SDLTQKQPYEMLYVACR------------------------------------------------SDASENYG---------KDIQQT------------------ELIFSVPS-----------------------IVHSHKPPL---------LSWLREHLLLDKDQL-------------------------EPNCLELF-AR--Y------LHPHFTSIG-----------LEVLKLM NAEGRDRAFT_72415_Naegleria_gruberi_strain_NEG-M_290979461 PKHCVPIRTDVRNMNWKA--------LAR--V------------AQFDVILMDPPWQLA-------------------------TSNPLRGVAIS-----YKPLSDKH-----IQSMDISSLQEKNGGFLFVWVINA--------KYVKTLE----------MIE----KWGYKFVD--EITWVKQTK-------------------------HRRLAKG------------------HGYYLQHAKENCIIAIK------------------------------------------------NTTPEREKEMMEAA---RQKVTSLC----------------DVILSDR------------------------RGQSQKPED---------LYHFIEQMVP------------------------------DGKYLEIF-GRRNN------LRDYWVTIG----------------NE NAEGRDRAFT_30463_Naegleria_gruberi_strain_NEG-M_290998263 
FKGGHYINCDLRYFTLS--------------S------L-----GKFDVILIDPPWRVIQSRPQEAMMFSNTNF----------KLN-------------YNTLSYEE-----IMDINVGSLC-DQ-GFCFMWVLNS--------SLQFGLN----------LLN----HWGFSYID--KITWIKKTK-------------------------NDQIFAG------------------TGYYFLHSTELLLVGVKH-----------------------------------------------GSTKKNGQKLQY-----ISKITN------------------DILFSKV------------------------GIQSQKPNE---------VYEIIESMVP------------------------------GARKIEIF-ARNHN------IRKGWLSIG-----------NRLGEAF CC1G_11190_Coprinopsis_cinerea_okayama7#130_299741172 SLPPSYLPYSQLSTLNS---------------------------SKFDVILLDPPFS--------------------------------------------SSFTWDN-----LQELPIPSLAADP-SFVFLWVGSGAGE-----GLERGRE----------VLA----KWGYRRCE--DVVWVKTNK-------------------------TTNQGPGTDPP--------------TTSLLTRTKQHCLMGIR------------------------------------------------GTVRRSTDSWFV-----HCNVDT------------------DVIIWEGDP----------------------TDPTRKPPE---------MYTLIENFCL------------------------------GIRRLEIF-GRPSS------LRRGWVTVL---------------GPN AT4G09980_Arabidopsis_thaliana_145340055 ASAPMYLKGDLHEVELSPELF----------G------------TKFDVILVDPPWEEYVHRAPGV------------------SDS-------------MEYWTFED-----IINLKIEAIADTP-SFLFLWVGDGV-------GLEQGRQ----------CLK----KWGFRRCE--DICWVKTNKSNA----------------------APTLRHD------------------SRTVFQRSKEHCLMGIK------------------------------------------------GTVRRSTDGHII-----HANIDT------------------DVIIAEEPP----------------------YGSTQKPED---------MYRIIEHFAL------------------------------GRRRLELF-GEDHN------IRAGWLTVG-----------KGLSSSN EAG_10107_Camponotus_floridanus_307172265 
PKKCTFYCYDVRDIDKKI--------ELN---------------NQYDFILLDPPWWNKSIRRKKIKCA---------------EAS-------------YKMMYNEE-----LIKIPIRKLLHSN-GIVAIWCTNSSN------HLNSIFNE---------IFP----SWGITYRA--KWYWIKVTQ-------------------------AGDTICNFNLA--------------PG---KQPFELLILGSA------------------------------------------------LEEDK------------VNIPDA------------------KLMISIPS-----------------------AVHSHKPPL---------TEIIKDYLPN------------------------------EPKCLEIF-AR--Y------LLPGWTSWG-----------LEILKFQ Ot11g01290_Ostreococcus_tauri_308809243 PPQCIPVHANVTTYDWRP--------MYE--H------------EQFDVIMMDPPWQLATANPTRGV-----------------SLG-------------YSQLTDQD-----IANLPLPQLQ-KN-GLLFVWVINA--------KYQWCLN----------QFK----KWGYEFVD--EIVWVKVTN-------------------------SRRLAKS------------------HGFYLQHAKEVCLVARR------------------------------------------------GDTPPGLK---------DKAIGS------------------DIIFAPR------------------------RGQSQKPTE---------IYELIEELVP------------------------------NGRYLEIF-ARKNN------LRDFWVSVG-----------NEVTGTG SINV_06005_Solenopsis_invicta_322796786 PRKCTFYSYDVRDIEKKI--------ELS---------------NQYDFILLDPPWWNKSIRRKKMKCA---------------EAS-------------YKMMYNEE-----LVKIPIKKLLHSN-GIVAVWCTNSSN------HLNSIINE---------IFP----SWGIIYRA--KWYWLKVTQ-------------------------AGDTICNFNSA--------------PG---KQPYELLVLGTA------------------------------------------------LEKGK------------IDIPGG------------------KLMISVPS-----------------------AVHSHKPPLTDLFI----FLEIIKDYLPD-----------------------------EPKCLEIF-AR--Y------LLPGWTSWG-----------LEILKFQ gp5_EBPR_siphovirus_4_337731296 
--------MSSTE-EFIA--L----------R------P-A---GGFSLIMADPPWSYEMRSEKGYAKAP--------------EAQ-------------YATMPLAE-----IAAMPVELLAAED-CLLWLWAVNP--------QLPQALE----------VLV----AWGFTFKT--AGTWLKRST-------------------------RGKVSFG------------------TGYILRSANEPFLIGAR------------------------------------------------GRPKT------------TRATRS------------------AVITRDERLRGMEDNWPLGTITIEAAG----REHSRKPDE---------AYVACEELMP------------------------------GARRLDLF-SR--Q------RRQGWVSWG-----------NEVGKFE METTL4_Homo_sapiens_145275206 PPKSSFLLSDISCMQPLL--------NYR---------------KTFDVIVIDPPWQNK-------------------------SVKRSNR---------YSYLSPLQ-----IQQIPIPKLAAPN-CLLVTWVTNRQK------HLRFIKEE---------LYP----SWSVEVVA--EWHWVKITN-------------------------SGEFVFP------------------LDSPHKKPYEGLILGRVQEKTALPLRN--------------------------------------ADVNV------------LPIPDH------------------KLIVSVPC-----------------------TLHSHKPPL---------AEVLKDYIKP------------------------------DGEYLELF-AR--N------LQPGWTSWG-----------NEVLKFQ LOC100633541_Amphimedon_queenslandica_340370562 PPSSSFLLSDISKIQLLK--------RFS--RYAD---A-----NGYNIIVLDPPWENRSAIR---------------------GGK-------------YKWLDKED-----ISQLPIPELIAPG-GLVALWVTNKRQ------LVQWTVQE---------LLP----KWGLEYIG--EWLWIKVTT-------------------------EGDFVFD------------------VDSVHKKPYESLIIGKRPSLPADSDPTPPPAKAQRLTESHCIPPSNPLMLKDLS-----------GDKSVQSLSADSASSV-KSRSGD------------------EILVDNVSVITGSCKPPSQYSLMCIPS----TTHSQKPYLGDILQ----LYAESKDM----------------------------------KCLELF-AR--N------LLPGWTSWG-----------NEVLQFQ LOC100635916_Amphimedon_queenslandica_340382361 
AIPPVYYKVDLSSFDLTS--L----------D------------AKFDVILIDPPLEEY-------------------------QRRTTGITYP------WQPWDFEE-----IMNLKIEDVSAPR-SFVFLWCGSCE-------GLDLGRE----------CLK----KWGFRRCE--DICWVKTNMNDP----------------------GNTTHLE------------------QKSIFQHTKEHCLMGIK------------------------------------------------GTVRRNQDGHFI-----HANIDL------------------DIIISEEPE----------------------MGNNDKPEE---------IFHIIEHFCL------------------------------GRKRLHLF-GNDAT------VRPGWLTLG-----------PNLSSSN mettl14_Danio_rerio_46309507 NTPPMYLQADPDTFDLRE--L----------K------------CKFDVILIEPPLEEY-------------------------YRESGIIAN-------ERFWNWDD-----IMKLNIEEISSIR-SFVFLWCGSGE-------GLDLGRM----------CLR----KWGFRRCE--DICWIKTNKNNP----------------------GKTKTLD------------------PKAVFQRTKEHCLMGIK------------------------------------------------GTVRRSTDGDFI-----HANVDI------------------DLIITEEPE----------------------MGNIEKPVE---------IFHIIEHFCL------------------------------GRRRLHLF-GRDST------IRPGWLTVG-----------PTLTNSN CAOG_07090_Capsaspora_owczarzaki_ATCC_30864_470293128 APGARWLVADILRSNLAE--L-T--------G------------QTFEGILIDAPLARS-------------------------GEPAT-----------PGMVTVDE-----LKAAGISPALIPR-GFIFMWAEKE--------WIPDLLE----------VAQ----AWGFHYVE--NICWVRHNI-------------------------NNKVSRE------------------DSRFFRKSKLLLLIFRN------------------------------------------------FDIGA------------K----------------------------QV------------------------AIPEEKPQP---------IYSTIETLLPNANATALPDGSIG-----------------PGKLLELS-WRAQPF-----LRRGWTTIS---------------HSQ ACA1_156200_Acanthamoeba_castellanii_str_Neff_470390089 
IPGSRYVEATIEELELEK--L-----------------------GSFEAILMDPPWDLSPRASASDKARQCHQLVPKRG-----KNG-------------ERLISPEE-----LGKWPVTDKLIPK-GFLFIWTEKE--------LIPRVLT----------MAQ----KWGFHYVE--NFAWVKFDV-------------------------NNKIHTE------------------DYAYFRKSKLTLLMLRKP-----------------------------------------------GDIEL------------RHQRNP------------------DVKFDFV------------------------RKGRERLPF---------VFDIIETLLPTAKYNPKTG---------------------KGKMLQLW-CLPQE------RRSGWTSVH----------------LS ACA1_219460_Acanthamoeba_castellanii_str_Neff_470419934 PIGSTYLEGDILKMELKN--Y-----------------------GEFEAILMDPPWHTG-------------------------AKDDPARL--------PGTVTPEE-----LGKLKITDALLPK-GLAFVWVEKE--------LIPKVFA----------LMK----KWNFIYVE--NFAWVKKSV-------------------------NNKFVSQ------------------PYKYFQKSKTTLFIFRKFTAE--------------------------------------------GKDQLEL----------RHQRNP------------------DV-----------------------------------PDF---------TYHVIETLLPNAAYKEDAG---------------------RGKLLELW-GRAGSQ-----RRTGWT-------------------TI ACA1_366350_Acanthamoeba_castellanii_str_Neff_470407427 PPHCVPIRADVHNPSLRSSLASAWASKQVDGQQALGKQ------VQFDVIVMDPPWQLA-------------------------GSAPTRGVALG-----YKQLHNKD-----IEKIPIPLLQ-TN-GFLFIWVINA--------RYAFALD----------LME----KWGYRFVD--DIAWVKATV-------------------------NRRMAKG------------------HGFYLQHAKETCLVGLK------------------------------------------------GEDPPNM----------RGNRCS------------------DVIFSER------------------------RGQSQKPEE---------IYHIMEALVP------------------------------NGRYLEIF-ARRNN------LRNHWVSVG----------------LE ACA1_149840_Acanthamoeba_castellanii_str_Neff_470510758 
AAPLVVVFVAGQQQQQKQEEEEEQEAGGE--G-EGEAEV--A--GGFGLVVVDPPWENR-------------------------SLSRSHN---------YGTLAPHE-----IAKLPVRSLLSSAGAYVVVWVTNNP-------AIHNFVKRN--------LFP----RWRVQYVA--THYWLKLTS-------------------------SGEPVMP------------------LNSAHRKPYEEL----------------------------------------------------------------------AGLRKE------------------MVVCSVA------------------------NRYARKPS------------------------------------------------------------------------------------------LDAVAAA PFL1715w_Plasmodium_falciparum_3D7_124806530 STPARYIRCDLRTFDLGS--L----------D------------TKFDVILIDPPWKEYYDRKIHNLHVLNNINLDQDLNNDMNNEK-------------DKFWTLED-----LANIEIEKIAEVP-SFLFIWCGVT--------HLEDARV----------LLN----KWGYRRCE--DICWLKTNI-------------------------NEKNKKNKYLNEINN----------ENSYLQRTTEHCLVGIK------------------------------------------------GAVRRSYDIHLI-----HANLDT------------------DVIIAEETEQN--------------------IYDNNKPEE---------LYKIIEKFCL------------------------------GRRKIELF-GTNRN------IRNGWLTLG---------------KHI _Maritimibacter_alkaliphilus_495609045 ---MARMTAN-VVQQFAN--L----------R------P-G---GGFGLIMADPPWRFE-------------------------NFSAKGEGKNATAH--YECTSLDW-----IKSLPVEVLAADN-CLLWLWATNP--------MLREAFE----------VLD----AWDFEFAT--AGTWVKRTV-------------------------HGKVAFG------------------TGYVLRSSNEPFLIGKR------------------------------------------------GKPKAT-----------RSTRSTIPTYCDIDLFEGDWPKSAITIEAVA------------------------REHSRKPDE---------AFAAAEALLP------------------------------DVPRIELF-SR--Q------TRPGWRAWG-----------NQTDKFG TVAG_136190_Trichomonas_vaginalis_G3_154414896 
IKHSASINCDVRTFPFDK--------LGE--I------------TQFDVITMDPPWLIA-------------------------QAGITRGVAIN-----YDQLSTDI-----IGQIPLQKIQ-KN-GYIFVWVIAS--------QLENGIQ----------LLQ----NWGYEFLT--YLNWVKISK-------------------------YGRYMPS------------------HGYYLQHNKETVLIGHK------------------------------------------------GKDPENM----------RPNKFN------------------DLIIQQRS-----------------------LRQSHKPIE---------IYELIERVFP------------------------------NSMYCEIF-ARPHN------LRQGWVSVG----------------LE _Afipia_sp_1NLS2_496698392 ---MTLPAKDLLSFAGQ---------------------------RRFSTILADPPWQFT-------------------------NKTGKVAPEHKRLSR-YGTMKLDE-----IMMLPVADIAAPT-SHLYLWCPNA--------LLPEGLA----------VMK----AWGFNYKS--NIVWHKVRKD------------------------GGSDGRG------------------VGFYFRNVTEVILFGVR------------------------------------------------GKNARTLA---------PGRRQV------------------NLLATRK------------------------REHSRKPDE---------QYEIIESCSP------------------------------GP-FLELF-AR--G------TRKNWATWG-----------NQADDDY _Actinomyces_sp_oral_taxon_175_497433097 AGRSSSEGTDSS-PAIPG--L----------P------P-----GGFATILVDPPWPLQSG-----------------------EKH-------------YRTMSLAR-----IKALPVGALAARD-AHLWLWTTNA--------LLPKAYE----------VAE----AWGFTVRS--PLTWVKFRL-------------------------------GLG----------------GRYQLRNATEQLLFCTR------------------------------------------------GRAPL------------GSRSQP------------------TWFNAPV------------------------TEHSRKPAE---------QFAIIERVSP------------------------------GP-YLELF-ARRRPE-----SNQPWAVWG-----------DQVASDI XAUT_RS18300_Xanthobacter_autotrophicus_501064335 
--------MNG-LWQFGD--L----------K------M-----FGYDLIVADPPWDFELYSEAGEGKSA--------------KAH-------------YGTMKLDE-----IAALRVGDLARGD-CLLLLWCCEW--------MPPAARQR---------VLD----AWGFTYKT--TIIWRKVTR-------------------------AGKVRMG------------------PGYRARTMHEPVIVATV------------------------------------------------GNPKH------------T--PFS------------------SVFDGVA------------------------REHSRKPEA---------FYRMVEAAAP------------------------------KAARADLF-SR--Q------RRDGWDAFG-----------NEVEKFD _Cenarchaeum_symbiosum_503247195 NRTKQNKIPQEILYQKLPN-------------------------RKFDIIYADPPWDYNGKLQYDKTDLYVSTS----------SFK-------------YPTMKTKK-----MMEIPIKKIASSN-SLLFLWATSP--------HLEQAIQ----------LGK----AWGFEYRTV-AFVWDKMNH-------------------------------N------------------PGKYTLSNCELCLLFKH------------------------------------------------GKIPTP-----------RGARNV------------------RQLITIPR-----------------------TEHSRKPVQ---------AMQGIERMFP------------------------------FQKKIELF-AR--E------KYRGWSAWG-----------LDLVLKN ETHHA_RS08455_Ethanoligenens_harbinense_503250901 MSTAKETANNLLQFCGE---------------------------KKYATVYADPPWRFQ-------------------------NRTGKVAPENKKLNR-YPTMDLED-----IKALPVGKIAAEK-SHLYLWVPNA--------LLPDGLE----------VMK----AWGFEYKG--NIIWEKVRKD------------------------GEPDGRG------------------VGFYFRNVTEILLFGIR------------------------------------------------GGNNRTLA---------PARSQV------------------NLIRTQK------------------------REHSRKPDE---------IITIIESCSP------------------------------GP-YLELF-AR--G------DRENWDMWG-----------NQATAEY _Mycobacterium_abscessus_511283520 
MAAPLREVNEPPPLPVTD--------------------------GGFSTILADPPWRFT-------------------------NRTGKVAPEHRRLDR-YSTLSLDE-----ICALGVSDVTADN-AHLYLWVPNA--------LLPDGLR----------VME----EWGFRYVS--NIVWSKVRRD------------------------GLPDGRG------------------VGFYFRNTTELLLFGVR------------------------------------------------GSMRTLQ----------PARSQV------------------NQIVTRK------------------------REHSRKPDE---------QYELIEACSP------------------------------GP-YLEMF-GR--Y------RRPNWAVWG-----------DEANEDV CAOG_04822_Capsaspora_owczarzaki_ATCC_30864_514485079 PPHCVPIKANVLEFDWAS--------LAA--H------------CQFDVIMMDPPWQLA-------------------------SNAPTRGIALT-----YNQLPDAA-----IEDIPIASLQRNG-GFVFVWVINN--------RYAKAFD----------MLK----RWGYRFVD--SIDWVKFTV-------------------------NRRLAKC------------------HGFYLQHAKETCLIGLK------------------------------------------------GDPPPGC----------VGNVAS------------------DVIFSER------------------------RGNSQKPDE---------MYELVEALVP------------------------------NGKYLEIF-GRRNN------LRNYWVTIG----------------NE PTSG_05864_Salpingoeca_rosetta_514690366 PSPCRFLLANIQHLRPHM--------Q-------D---L-----GVFDLIVMDPPWHNG-------------------------SVRRGSR---------YGTMDYDA-----IMDIPIPFLMSPR-CLLALWITNND-------RCATFVHER--------LLP----HWGLKKVT--EWKWLKVTT-------------------------QGEPVFP------------------LSSRHKRPYEVLILATNAPGAFETAPAYGIVHQWQAAMRQQQDEDRREQQQDEEKQQKLETKENEGEKQEGEQHQQVAKCSNHTEHSQ------------------QVKHAPPAVVQHGADAPIALPADLRIAGVPSLVHSEKPPA---------IHRLLVSLLTRGSNTSPLRQQQQQQQRQEEDGVLASTPRQRPRCLEVF-AR--R------LHRHWTSVG-----------NQVFKLQ PTSG_04805_Salpingoeca_rosetta_514693100 
STPPMSIRADPLCLDASS--L----------G------------TSFGVIYIDAPLPEYARRAP--------------------GLK-------------LDTVSWEE-----LGRLDVRGLAGEI-AFVFMWVGCSE-------GLEKGAQ----------LLR----RWGFRRCE--DICWVKTNKQQP----------------------RRRGIME------------------PHSLLQHTKEHCLLGIR------------------------------------------------GAPNRKTEPHIL-----HSNMDV------------------DVIVSEDPP----------------------IESTEKPSE---------IFAVMERMCQ------------------------------SRKRLHLF-AS--GT-----VRPGWVGVG-----------KDLPQTD TVAG_062450_Trichomonas_vaginalis_G3_154413191 IKHSAAIACDVREFPFDK--------LGA--I------------TQFDVITMDPPWLIA-------------------------QASITRGVAIN-----YDQLGTDT-----ITQIPLHKIQ-KN-GYIFLWVIAS--------QLENGIQ----------ILN----KWGYEFLT--YLNWVKISK-------------------------YGRYMPS------------------HGYYLQHNKETVLIGRK------------------------------------------------GRDPENM----------RAEMFD------------------DLIIQQRG-----------------------LRQSHKPVE---------IYELIERVFP------------------------------NSMYLEIF-ARPHN------LREGWVSMG----------------LE RPHASCH2410_RS00155_Rhizobium_phaseoli_515104987 ----MRLFPDL-WPFGD---L----------Q------P-----HSFDFIMADPPWKMQEWSDNGDKSKST-------------QSK-------------YRLMPLDE-----IKAMPVLDLAAPN-CLLWLWATNP--------MLPQALD----------VLH----AWGFTFAT--AGSWMKTTR-------------------------NGKQAFG------------------TGYIFRTSNEPILIGKR------------------------------------------------GEPKTT-----------RSVRSS--------------------FPGLA------------------------REHSRKPEE---------GYREAERLMP------------------------------RARRLELF-SR--T------NRVGWTTWG-----------DEVGKFG H156_RS0101780_Methylococcus_capsulatus_515934135 
TENTLDPAADLLERLGD---------------------------KRFRTILADPPWQFQ-------------------------NRTGKMAPEHKRLNR-YGTMSLEA-----IAGLPVERLTADT-AHLYLWVPNA--------LLLEGLK----------VME----AWGFTYKT--NLVWHKIRKD------------------------GGPDGRG------------------VGFYFRNVTELVLFGVR------------------------------------------------GKNARTLA---------AGRRQV------------------NFLATRK------------------------REHSRKPDE---------MYGIIEACSP------------------------------GP-YLELF-AR--G------ARDRWSVWG-----------NEADENY LOKHON_RS09140_Loktanella_hongkongensis_516541036 --------------------M-----------------------GPFDIILADPPWRFASNSEAKPGRNP--------------RRH-------------YPCMRDEE-----ICALPVARWAAPA-ALLLMWTTSP--------MLDRSMA----------IPR----AWGFRYVS--SLVWTKDRI-------------------------------G------------------TGYWARNRHELVLICKR------------------------------------------------GRFDCP-----------RPAPFA------------------DSVISGQQ-----------------------REHSRKPDA---------LHAQIDAAWP------------------------------EARKLELF-AR--Q------ERPGWTAWG---------------NDT CYME_CME116C_Cyanidioschyzon_merolae_strain_10D_544210442 PPHCIPVRADVRFADWDQ--------IAAAAN------------GNYDVILMDPPWQLA-------------------------TANPTRGVALG-----YNQLSDES-----ILAIPLEKLQ-RC-GLLLIWVINA--------KYRVALQ----------MFE----RWGYRLVD--EIVWVKLTV-------------------------NRRLAKN------------------HGFYLQHAKETCLVGVK------------------------------------------------GNDLSALSTA-------PGMPRP------------------DVILSER------------------------RGQSQKPDE---------LYEWIEALVP------------------------------NGKYIEIF-ARKNN------LRNFWVSIG-----------NEVTGES CYME_CMH026C_Cyanidioschyzon_merolae_strain_10D_544211235 
LNDGYYINCDLRYFNLAY--------LRE--C------V-----GNFDVVLIDPPWRIAGGQRASTPNGPMFTNNHW-------AVN-------------YNTLSNEE-----ILDLDIGCLS-NS-GLCFLWVVSS--------QLPTGMA----------CLS----RWGYEYID--KITWIKKRQ--------------------------GKLHVS------------------HGYHFMHSSELCLIGVK------------------------------------------------RPCEF------------IGKVSN------------------DLIFAEV------------------------REKSRKPDE---------LYHVVETMLP------------------------------GTAKIELF-ARNHN------IRRGWLSLG-----------NELGEQF _Thalassobacter_arenae_544667667 MDMSSNPSQDLRDFLSG---------------------------DSFGCVMADPPWRFT-------------------------NRTGKVAPEHKRLAR-YPTMTVED-----ICALPVSDHLMDR-AHCYMWVPNA--------LLPEGLR----------VLN----AWGFEYKS--NIIWHKIRKD------------------------GGSDGRG------------------VGFYFRNVTEILLFGVR------------------------------------------------GKNIRTLA---------PGRRQV------------------NMMQTRK------------------------REHSRKPDE---------QYELIESCSW------------------------------GP-YLELF-GR--G------IRDGWTVWG-----------NQADADY COCSUDRAFT_36424_Coccomyxa_subellipsoidea_C-169_545366117 ------------MSKHASAPE-----------------------GGYDCIVMDPPWENK-------------------------SAKRSGH---------YPTLPSRH-----LLSIPIARLLNQQGGLLALWVTNRE-------RLRRFVDQE--------LLA----KWGLEQVA--TWFWLKVTN-------------------------SGQLVSP------------------LEVAHRRPYEALLLARLRPSAASGS----------------------------------------DTNEE------------GRAVRN------------------MVFLAVP------------------------GEHSRKPHI---------GSLLAPHLLA------------------------------QPACLEMF-AR--E------LAADWTSWG-----------NEALRFQ TVAG_002370_Trichomonas_vaginalis_G3_123390303 
IKQAAPIKADIRYFDWET--------LGK--I------------CQFDVILMDPPWNIQPAQTTRGV-----------------ELG-------------YELMLESE-----IASMKIPLVQ-TN-GYCFMWVVAS--------FLPVGVS----------MLQ----GWGYKVID--FINWIKTSK-------------------------YGRYRPS------------------NGYFLQHDKETCLVGIK------------------------------------------------GKPLD------------GEDVDIFN----------------DLIIDERG-----------------------ARQSHKPPS---------LYDIIERMFP------------------------------GRLYLEIF-ARAHN------EREGWVSLG----------------LE EMIHUDRAFT_205550_Emiliania_huxleyi_CCMP1516_551588467 PARSSFSLCRLAYWPRLS-------------R-IL---------PRFRCLVVDPPWPSR-------------------------SVQRAGA---------YKVLALEELAEA-LRSLP--ALCDRRGCLICVWMTNAI-------KVQEMVEQT--------LLP----AWGATKVG--LWYWLKLSP--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------DG EMIHUDRAFT_449178_Emiliania_huxleyi_CCMP1516_551613601 -----------------------------------------------------------------------------------------------MEGDEDEVWTPQE-----VMNLRIEAITETP-SFCFLWSGSGV-------SLQWGRA----------CLR----KWGFRRCE--DISWVKSNRAT-----------------------GRNTHFL------------------PDSVVTPTTEHCLVGIK------------------------------------------------GTVRRNYDGHII-----HANVDT------------------DVMLSEEPP----------------------YGSTEKPTE---------LYAIIEHFSN------------------------------GRRRLELF-GEDHN------LRRGWLTLG-----------KGLSSSS HMPREF0742_RS10030_Rothia_aeria_553802969 
MLDPMNTNEEFAPLPTVE--------------------------GGFQTVLADPPWRFT-------------------------NRTGKVAPEHHRLGR-YGTMSLDE-----IKALRVGDVTADN-AHLYLWVPNA--------LLPEGLE----------VMQ----AWGFRYVS--NIIWAKRRKD------------------------GGPDGRG------------------VGFYFRNVTEPILFGVK------------------------------------------------GSMRTLA----------PGRSTV------------------NMIETRK------------------------REHSRKPDE---------QYDLIEACSP------------------------------GP-YLELF-AR--Y------ARPGWSVWG-----------NEASNEI G966_02949_Escherichia_coli_UMEA_3323-1_554729604 ------------MGWFMT--------------------------KKYTLIYADPPWVYRDKAADGNRGA---------------GFK-------------YPVMSVLD-----ICRLPVWDLADEN-CLLAMWWVPT--------QPLEALK----------VVE----AWGFRLMTMKGFTWIKCGSRQ-----------------------PDKLVMG------------------MGHMTRANSEDCLFAVK------------------------------------------------GKLPT------------R--INA------------------GIVQSFTAPR---------------------LEHSRKPDI---------VREKLVQLLG------------------------------DVSRIELF-AR--Q------TSHGFDVWG-----------NQCEDPA AGZ61752.1_Phormidium_phage_MIS-PhV1A_556471807 --------------------------------------------KDYRLIVVDPPWSYSLRETDATHRG---------------RCP-------------YPSMSDEQ-----ILNLPIGAIAHQN-SYLLLWVTNN--------HLPLGFR----------CLE----RWGFTYKS--IFTWVKTTKAST----------------------EEKIAPNMG----------------IGHYGRNCTEHFLIATR------------------------------------------------GNPGSFTS---------HGLTDIR-----------------NIIFAPR------------------------SKHSEKPPE---------FWTIADRLAEHL----------------------------DGPRIELF-ARSSGLF----KREGWDSWG----------------AE RFI_31139_Reticulomyxa_filosa_569355952 
PPHSVPIRADVMHFDFKA--------LAN--E------QLRISGRLFDVIMMDPPWQLA-------------------------SSNPTRGVAIG-----YEQLTDES-----ILALPIPKLQ-SD-GFLLVWTINA--------KYRLALQ----------MFK----KWGYRIVN--DIAWVKQTV-------------------------NRRIARG------------------HGFYLQHAKETCLVGFK------------------------------------------------GEEKKVGF---------VSGVCA------------------DVIYSVR------------------------RGQSQKPVE---------IYEYIERLVP------------------------------NGCYLEIF-GRRNN------LRDYWVTIG----------------NE F442_02656_Phytophthora_parasitica_P10297_570995458 PAGSSFAQRDVRTLHQLA--------L-----------------GQHKLILMDPPWQNK-------------------------SVSRGKR---------YNTFDHTD-----LLRINIPHIADPNECILAVWVTNRPR------YMTYLREQ---------ALP----SWGFTYHA--CWYWLKLSK-------------------------NGELVTP------------------LDSTHRLPVETLLVAYR------------------------------------------------AKDQKHEQLL-------RRRLGEKM----------------RIVVSIP------------------------LRHSWKPPPE--------CFFDKDIMST------------------------------SHRKVELF-AR--E------LRPHWTSIG-----------NEVLKFQ ETSY1_42765_Candidatus_Entotheonella_sp_TSY1_575405212 SNSPHSAADDLLA-------C---G-FPP---------------HSFSTVLADPPWRFT-------------------------NRTGKMAPEHRRLSR-YPTLTLEE-----IADLPLAQLVQPD-SHLYLWVPNA--------LLAEGLD----------VMR----RWGFTYKT--NLVWYKIRRD------------------------GGPDRRG------------------VGFYFRNVTELVLFGVR------------------------------------------------GRMRTLA----------PGRRQE------------------NLLASQK------------------------QEHSRKPDT---------FYDLIERCSP------------------------------GP-YLELF-AR--H------PRPGWHQFG----------------NE GbCGDNIH3_7033_Granulibacter_bethesdensis_CGDNIH3_586601520 
MTKQPDPIAEFRN-------Q-----LNG---------------GNFATVLADPPWRFQ-------------------------NRTGKMAPEHRRLSR-YGTMELPE-----IMALPVSEVTAKT-AHLYLWVPNA--------LLPEGLA----------VMQ----AWGFNYKS--NLVWHKIRKD------------------------GGSDGRG------------------VGFYFRNVTELVLFGVK------------------------------------------------GKNARTEA---------PGRRQV------------------NLLATQK------------------------REHSRKPDE---------FYDIVEACSP------------------------------GP-YLELF-AR--G------TRPGWCAWG-----------NQAEEYD TTHERM_00704040_Tetrahymena_thermophila_SB210_586728217 PDNSIPICSDVTKLNFQA--L-----IDA--Q------M-RHAGKMFDVIMMDPPWQLS-------------------------SSQPSRGVAIA-----YDSLSDEK-----IQNMPIQSLQ-QD-GFIFVWAINA--------KYRVTIK----------MIE----NWGYKLVD--EITWVKKTV-------------------------NGKIAKG------------------HGFYLQHAKESCLIGVK------------------------------------------------GDVDNGRF---------KKNIAS------------------DVIFSER------------------------RGQSQKPEE---------IYQYINQLCP------------------------------NGNYLEIF-ARRNN------LHDNWVSIG----------------NE TTHERM_00136470_Tetrahymena_thermophila_146175568 RTSQSIIECDLRYFDFTY--L-T--------N------I-F---DSFDVVMIAPPF---------------------------------------------DAISIQE-----VFELKVELIS-KQ-GFLFFWAKDV--------PTATSYE----------IMS----KWGYDVID--QIIWVKIDKN------------------------EGKMILEDK----------------PDKYFYNSNEMCLIGVK------------------------------------------------KHPSSKGVEY-------QSKVSN------------------NIIVSHQ------------------------PQNQSCPDQ---------IYDIIDLMMP------------------------------GSKKIELF-TKQKV------IRGGWFGLE---------------HKF RirG_092820_Rhizophagus_irregularis_DAOM_197198w_595481916 
PEWCVPIKADVLTFEWDE--------FAK--E------------CQFDVILMDPPWQLA-------------------------THAPTRGVAIA-----YQQLPDVC-----IEELPIPKLQ-KN-GFLFIWVINN--------KYSKAFE----------MMK----KWGYTYCD--DITWVKQTV-------------------------NRRMAKG------------------HGFYLQHAKETCLMGRK------------------------------------------------GEDPPGC----------NHSISS------------------DVIFSER------------------------RGQSQKPEE---------LYEMIEELVP------------------------------NGNYLEIF-GRKNN------LRDYWVTIG----------------NE RirG_018940_Rhizophagus_irregularis_DAOM_197198w_595497236 ATPPTYLKADLRTFDFKS--L----------G------------TKFDVILIDPPLEEYCRRSPLVA-----------------GSN-------------LDYWDYDE-----IANLKIEDAAATP-SFIFIWSGDAD-------GLDRGRQ----------LLL----KWGYRRCE--DIVWIKTNKKW-----------------------DGSHHIE------------------PRSIFQRTKEHCIMGIK------------------------------------------------GTVRRSTDGHFI-----HCNVDT------------------DVIISEEPHY---------------------GIGTAKPEE---------LYHIIEHFCL------------------------------GRRRLELF-GEDHN------IRPGWLTVG-----------LSLSSSN TTHERM_00558100_Tetrahymena_thermophila_SB210_118378397 NHPPVYLKADLKYYDLSK--L-----------------------GKFDVIMMDPPWKEY-------------------------EERVQGLPIYSQYPEKFNSWDLNE-----IAALPIDEISDKP-SFLFLWVGSD--------HLDQGRE----------LFR----KWGYKRCE--DIVWVKTNKDKT----------------------KEYIELP------------------HSNLLVRVKEHCLVGLR------------------------------------------------GDVKRASDSHFI-----HANIDT------------------DVIVAEEPP----------------------LGSTQKPAE---------IYDIIERFCL------------------------------GRKRLELF-GEVHN------VRQGWLTIG-----------KLLDESN YCL055W_Saccharomyces_cerevisiae_S288c_6319795 
TPFGCKIGIDSIVPTLNHWI------QNE--N------------LTFDVVMI-------------------------------------------------GCLTENQFIYPILTQLPLDRLISKP-GFLFIWANSQ--------KINELTK----------LLNNE--IWAKKFRRSEELVFVPIDK-------------------------KSPFYPGLDQD--------------DETLMEKMQWHCWMCIT------------------------------------------------GTVRRSTDGHLI-----HCNVDT------------------DLSIETKD-----------------------TTNGAVPSH---------LYRIAENFST------------------------------ATRRLHIIPARTGYETPVK-VRPGWVIVS-----------PDVMLDN H556_RS0109535_Brevundimonas_naejangsanensis_636828153 -------MIEPLPA------------------------------GPYSCILADPPWHHA-------------------------SRSPKGQTRRSPSHH-YRTMALAE-----IKALPVADVAAKD-CHLFLWTTGP--------HLQQAFL----------VMN----AWGFRYSSL-AFVWVKRRKQPDGDDDGVLFMD------------RRDLFTG------------------MGYTTRQNAELVLLGRR------------------------------------------------GAPKR------------LSKAIH------------------QIITAPR------------------------QEHSRKPSE---------AHSRIERYCD------------------------------GP-RLELF-AR--A------PRDGWTVWG----------------NE KPNIH27_19120_Klebsiella_pneumoniae_subsp_pneumoniae_KPNIH27_640854256 --------------------------------------------MNYDLIYCDPPWEYG-------------------------NRISNGAACNH-----YSTMSIDD-----LKFLPVRKLAADN-AVLAMWYTGT--------HNREAVE----------LAE----SWGFRVRTMKGFTWVKLNQNAADRFNKALSTGELVDFNDLLEMLDRETRMN------------------GGNHTRSNTEDVLIATR------------------------------------------------GTGLP------------RASASVK-----------------QVVHTCL------------------------GEHSAKPWE---------VRNRLEQLYG------------------------------DVKRIELF-AR--E------EWKGWDRWG-----------NQCNNSI SPRG_10355_Saprolegnia_parasitica_CBS_22365_641530415 
APQRVHFAPDMQLTELG---------------------------MQFDVIVVDPPWAEVA------------------------TSG-------------EAIWTAQD-----LARLDVNGIGAYP-GVLFLWCGSGGTYDGVHSHFDEACD----------LVA----TW-----------WAKVQDV------------------------NERSGHG------------------LV---RRSKELCLLALR------------------------------------------------DHVWRDTSGHFV-----HANVDA------------------DVVIAPT------------------------TAGRAKPAA---------FYEVVERFCL------------------------------GRRRLDVF-GS--T------ARNGWVTLD---------------RFA _Lachnospiraceae_bacterium_3_1_46FAA_496677811 MPAVLFL-LELHRRRKGGYKI-----ENN---------------QKYNIIYADPPWRYQQKRLSGAA-----------------EHH-------------YPTMSVKD-----ICGLKVEEIAAKD-CVLFLWATFP--------QLPEALR----------VIK----AWGFQYKTV-AFVWLKQNKS------------------------GKGWFFG------------------LGFWTRGNAEICLLAIK------------------------------------------------GKPHR------------NSNRVH------------------QFLISPI------------------------RGHSQKPEE---------AREKIVELMG------------------------------DLPRVELF-AR--E------KTEGWDAWG-----------NEVESDI F820_RS0109105_Xylella_fastidiosa_653894579 TKHKANTASDVGRDLLARHGG-----------------------QRFHTILADPPWQFQ-------------------------NRTGKMAPEHKRLSR-YGTMTLDD-----IMMLPVEQLVTDT-AHLYLWVPNA--------LLPEGIK----------VLE----AWGFSYKS--NIVWHKVRKD------------------------GGPDGRG------------------VGFYFRNVTELVLFGVR------------------------------------------------GKNARTLA---------PGRRQV------------------NFLATQK------------------------REHSRKPDE---------FYDIVESCSP------------------------------GP-FLELF-AR--G------PRDGWKVWG-----------NQADKYY ON05_RS35435_Acaryochloris_sp_CCMEE_5410_657200356 
------------MPNTPPLPV-----------------------GAFSLIVVDPPWSYHLRESDKTHRG---------------RCP-------------YPSMTDEE-----IVAMPVSSIAAPD-SYLLLWTTNN--------HLPLAFK----------VME----SWGFEYKA--IHTWVKTTLD------------------------RSKIRYG------------------VGHYGRNATEHVLIGRK------------------------------------------------GKAKTFTALGL------TNIPTA--------------------FQAPL------------------------GQHSQKPEE---------FYQMADRLGDAL----------------------------GGQRIELF-SR--C------PRPGWESWG----------------AE K291_RS0125225_Ensifer_sp_USDA_6670_661268459 --------MTGWFFDPLLP-------------------------LHYEMIVIDPPWGFDLYSKEGAKKSA--------------LAK-------------YELMKDAE-----VRTLPVGKLASMD-CLLYCWATAP--------QLPLAIE----------CVK----AWGFQYKS--ILVWRKTTP-------------------------SGKIRMG------------------TGYRVRTTGEVVVVATL------------------------------------------------GNPKQ------------EAIPQT--------------------IFDGIA-----------------------REHSRKPDE---------LYALCDRVMP------------------------------HARRADVF-AR--E------QREGWHAFG-----------NEVTKFN SS17_3321_Escherichia_coli_O157:H7_str_SS17_666005014 ------------MT------------------------------KKYTLIYADPPWTFRDKATDGQRGA---------------SFK-------------YPVMSLLD-----ICRLPVWELAADN-CLLAMWWVPT--------QPLEALK----------VVE----AWGFRLVTMKGLTWNKCGKRQ-----------------------TDKLVMG------------------MGSTTRANSEDCLFAVK------------------------------------------------GNLPE------------R--INA------------------GIIQSFTAPR---------------------LDHSRKPDM---------AREKLVQLLG------------------------------DVPRIELF-AR--H------TSHGFDVWG-----------NQCGTPS EL18_01388_Nitratireductor_basaltis_667917580 
----MHLF-DWPFGDLNP--------------------------HSFDLIMADPPWAFELRSDKGEGKSA--------------QSH-------------YKCQTLDE-----IKALPVLDLAAPD-CLLWLWATNP--------MLPQAFE----------VMA----AWGFTFKT--AGAWGKTTV-------------------------NGKLAFG------------------TGYIFRSAHEPILIGTR------------------------------------------------GEPRTT-----------KSVRSL--------------------IMGQV------------------------REHSRKPEE---------AYAAAEKLIP------------------------------NARRLELF-SR--T------DRAGWEVWG-----------DEAGKFG BM_Bm2284d_Brugia_malayi_671410008 PPFSSFIINDACVAEALI--------RYG---------------KKFDFILLDPPWENK-------------------------SVK-------------RKTVYPTYGDHRWMLDLCLPELLKES-GLLAIWVTNNAK------HLKFTDN----------MIE----YFGFEKIA--TWRWLKVTN-------------------------SGEPVYN------------------LNSQHKQPFENIVFASC------------------------------------------------VAARRHY----------MNIANE------------------FALISTPS-----------------------AIHSRKPPLFPVLQALGILEETAEQL-------------------------------------ELY-GR--Y------LLPRTITVG-----------FEAAKLQ GSPATT00018667001_Paramecium_tetraurelia_strain_d4-2_145532196 VKS--YINCDIRYFNLDF--------LVE--K------V-----GGFDVVLMDPPWRIKGGQQNDSSFMFTNSKF---------SLD-------------YNTMSNQE-----IMDIKIEKLS-KK-GFLFLWILNT--------QLNIAYE----------MAS----KWGYEIVD--QIIWVKLNPQ------------------------GNNVYLS------------------TGYYFMHSFEICLVGYK------------------------------------------------CPPGEHVEY--------HSKISN------------------NIIFSPV------------------------RNKSQKPIE---------MYEIIELMMP------------------------------GAKKVEIF-ARNNN------LRHGWFSIG-----------NQLGETY TGFOU_268840_Toxoplasma_gondii_FOU_672286124 
NTPSLCIQANLHHFDWGI--L----------G----G-------VKFDVILVDPPWQEYFDRCAAIGAT---------------NED-------------LTPWTLEE-----MLQLPVEKIGDTP-SFCFLWCGVT--------HLEDARQ----------LLH----KWGYRRCE--DICWLKTNKKAAQRRREQNAAHVNDVLDYK----ATQLVHD------------------ETSILQRTTEHCLMGIK------------------------------------------------GTVRRSQDSHFI-----HANLDT------------------DILISEQEEE---------------------VGCTRKPEE---------LYDIIERFCL------------------------------GRRRIELF-GRDWN------RRAGWVTVG-----------CEFGLTT JP75_07920_Devosia_riboflavina_674766351 ----------MTAWPFGA--M----------P------M-----FSFDVVMADPPWSFDNWSEGGNAKNA--------------KAQ-------------YDCMPTPD-----IKRLPVGHLAAGD-CWLWLWATYP--------MLPDAIE----------VMD----AWGFRYVT--AGPWVKRGT-------------------------SGKLAMG------------------TGYVLRSCSEIFLIGKN------------------------------------------------GEPKT------------HARDVR------------------NVLEAPR------------------------REHSRKPDE---------AYAMAEKLFG------------------------------PGRRADLF-SR--E------TRPGWTSWG-----------NESTKFD AF48_RS10595_Enterobacter_aerogenes_695800969 --------------------------MT----------------GKYTLIYADPPWSYRDKAADGDRGA---------------GFK-------------YPVMNVMD-----ICRLPVWELSADD-CLLAMWWVPT--------QPVEALK----------VVE----AWGFRLMTMKGFTWHKINKH------------------------KGNSAIG------------------MGHMTRANSEDCLFAVR------------------------------------------------GKLPERMDASICQ----H-------------------------VTAPR------------------------LENSRKPDV---------IREKLVQLLG------------------------------DVPRIELF-AR--Q------SSHGFDVWG-----------NQCIAPA RMATCC62417_10014_Rhizopus_microsporus_727142779 
PPRSSFIMGSMTDSSLQQLS------DYV--S--S---L-----GGADLIIIDPPWPNK-------------------------SVHRSSK---------YETQDIYD-----LFTIPMKDMINVN-SVVAVWVTNKP-------KFRKFIIHK--------LFP----AWELECKA--EWVWLKVTT-------------------------EGQCIFP------------------LDSSHKKPYEQLIIGHR------------------------------------------------QKTSD------------L--PSR------------------HVIVSVPS-----------------------LRHSRKPPL---------QDVILPYLKNKD----------------------------RPVCVEMF-AR--C------LTPGWISWG-----------NECLKFQ RMATCC62417_07548_Rhizopus_microsporus_727145762 PEWCIPIKANVMTYDWDS--------LAK--E------------VQFDVIVTDPPWQLA-------------------------THAPTRGVAIA-----YQQLPDIC-----IEEIPIPKLQ-KN-GFLFIWVINN--------KYAKAFE----------LME----KWGYTYVD--DITWVKQTV-------------------------NRRMAKG------------------HGYYLQHAKETCLVGKK------------------------------------------------GQDPPNC----------RHSVGS------------------DVIFSER------------------------RGQSQKPEE---------LYELIEELVP------------------------------NGKYLEIF-GRKNN------LRDYWVTIG----------------NE MVEG_02535_Mortierella_verticillata_NRRL_6337_672827354 PGSRYEETNNVLDMDLKR--F----------G------------TDYQVIYMDPPLLRA-------------------------GEEPG-----------PNKITMEQ-----LATLDIGSIL-PK-GFLFVWIEKE--------FLPDIVR----------LAE----RWEFRYVE--NFCWIKRNV-------------------------NNLIARE------------------PSPYFNSSKLSCLIFRKE-----------------------------------------------GDVEL------------RHQRSP------------------DCVFDFPKPVNAA------------------TLSEEKPKF---------MYELIETLLPQAVYSESNPN--------------------GDKMMELW-ARPGT------RRKGWTSIC---------------QTK OT_ostta06g01320_Ostreococcus_tauri_693499469 
VPDCVHMQKNIKSMKYET--L----------G------------TDYLGVLLNPPWDIE-------------------------NSPD------------RGDVTVDD-----IEAIPLEKLT-PL-GFIFIWVEKE--------NLSKVCD----------VMH----EKNFVYVE--NLTWVHLKP-------------------------NNTIVES------------------AARYLGRSHRTLLIFRRDVRDKRFVEG--------------------------------------KKIEL------------RHQRNSDVTL--------------DIIQTTK------------------------SGRRAIPEH---------VYKSIETLLPKAYEPGT-----------------------PGKLLELW-AEPGA------KRAGWTSVA----------------DS GLOINDRAFT_123982_Rhizophagus_irregularis_DAOM_181602_552937933 IHGSRYFENDILSMDLKK--L----------G------------QDFQAVYIDPPFLLP-------------------------DEEPS-----------PEKITLQQ-----FESLKVPDIV-PK-GFLFIWVEKE--------FIPDIVQ----------IAE----KWNFRYVE--NFCWIKKHI-------------------------NNQISRA------------------PYRYFNKSKLSLLIFRKE-----------------------------------------------GDIEL------------RHQRNP------------------DCVFDFIKPRTLE------------------MLTEGKPRV---------MYDIIETMLPQAVYNEQNPN--------------------GDKLLELW-AKKGS------HRQGWTTVV----------------QI RMATCC62417_05354_Rhizopus_microsporus_727148790 IVGSRYYEVDNIVSTDLTQ-Y----------G------------TDFNAVYMDPPFLLP-------------------------GEEPV-----------AGKITIDD-----FGALNVADIV-KA-GFLFIWLEKE--------WIQRVVN----------ITA----KWGFKYVE--NFCWIKKDV-------------------------NNQIHKS------------------PYRYFNKSKLSLLIFRKE-----------------------------------------------GDIEL------------RHQRNP------------------DCVFDFIRPKLPD------------------EISEKKPPF---------MYKVIETLLPTANYHIENNPN-------------------GERLLELW-AKKGQ------KRQGWTTVV----------------ER GSPATT00005554001_Paramecium_tetraurelia_strain_d4-2_145487402 
YNGKNQLSANLMKQIKEGN-H----------Q------L-----FKLRIIVLKKILGTDLSQ---------------------YVKGVQGIFI--------DNLFRKD-----LKNLDLSKKLISN-GILFIWSDKG--------LINEILE----------IME----SKGFTYIE--NLVVVQLSLEKALEELNKHMKIEQ----------TEEAVLDNLNFLQQKVQVKDLIVNCPSKVLNQSKQVLIMFRK------------------------------------------------FDEQKTQLEL-------RHQRTP------------------DVLFDFVSN----------------------GKKSEKSKEY--------IYQTIETLLP------------------------------KSQLMEIF-AQRDQ------PRKDWISVC---------------ESK T424_RS0114345_Rhizobium_undicola_653315751 NTDAPSPSDDFTN-------F-----ISG---------------RKFATIMADPPWQFM-------------------------NRTGKVAPEHKRLNR-YGTMELDA-----IKALPVATACAPT-AHLYLWVPNA--------LLPEGLE----------VMK----AWGFNYKA--NIVWHKLRKD------------------------GGSDGRG------------------VGFYFRNVTELILFGTR------------------------------------------------GKNARTLP---------PGRSQV------------------NYIGTRK------------------------REHSRKPDE---------QYPLIESCSP------------------------------GP-YLEMF-GR--G------LRKGWTTWG-----------NQADETY STYLEM_4843_Stylonychia_lemnae_678336696 GLAQVIHPKEGIVKTNFDN-Y----------A------------KNVEAILINPCWVTQKDK----------------------KGK-------------VKGVTMDE-----FQQLNFSKNLMID-GLIFVWVEKE--------IISPVIK----------YFE----SQGLIYVE--NVCWVMLDQTKKEQVEATQSIDV-----------SPAYIRD------------------DYQYIRKSHKTLLMFRRLQKKN-------------------------------------------GNPLEL-----------RHQRTC------------------DVCFDFVDT----------------------NVHNYKPNEY--------LYKLIETLLPQSIVDEEKK---------------------HLRMIELW-AQDPK------PRKGWI------------------KFI consensus/100% 
.................................................................................................................h..h..........s.hh.W.....................................b...........h......................................................................................................................................................................................................................................................................................................... consensus/95% ..............................................h..lhhcPPh...............................................h.........h..h.l..h.....s.hhhW.............................hh......Wshp........W.+.p..........................................................s.b.hhh.................................................................................................................................P..................h.....................................chh.sp.................................... consensus/90% ..............................................a..lhhDPPh...............................................h.........h..h.l..l.....s.lhhWh............h...............hh......Wuhp......h.W.K.p.........................................................ps.c.hlh..p...........................................................................................h.s.............................p.KP..................h....................................hcla.up.............W.shs.................. consensus/85% .........p....................................a.hlhhDPPa............................................b..h...p.....h..l.l..l.....shlhlWh.s..........h....p..........hh......WGhpb.....h.WhK.p................................s.......................pps.E.hlhu.p................................................sp.......................................phh.s............................pspKP............b..hc.h....................................l-lF.uc..........p..W.shu.................. 
consensus/80% .........s....................................ashlhhDPPW............................p...............h..hs.pp.....l..lsl..l...p.uhlhlWh.s..........h..sbp..........hh......WGapb.p...h.WlK.s............................p...s...................s.h.pps.E.hLhu.+................................................up...................p...................phl.s............................popKP.b.........hb.hh-.h...................................blElF.uR..........p.sW.shG.................. consensus/75% .........sh...................................FshlhhDPPWpb..........................p...............a..hs.pc.....l..lsl.pl...p.uhlhlWhss..........h.bsbp..........hhp.....WGapbhp..ph.WlK.s............................p...s...................s.h.ppspE.hLhu.+................................................Gp...................ss..................pllhs............................popKP.b.........ha.hh-ph.s.................................blElF.uR..p.......p.sW.shG.................. consensus/70% ...s..h.s-h..hp..............................pFslIhhDPPWpb..........................p...............Ysshs.p-.....l..lsl.pl...s.uhlalWhss..........h.bubp..........hhp.....WGaphhp..ph.WlK.s............................p..bs...................s.hhppspE.hLlu.+................................................Gps..................ss..................sllhs............................pSpKP.b.........ha.hlEph.s..............................ss.blElF.uR..p.......+.sW.shG..................
Eukaryotic versions (For a proper resolution of the relationships please refer to tree)
GI | Domain-architecture | Pfam | Gene name | Len | Taxonomy | Species name | Genbank
# 178; METTL3/METTL14
159466562 | CCCH+CCCH+N6-MTase | MT-A70 | CHLREDRAFT_128290 | 367 | eukaryota>viridiplantae>chlorophyta | Chlamydomonas reinhardtii | hypothetical protein CHLREDRAFT_128290 [Chlamydomonas reinhardtii].
300265564 | CCCH+CCCH+N6-MTase | MT-A70 | VOLCADRAFT_59216 | 287 | eukaryota>viridiplantae>chlorophyta | Volvox carteri f. nagariensis | hypothetical protein VOLCADRAFT_59216 [Volvox carteri f. nagariensis].
15236910 | CCCH+CCCH+N6-MTase | MT-A70 | AT4G10760 | 685 | eukaryota>viridiplantae | Arabidopsis thaliana | N6-adenosine-methyltransferase MT-A70-like protein [Arabidopsis thaliana].
302815848 | CCCH+CCCH+N6-MTase | MT-A70 | SELMODRAFT_130187 | 383 | eukaryota>viridiplantae | Selaginella moellendorffii | hypothetical protein SELMODRAFT_130187 [Selaginella moellendorffii].
168053183 | CCCH+CCCH+N6-MTase | MT-A70 | PHYPADRAFT_40965 | 556 | eukaryota>viridiplantae | Physcomitrella patens | predicted protein, partial [Physcomitrella patens].
641538296 | CCCH+CCCH+N6-MTase | MT-A70 | SPRG_03347 | 394 | eukaryota>stramenopiles | Saprolegnia parasitica CBS 223.65 | hypothetical protein SPRG_03347 [Saprolegnia parasitica CBS 223.65].
Aque1000027926 | CCCH+CCCH+N6-MTase | MT-A70 | Aque1000027926 | 510 | eukaryota>metazoa>porifera | Amphimedon queenslandica | Aqu1.228194
Lgig1000006160 | CCCH+CCCH+N6-MTase | MT-A70 | Lgig1000006160 | 526 | eukaryota>metazoa>mollusca | Lottia gigantea | e_gw1.88.185.1
21355141 | CCCH+CCCH+N6-MTase | MT-A70 | Dmel_CG5933 | 608 | eukaryota>metazoa>hexapoda | Drosophila melanogaster | inducer of meiosis 4 [Drosophila melanogaster].
307194509 | CCCH+CCCH+N6-MTase | MT-A70 | EAI_16445 | 549 | eukaryota>metazoa>hexapoda | Harpegnathos saltator | N6-adenosine-methyltransferase 70 kDa subunit [Harpegnathos saltator].
307182701 | CCCH+CCCH+N6-MTase | MT-A70 | EAG_11443 | 548 | eukaryota>metazoa>hexapoda | Camponotus floridanus | N6-adenosine-methyltransferase 70 kDa subunit [Camponotus floridanus].
189238819 | CCCH+CCCH+N6-MTase | MT-A70 | LOC656280 | 540 | eukaryota>metazoa>hexapoda | Tribolium castaneum | PREDICTED: N6-adenosine-methyltransferase 70 kDa subunit [Tribolium castaneum].
193683437 | CCCH+CCCH+N6-MTase | MT-A70 | LOC100159080 | 550 | eukaryota>metazoa>hexapoda | Acyrthosiphon pisum | PREDICTED: N6-adenosine-methyltransferase 70 kDa subunit [Acyrthosiphon pisum].
110749760 | CCCH+CCCH+N6-MTase | MT-A70 | LOC551911 | 556 | eukaryota>metazoa>hexapoda | Apis mellifera | PREDICTED: N6-adenosine-methyltransferase 70 kDa subunit-like [Apis mellifera].
158290414 | CCCH+CCCH+N6-MTase | MT-A70 | AgaP_AGAP002895 | 621 | eukaryota>metazoa>hexapoda | Anopheles gambiae str. PEST | AGAP002895-PA [Anopheles gambiae str. PEST].
156548054 | CCCH+CCCH+N6-MTase | Pox_A31+MT-A70 | LOC100122395 | 527 | eukaryota>metazoa>hexapoda | Nasonia vitripennis | PREDICTED: similar to n6-adenosine-methyltransferase ime4 [Nasonia vitripennis].
72085101 | CCCH+CCCH+N6-MTase | MT-A70 | LOC589354 | 242 | eukaryota>metazoa>echinodermata | Strongylocentrotus purpuratus | PREDICTED: similar to (N6-adenosine)-methyltransferase, partial [Strongylocentrotus purpuratus].
321476680 | CCCH+CCCH+N6-MTase | MT-A70 | DAPPUDRAFT_192406 | 537 | eukaryota>metazoa>crustacea | Daphnia pulex | hypothetical protein DAPPUDRAFT_192406 [Daphnia pulex].
321452889 | CCCH+CCCH+N6-MTase | MT-A70 | DAPPUDRAFT_305196 | 260 | eukaryota>metazoa>crustacea | Daphnia pulex | hypothetical protein DAPPUDRAFT_305196 [Daphnia pulex].
321452885 | CCCH+CCCH+N6-MTase | MT-A70 | DAPPUDRAFT_66395 | 225 | eukaryota>metazoa>crustacea | Daphnia pulex | hypothetical protein DAPPUDRAFT_66395, partial [Daphnia pulex].
156398086 | CCCH+CCCH+N6-MTase | MT-A70 | NEMVEDRAFT_v1g33607 | 237 | eukaryota>metazoa>cnidaria | Nematostella vectensis | predicted protein, partial [Nematostella vectensis].
221126117 | CCCH+CCCH+N6-MTase | MT-A70 | LOC100197970 | 335 | eukaryota>metazoa>cnidaria | Hydra magnipapillata | PREDICTED: similar to Methyltransferase like 3, partial [Hydra magnipapillata].
47086489 | CCCH+CCCH+N6-MTase | MT-A70 | mettl3 | 584 | eukaryota>metazoa>chordata>vertebrata>actinopterygii | Danio rerio | N6-adenosine-methyltransferase subunit METTL3 [Danio rerio].
597501008 | CCCH+CCCH+N6-MTase | MT-A70 | - | 584 | eukaryota>metazoa>chordata>vertebrata>actinopterygii | Danio rerio | RecName: Full=N6-adenosine-methyltransferase subunit METTL3; AltName: Full=N6-adenosine-methyltransferase 70 kDa subunit; Short=MT-A70.
47227445 | CCCH+CCCH+N6-MTase | MT-A70 | GSTEN:00024364:G:001 | 530 | eukaryota>metazoa>chordata>vertebrata>actinopterygii | Tetraodon nigroviridis | unnamed protein product, partial [Tetraodon nigroviridis].
114651893 | CCCH+CCCH+N6-MTase | MT-A70 | METTL3 | 558 | eukaryota>metazoa>chordata>vertebrata | Pan troglodytes | PREDICTED: methyltransferase like 3 isoform 5 [Pan troglodytes].
114651897 | CCCH+CCCH+N6-MTase | MT-A70 | METTL3 | 505 | eukaryota>metazoa>chordata>vertebrata | Pan troglodytes | PREDICTED: methyltransferase like 3 isoform 4 [Pan troglodytes].
67078430 | CCCH+CCCH+N6-MTase | MT-A70 | Mettl3 | 580 | eukaryota>metazoa>chordata>vertebrata | Rattus norvegicus | N6-adenosine-methyltransferase 70 kDa subunit [Rattus norvegicus].
21361827 | CCCH+CCCH+N6-MTase | MT-A70 | METTL3 | 580 | eukaryota>metazoa>chordata>vertebrata | Homo sapiens | N6-adenosine-methyltransferase 70 kDa subunit [Homo sapiens].
327285111 | CCCH+CCCH+N6-MTase | MT-A70 | mettl3 | 569 | eukaryota>metazoa>chordata>vertebrata | Anolis carolinensis | PREDICTED: N6-adenosine-methyltransferase 70 kDa subunit [Anolis carolinensis].
77627973 | CCCH+CCCH+N6-MTase | MT-A70 | Mettl3 | 580 | eukaryota>metazoa>chordata>vertebrata | Mus musculus | N6-adenosine-methyltransferase subunit METTL3 [Mus musculus].
114651889 | CCCH+CCCH+N6-MTase | MT-A70 | METTL3 | 580 | eukaryota>metazoa>chordata>vertebrata | Pan troglodytes | PREDICTED: n6-adenosine-methyltransferase 70 kDa subunit isoform 3 [Pan troglodytes].
114651891 | CCCH+CCCH+N6-MTase | MT-A70 | METTL3 | 592 | eukaryota>metazoa>chordata>vertebrata | Pan troglodytes | PREDICTED: methyltransferase like 3 isoform 2 [Pan troglodytes].
198413310 | CCCH+CCCH+N6-MTase | MT-A70 | LOC100176137 | 305 | eukaryota>metazoa>chordata | Ciona intestinalis | PREDICTED: similar to methyltransferase like 3 [Ciona intestinalis].
210086854 | CCCH+CCCH+N6-MTase | MT-A70 | BRAFLDRAFT_288147 | 554 | eukaryota>metazoa>chordata | Branchiostoma floridae | hypothetical protein BRAFLDRAFT_288147 [Branchiostoma floridae].
219434179 | CCCH+CCCH+N6-MTase | MT-A70 | BRAFLDRAFT_217213 | 568 | eukaryota>metazoa>chordata | Branchiostoma floridae | hypothetical protein BRAFLDRAFT_217213 [Branchiostoma floridae].
Smar1000006443 | CCCH+CCCH+N6-MTase | MT-A70 | Smar1000006443 | 559 | eukaryota>metazoa>arthropoda>myriapoda | Strigamia maritima | SMAR007641-PA pep:novel scaffold:Smar1:JH431796:151221:153694:-1 gene:SMAR007641 transcript:SMAR007641-RA
Hrob1000005423 | CCCH+CCCH+N6-MTase | MT-A70 | Hrob1000005423 | 264 | eukaryota>metazoa>annelida | Helobdella robusta | 68987
Hrob1000012135 | CCCH+CCCH+N6-MTase | MT-A70 | Hrob1000012135 | 262 | eukaryota>metazoa>annelida | Helobdella robusta | 121421
Caps1000024491 | CCCH+CCCH+N6-MTase | MT-A70 | Caps1000024491 | 517 | eukaryota>metazoa>annelida | Capitella spI | fgenesh1_pg.C_scaffold_975000002
576692605 | CCCH+CCCH+N6-MTase | MT-A70 | EGR_08908 | 500 | eukaryota>metazoa | Echinococcus granulosus | N6-adenosine-methyltransferase subunit [Echinococcus granulosus].
238661046 | CCCH+CCCH+N6-MTase | MT-A70 | Smp_146300 | 630 | eukaryota>metazoa | Schistosoma mansoni | expressed protein [Schistosoma mansoni].
340369522 | CCCH+CCCH+N6-MTase | MT-A70 | LOC100641238 | 509 | eukaryota>metazoa | Amphimedon queenslandica | PREDICTED: N6-adenosine-methyltransferase subunit METTL3-like [Amphimedon queenslandica].
674587266 | CCCH+CCCH+N6-MTase | MT-A70 | HmN_000529400 | 505 | eukaryota>metazoa | Hymenolepis microstoma | n6 adenosine methyltransferase 70 kDa [Hymenolepis microstoma].
674263102 | CCCH+CCCH+N6-MTase | MT-A70 | EmuJ_000651200 | 517 | eukaryota>metazoa | Echinococcus multilocularis | n6 adenosine methyltransferase 70 kDa [Echinococcus multilocularis].
674567341 | CCCH+CCCH+N6-MTase | MT-A70 | EgrG_000651200 | 393 | eukaryota>metazoa | Echinococcus granulosus | n6 adenosine methyltransferase 70 kDa [Echinococcus granulosus].
485635833 | CCCH+CCCH+N6-MTase | MT-A70 | EMIHUDRAFT_232802 | 315 | eukaryota>haptophyceae | Emiliania huxleyi CCMP1516 | hypothetical protein EMIHUDRAFT_232802 [Emiliania huxleyi CCMP1516].
551562135 | CCCH+CCCH+N6-MTase | MT-A70 | EMIHUDRAFT_211665 | 315 | eukaryota>haptophyceae | Emiliania huxleyi CCMP1516 | hypothetical protein EMIHUDRAFT_211665 [Emiliania huxleyi CCMP1516].
Uram1000000276 | CCCH+CCCH+N6-MTase | MT-A70 | Uram1000000276 | 372 | eukaryota>fungi>mucoromycotina | Umbelopsis ramanniana | fgenesh1_kg.1_#_1214_#_combest_scaffold_1_3769
Spun1000003158 | CCCH+CCCH+N6-MTase | MT-A70 | Spun1000003158 | 581 | eukaryota>fungi>chytridiomycota | Spizellomyces punctatus | Spizellomyces punctatus DAOM BR117 hypothetical protein (581 aa)
Wseb1000002161 | CCCH+CCCH+N6-MTase | MT-A70 | Wseb1000002161 | 416 | eukaryota>fungi>basidiomycota | Wallemia sebi | estExt_fgenesh1_kg.C_70035
242208543 | CCCH+CCCH+N6-MTase | MT-A70 | POSPLDRAFT_25577 | 295 | eukaryota>fungi>basidiomycota | Postia placenta Mad-698-R | predicted protein, partial [Postia placenta Mad-698-R].
527302065 | CCCH+CCCH+N6-MTase | MT-A70 | FOMPIDRAFT_1022720 | 566 | eukaryota>fungi>basidiomycota | Fomitopsis pinicola FP-58527 SS1 | hypothetical protein FOMPIDRAFT_1022720 [Fomitopsis pinicola FP-58527 SS1].
58268080 | CCCH+CCCH+N6-MTase | MT-A70 | CNE03860 | 406 | eukaryota>fungi>basidiomycota | Cryptococcus neoformans var. neoformans JEC21 | mRNA methyltransferase [Cryptococcus neoformans var. neoformans JEC21].
299747281 | CCCH+CCCH+N6-MTase | MT-A70 | CC1G_14583 | 596 | eukaryota>fungi>basidiomycota | Coprinopsis cinerea okayama7#130 | m6a methyltransferase [Coprinopsis cinerea okayama7#130].
71017811 | CCCH+CCCH+N6-MTase | MT-A70 | UM02989.1 | 395 | eukaryota>fungi>basidiomycota | Ustilago maydis 521 | hypothetical protein UM02989.1 [Ustilago maydis 521].
Abis1000003455 | CCCH+CCCH+N6-MTase | MT-A70 | Abis1000003455 | 312 | eukaryota>fungi>basidiomycota | Agaricus bisporus | e_gw1.4.1246.1
164649925 | CCCH+CCCH+N6-MTase | MT-A70 | LACBIDRAFT_243879 | 309 | eukaryota>fungi>basidiomycota | Laccaria bicolor S238N-H82 | predicted protein, partial [Laccaria bicolor S238N-H82].
50545848 | CCCH+CCCH+N6-MTase | MT-A70 | YALI0B03498g | 587 | eukaryota>fungi>ascomycota | Yarrowia lipolytica CLIB122 | YALI0B03498p [Yarrowia lipolytica CLIB122].
50310943 | CCCH+N6-MTase | MT-A70 | KLLA0F09097g | 524 | eukaryota>fungi>ascomycota | Kluyveromyces lactis NRRL Y-1140 | hypothetical protein [Kluyveromyces lactis NRRL Y-1140].
6321246 | CCCH+N6-MTase | MT-A70 | YGL192W | 600 | eukaryota>fungi>ascomycota | Saccharomyces cerevisiae S288c | Ime4p [Saccharomyces cerevisiae S288c].
50412773 | CCCH+N6-MTase | MT-A70 | DEHA0B04491 | 536 | eukaryota>fungi>ascomycota | Debaryomyces hansenii CBS767 | hypothetical protein DEHA0B04491 [Debaryomyces hansenii CBS767].
1174426 | CCCH+CCCH+N6-MTase | MT-A70 | - | 600 | eukaryota>fungi>ascomycota | Saccharomyces cerevisiae S288c | RecName: Full=N6-adenosine-methyltransferase IME4.
150864816 | CCCH+CCCH+N6-MTase | MT-A70 | PICST_57562 | 531 | eukaryota>fungi>ascomycota | Scheffersomyces stipitis CBS 6054 | activator of IME1 Predicted N6-adenine RNA methylase IME4 [Scheffersomyces stipitis CBS 6054].
50284965 | CCCH+N6-MTase | MT-A70 | CAGL0A03300g | 488 | eukaryota>fungi>ascomycota | Candida glabrata CBS 138 | hypothetical protein [Candida glabrata CBS 138].
45198691 | CCCH+N6-MTase | MT-A70 | AGOS_AFR173W | 559 | eukaryota>fungi>ascomycota | Eremothecium gossypii ATCC 10895 | AFR173Wp [Eremothecium gossypii ATCC 10895].
68466659 | CCCH+CCCH+N6-MTase | MT-A70 | CaO19.1476 | 543 | eukaryota>fungi>ascomycota | Candida albicans SC5314 | hypothetical protein CaO19.1476 [Candida albicans SC5314].
Adig1000001851 | CCCH+CCCH+N6-MTase | MT-A70 | Adig1000001851 | 262 | eukaryota>cnidaria | Acropora digitifera | adi_v1.03360
514696822 | CCCH+CCCH+N6-MTase | MT-A70 | PTSG_03395 | 797 | eukaryota>choanoflagellida | Salpingoeca rosetta | N6-adenosine-methyltransferase 70 kDa subunit [Salpingoeca rosetta].
145534770 | CCCH+CCCH+N6-MTase | Methyltransf_26+MT-A70 | GSPATT00019450001 | 493 | eukaryota>alveolata>ciliophora | Paramecium tetraurelia strain d4-2 | hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
145529029 | CCCH+CCCH+N6-MTase | Methyltransf_26+MT-A70 | GSPATT00017263001 | 539 | eukaryota>alveolata>ciliophora | Paramecium tetraurelia strain d4-2 | hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
586734236 | CCCH+CCCH+N6-MTase | MT-A70 | TTHERM_00962190 | 741 | eukaryota>alveolata>ciliophora | Tetrahymena thermophila SB210 | N6-adenosine-methyltransferase 70 kDa subunit (macronuclear) [Tetrahymena thermophila SB210].
672280105 | CCCH+CCCH+N6-MTase | MT-A70 | TGFOU_217350 | 453 | eukaryota>alveolata>apicomplexa | Toxoplasma gondii FOU | putative methyltransferase MTA70, partial [Toxoplasma gondii FOU].
124512114 | CCCH+CCCH+N6-MTase | MT-A70 | PF07_0123 | 760 | eukaryota>alveolata>apicomplexa | Plasmodium falciparum 3D7 | mRNA (N6-adenosine)-methyltransferase, putative [Plasmodium falciparum 3D7].
221485567 | CCCH+CCCH+N6-MTase | MT-A70 | TGGT1_028850 | 823 | eukaryota>alveolata>apicomplexa | Toxoplasma gondii GT1 | N6-adenosine-methyltransferase 70 kDa subunit, putative [Toxoplasma gondii GT1].
156087837 | CCCH+CCCH+N6-MTase | MT-A70 | BBOV_III001900 | 641 | eukaryota>alveolata>apicomplexa | Babesia bovis T2Bo | MT-A70 family protein [Babesia bovis T2Bo].
307110630 | N6-MTase | MT-A70 | CHLNCDRAFT_50385 | 309 | eukaryota>viridiplantae>chlorophyta | Chlorella variabilis | hypothetical protein CHLNCDRAFT_50385 [Chlorella variabilis].
116060398 | N6-MTase | MT-A70 | Ot11g01290 | 371 | eukaryota>viridiplantae>chlorophyta | Ostreococcus tauri | Predicted N6-adenine methylase involved in transcription regulation (ISS) [Ostreococcus tauri].
308809243 | N6-MTase | MT-A70 | Ot11g01290 | 371 | eukaryota>viridiplantae>chlorophyta | Ostreococcus tauri | Predicted N6-adenine methylase involved in transcription regulation (ISS) [Ostreococcus tauri].
255086517 | N6-MTase | MT-A70 | MICPUN_76124 | 165 | eukaryota>viridiplantae>chlorophyta | Micromonas sp. RCC299 | predicted protein, partial [Micromonas sp. RCC299].
158275861 | N6-MTase | MT-A70 | CHLREDRAFT_174824 | 331 | eukaryota>viridiplantae>chlorophyta | Chlamydomonas reinhardtii | predicted protein [Chlamydomonas reinhardtii].
300268372 | N6-MTase | MT-A70 | VOLCADRAFT_116042 | 245 | eukaryota>viridiplantae>chlorophyta | Volvox carteri f. nagariensis | hypothetical protein VOLCADRAFT_116042, partial [Volvox carteri f. nagariensis].
159474530 | N6-MTase | MT-A70 | CHLREDRAFT_174824 | 331 | eukaryota>viridiplantae>chlorophyta | Chlamydomonas reinhardtii | predicted protein [Chlamydomonas reinhardtii].
303284481 | N6-MTase | MT-A70 | MICPUCDRAFT_51320 | 357 | eukaryota>viridiplantae>chlorophyta | Micromonas pusilla CCMP1545 | hypothetical protein MICPUCDRAFT_51320 [Micromonas pusilla CCMP1545].
Cmer1000000605 | N6-MTase | MT-A70 | Cmer1000000605 | 393 | eukaryota>rhodophyta | Cyanidioschyzon merolae | CME116C similar to (N6-adenosine)-methyltransferase
452822052 | N6-MTase | MT-A70 | Gasu_34680 | 285 | eukaryota>rhodophyta | Galdieria sulphuraria | mRNA (2'-O-methyladenosine-N6-)-methyltransferase [Galdieria sulphuraria].
544210442 | N6-MTase | MT-A70 | CYME_CME116C | 392 | eukaryota>rhodophyta | Cyanidioschyzon merolae strain 10D | similar to (N6-adenosine)-methyltransferase [Cyanidioschyzon merolae strain 10D].
569355952 | N6-MTase | MT-A70 | RFI_31139 | 322 | eukaryota>rhizaria | Reticulomyxa filosa | MT-A70 family protein, partial [Reticulomyxa filosa].
121913835 | N6-MTase | MT-A70 | TVAG_062450 | 392 | eukaryota>parabasalia | Trichomonas vaginalis G3 | MT-A70 family protein [Trichomonas vaginalis G3].
123390303 | N6-MTase | MT-A70 | TVAG_002370 | 412 | eukaryota>parabasalia | Trichomonas vaginalis G3 | MT-A70 family protein [Trichomonas vaginalis G3].
154414896 | N6-MTase | MT-A70 | TVAG_136190 | 397 | eukaryota>parabasalia | Trichomonas vaginalis G3 | MT-A70 family protein [Trichomonas vaginalis G3].
121880798 | N6-MTase | MT-A70 | TVAG_002370 | 412 | eukaryota>parabasalia | Trichomonas vaginalis G3 | MT-A70 family protein [Trichomonas vaginalis G3].
121908694 | N6-MTase | MT-A70 | TVAG_389630 | 359 | eukaryota>parabasalia | Trichomonas vaginalis G3 | MT-A70 family protein [Trichomonas vaginalis G3].
154413191 | N6-MTase | MT-A70 | TVAG_062450 | 392 | eukaryota>parabasalia | Trichomonas vaginalis G3 | MT-A70 family protein [Trichomonas vaginalis G3].
121914692 | N6-MTase | MT-A70 | TVAG_136190 | 397 | eukaryota>parabasalia | Trichomonas vaginalis G3 | MT-A70 family protein [Trichomonas vaginalis G3].
Aque1000012323 | N6-MTase | MT-A70 | Aque1000012323 | 451 | eukaryota>metazoa>porifera | Amphimedon queenslandica | Aqu1.212591
307170446 | N6-MTase | MT-A70 | EAG_00575 | 205 | eukaryota>metazoa>hexapoda | Camponotus floridanus | Methyltransferase-like protein KIAA1627-like protein [Camponotus floridanus].
307177286 | N6-MTase | MT-A70 | EAG_12613 | 382 | eukaryota>metazoa>hexapoda | Camponotus floridanus | Methyltransferase-like protein KIAA1627-like protein [Camponotus floridanus].
19920926 | N6-MTase | MT-A70 | Dmel_CG7818 | 397 | eukaryota>metazoa>hexapoda | Drosophila melanogaster | CG7818 [Drosophila melanogaster].
158297043 | N6-MTase | MT-A70 | AgaP_AGAP008111 | 392 | eukaryota>metazoa>hexapoda | Anopheles gambiae str. PEST | AGAP008111-PA [Anopheles gambiae str. PEST].
48138147 | N6-MTase | MT-A70 | LOC409900 | 390 | eukaryota>metazoa>hexapoda | Apis mellifera | PREDICTED: methyltransferase-like protein 14 homolog [Apis mellifera].
193577905 | N6-MTase | MT-A70 | LOC100163326 | 387 | eukaryota>metazoa>hexapoda | Acyrthosiphon pisum | PREDICTED: methyltransferase-like protein 14 homolog [Acyrthosiphon pisum].
193664699 | N6-MTase | MT-A70 | LOC100169342 | 366 | eukaryota>metazoa>hexapoda | Acyrthosiphon pisum | PREDICTED: methyltransferase-like protein 14 homolog [Acyrthosiphon pisum].
91089719 | N6-MTase | MT-A70 | LOC663857 | 390 | eukaryota>metazoa>hexapoda | Tribolium castaneum | PREDICTED: methyltransferase-like protein 14 homolog [Tribolium castaneum].
156553899 | N6-MTase | MT-A70 | LOC100117110 | 390 | eukaryota>metazoa>hexapoda | Nasonia vitripennis | PREDICTED: methyltransferase-like protein 14 homolog [Nasonia vitripennis].
291232903 | N6-MTase | MT-A70 | LOC100376676 | 456 | eukaryota>metazoa>hemichordata | Saccoglossus kowalevskii | PREDICTED: methyltransferase-like protein 14-like [Saccoglossus kowalevskii].
115935304 | N6-MTase | MT-A70 | LOC579598 | 363 | eukaryota>metazoa>echinodermata | Strongylocentrotus purpuratus | PREDICTED: similar to MGC79735 protein [Strongylocentrotus purpuratus].
321457952 | N6-MTase | MT-A70 | DAPPUDRAFT_130106 | 168 | eukaryota>metazoa>crustacea | Daphnia pulex | hypothetical protein DAPPUDRAFT_130106 [Daphnia pulex].
221131975 | N6-MTase | MT-A70 | LOC100197023 | 446 | eukaryota>metazoa>cnidaria | Hydra vulgaris | PREDICTED: N6-adenosine-methyltransferase subunit METTL14-like [Hydra vulgaris].
42517136 | N6-MTase | MT-A70 | Mettl14 | 456 | eukaryota>metazoa>chordata>vertebrata | Mus musculus | methyltransferase like 14 [Mus musculus].
327274188 | N6-MTase | MT-A70 | mettl14 | 456 | eukaryota>metazoa>chordata>vertebrata | Anolis carolinensis | PREDICTED: N6-adenosine-methyltransferase subunit METTL14 [Anolis carolinensis].
24308265 | N6-MTase | MT-A70 | METTL14 | 456 | eukaryota>metazoa>chordata>vertebrata | Homo sapiens | N6-adenosine-methyltransferase subunit METTL14 [Homo sapiens].
109467560 | N6-MTase | MT-A70 | RGD1304822_predicted | 456 | eukaryota>metazoa>chordata>vertebrata | Rattus norvegicus | PREDICTED: similar to CG7818-PA [Rattus norvegicus].
71896697 | N6-MTase | MT-A70 | METTL14 | 459 | eukaryota>metazoa>chordata>vertebrata | Gallus gallus | N6-adenosine-methyltransferase subunit METTL14 [Gallus gallus].
224049178 | N6-MTase | MT-A70 | METTL14 | 459 | eukaryota>metazoa>chordata>vertebrata | Taeniopygia guttata | PREDICTED: methyltransferase-like protein 14 [Taeniopygia guttata].
46309507 | N6-MTase | MT-A70 | mettl14 | 455 | eukaryota>metazoa>chordata>vertebrata>actinopterygii | Danio rerio | N6-adenosine-methyltransferase subunit METTL14 [Danio rerio].
326918994 | N6-MTase | MT-A70 | LOC100540744 | 490 | eukaryota>metazoa>chordata>vertebrata | Meleagris gallopavo | PREDICTED: methyltransferase-like protein 14-like [Meleagris gallopavo].
47207634 | N6-MTase | MT-A70 | GSTEN:00009248:G:001 | 241 | eukaryota>metazoa>chordata>vertebrata>actinopterygii | Tetraodon nigroviridis | unnamed protein product, partial [Tetraodon nigroviridis].
114595809 | N6-MTase | MT-A70 | METTL14 | 456 | eukaryota>metazoa>chordata>vertebrata | Pan troglodytes | PREDICTED: N6-adenosine-methyltransferase subunit METTL14 [Pan troglodytes].
219440655 | N6-MTase | MT-A70 | BRAFLDRAFT_58874 | 164 | eukaryota>metazoa>chordata | Branchiostoma floridae | hypothetical protein BRAFLDRAFT_58874 [Branchiostoma floridae].
198424026 | N6-MTase | MT-A70 | LOC100176240 | 474 | eukaryota>metazoa>chordata | Ciona intestinalis | PREDICTED: methyltransferase-like protein 14 homolog [Ciona intestinalis].
Smar1000012137 | N6-MTase | MT-A70 | Smar1000012137 | 431 | eukaryota>metazoa>arthropoda>myriapoda | Strigamia maritima | SMAR002698-PA pep:novel scaffold:Smar1:JH431216:18104:19856:1 gene:SMAR002698 transcript:SMAR002698-RA
Caps1000008038 | N6-MTase | MT-A70 | Caps1000008038 | 338 | eukaryota>metazoa>annelida | Capitella spI | fgenesh1_pg.C_scaffold_355000005
Hrob1000012559 | N6-MTase | MT-A70 | Hrob1000012559 | 369 | eukaryota>metazoa>annelida | Helobdella robusta | 79167
576693404 | N6-MTase | MT-A70 | EGR_08094 | 418 | eukaryota>metazoa | Echinococcus granulosus | N6-adenosine-methyltransferase subunit [Echinococcus granulosus].
340382361 | N6-MTase | MT-A70 | LOC100635916 | 450 | eukaryota>metazoa | Amphimedon queenslandica | PREDICTED: methyltransferase-like protein 14 homolog [Amphimedon queenslandica].
Sarc1000000133 | N6-MTase | MT-A70 | Sarc1000000133 | 361 | eukaryota>ichthyosporea | Sphaeroforma arctica | Sphaeroforma arctica JP610 hypothetical protein (361 aa)
290979461 | N6-MTase | MT-A70 | NAEGRDRAFT_72415 | 451 | eukaryota>heterolobosea | Naegleria gruberi strain NEG-M | predicted protein [Naegleria gruberi strain NEG-M].
284086029 | N6-MTase | MT-A70 | NAEGRDRAFT_72415 | 451 | eukaryota>heterolobosea | Naegleria gruberi | predicted protein [Naegleria gruberi].
551613601 | N6-MTase | MT-A70 | EMIHUDRAFT_449178 | 245 | eukaryota>haptophyceae | Emiliania huxleyi CCMP1516 | hypothetical protein EMIHUDRAFT_449178 [Emiliania huxleyi CCMP1516].
Mver1000003113 | N6-MTase | MT-A70 | Mver1000003113 | 309 | eukaryota>fungi>zygomycete | Mortierella verticillata | Mortierella verticillata NRRL 6337 mRNA (2'-O-methyladenosine-N6-)-methyltransferase (309 aa)
Pisp1000005866 | N6-MTase | MT-A70 | Pisp1000005866 | 165 | eukaryota>fungi>neocallimastigomycota | Piromyces sp | estExt_Genewise1Plus.C_1020015
Uram1000008906 | N6-MTase | MT-A70 | Uram1000008906 | 267 | eukaryota>fungi>mucoromycotina | Umbelopsis ramanniana | fgenesh1_kg.75_#_199_#_combest_scaffold_75_134233
Bcir1000003454 | N6-MTase | MT-A70 | Bcir1000003454 | 264 | eukaryota>fungi>mucoromycotina | Backusella circina | estExt_fgenesh1_pg.C_270053
Lhya1000001414 | N6-MTase | MT-A70 | Lhya1000001414 | 272 | eukaryota>fungi>mucoromycotina | Lichtheimia hyalospora | estExt_Genewise1.C_100084
Crev1000002253 | N6-MTase | Mnd1+MT-A70 | Crev1000002253 | 412 | eukaryota>fungi>kickxellomycotina | Coemansia reversa | fgenesh1_kg.9_#_19_#_isotig04348
595497236 | N6-MTase | MT-A70 | RirG_018940 | 470 | eukaryota>fungi>glomeromycota | Rhizophagus irregularis DAOM 197198w | Kar4p [Rhizophagus irregularis DAOM 197198w].
595481916 | N6-MTase | MT-A70 | RirG_092820 | 303 | eukaryota>fungi>glomeromycota | Rhizophagus irregularis DAOM 197198w | Ime4p [Rhizophagus irregularis DAOM 197198w].
Ccor1000007140 | N6-MTase | MT-A70 | Ccor1000007140 | 165 | eukaryota>fungi>entomophthoromycota | Conidiobolus coronatus | CE21212_12273
Spun1000000693 | N6-MTase | MT-A70 | Spun1000000693 | 357 | eukaryota>fungi>chytridiomycota | Spizellomyces punctatus | Spizellomyces punctatus DAOM BR117 hypothetical protein (357 aa)
Spun1000006218 | N6-MTase | MT-A70 | Spun1000006218 | 358 | eukaryota>fungi>chytridiomycota | Spizellomyces punctatus | Spizellomyces punctatus DAOM BR117 hypothetical protein (358 aa)
164645185 | N6-MTase | MT-A70 | LACBIDRAFT_319029 | 563 | eukaryota>fungi>basidiomycota | Laccaria bicolor S238N-H82 | predicted protein [Laccaria bicolor S238N-H82].
Wseb1000004329 | N6-MTase | MT-A70 | Wseb1000004329 | 501 | eukaryota>fungi>basidiomycota | Wallemia sebi | estExt_fgenesh1_kg.C_210042
58271296 | N6-MTase | MT-A70 | CNI00700 | 586 | eukaryota>fungi>basidiomycota | Cryptococcus neoformans var. neoformans JEC21 | transcription regulator [Cryptococcus neoformans var. neoformans JEC21].
527293720 | N6-MTase | MT-A70 | FOMPIDRAFT_1134066 | 613 | eukaryota>fungi>basidiomycota | Fomitopsis pinicola FP-58527 SS1 | hypothetical protein FOMPIDRAFT_1134066 [Fomitopsis pinicola FP-58527 SS1].
299741172 | N6-MTase | MT-A70 | CC1G_11190 | 708 | eukaryota>fungi>basidiomycota | Coprinopsis cinerea okayama7#130 | transcription regulator [Coprinopsis cinerea okayama7#130].
220730232 | N6-MTase | MT-A70 | POSPLDRAFT_106379 | 611 | eukaryota>fungi>basidiomycota | Postia placenta Mad-698-R | predicted protein [Postia placenta Mad-698-R].
116504623 | N6-MTase | MT-A70 | CC1G_11190 | 680 | eukaryota>fungi>basidiomycota | Coprinopsis cinerea okayama7#130 | hypothetical protein CC1G_11190 [Coprinopsis cinerea okayama7#130].
Abis1000008599 | N6-MTase | MT-A70 | Abis1000008599 | 649 | eukaryota>fungi>basidiomycota | Agaricus bisporus | Genemark.8250_g
Pbla1000007785 | N6-MTase | MT-A70 | Pbla1000007785 | 331 | eukaryota>fungi>basal | Phycomyces blakesleeanus | fgeneshPB_pg.16__43
Mcir1000003212 | N6-MTase | MT-A70 | Mcir1000003212 | 195 | eukaryota>fungi>basal | Mucor circinelloides | e_gw1.02.942.1
50548917 | N6-MTase | MT-A70 | YALI0C17017g | 311 | eukaryota>fungi>ascomycota | Yarrowia lipolytica CLIB122 | YALI0C17017p [Yarrowia lipolytica CLIB122].
150864497 | N6-MTase | MT-A70 | PICST_35413 | 385 | eukaryota>fungi>ascomycota | Scheffersomyces stipitis CBS 6054 | hypothetical protein PICST_35413 [Scheffersomyces stipitis CBS 6054].
46434912 | N6-MTase | MT-A70 | CaO19.3736 | 369 | eukaryota>fungi>ascomycota | Candida albicans SC5314 | hypothetical protein CaO19.3736 [Candida albicans SC5314].
6319795 | N6-MTase | MT-A70 | YCL055W | 335 | eukaryota>fungi>ascomycota | Saccharomyces cerevisiae S288c | Kar4p [Saccharomyces cerevisiae S288c].
50285123 | N6-MTase | MT-A70 | CAGL0B00462g | 324 | eukaryota>fungi>ascomycota | Candida glabrata CBS 138 | hypothetical protein [Candida glabrata CBS 138].
68485255 | N6-MTase | MT-A70 | CaO19.11221 | 369 | eukaryota>fungi>ascomycota | Candida albicans SC5314 | hypothetical protein CaO19.11221 [Candida albicans SC5314].
50423415 | N6-MTase | MT-A70 | DEHA0E24156g | 402 | eukaryota>fungi>ascomycota | Debaryomyces hansenii CBS767 | hypothetical protein DEHA0E24156g [Debaryomyces hansenii CBS767].
50304541 | N6-MTase | MT-A70 | KLLA0C00693g | 323 | eukaryota>fungi>ascomycota | Kluyveromyces lactis NRRL Y-1140 | hypothetical protein [Kluyveromyces lactis NRRL Y-1140].
45199255 | N6-MTase | MT-A70 | AGOS_AFR736C | 322 | eukaryota>fungi>ascomycota | Eremothecium gossypii ATCC 10895 | AFR736Cp [Eremothecium gossypii ATCC 10895].
384484780 | N6-MTase | MT-A70 | RO3G_01664 | 250 | eukaryota>fungi | Rhizopus delemar RA 99-880 | hypothetical protein RO3G_01664 [Rhizopus delemar RA 99-880].
727145762 | N6-MTase | MT-A70 | RMATCC62417_07548 | 263 | eukaryota>fungi | Rhizopus microsporus | Putative mRNA (2'-O-methyladenosine-N6-)-methyltransferase [Rhizopus microsporus].
Adig1000009633 | N6-MTase | MT-A70+MT-A70 | Adig1000009633 | 214 | eukaryota>cnidaria | Acropora digitifera | adi_v1.06363
514693100 | N6-MTase | MT-A70 | PTSG_04805 | 593 | eukaryota>choanoflagellida | Salpingoeca rosetta | hypothetical protein PTSG_04805 [Salpingoeca rosetta].
Ttra1000005250 | N6-MTase | MT-A70 | Ttra1000005250 | 304 | eukaryota>apusozoa | Thecamonas trahens | Thecamonas trahens ATCC 50062 hypothetical protein (304 aa)
470407427 | N6-MTase | MT-A70 | ACA1_366350 | 289 | eukaryota>amoebozoa>acanthamoebidae | Acanthamoeba castellanii str. Neff | MTA70 family protein [Acanthamoeba castellanii str. Neff].
403359546 | N6-MTase | MT-A70 | OXYTRI_23292 | 520 | eukaryota>alveolata>ciliophora | Oxytricha trifallax | MT-A70 family protein (macronuclear) [Oxytricha trifallax].
145544559 | N6-MTase | MT-A70 | GSPATT00003428001 | 317 | eukaryota>alveolata>ciliophora | Paramecium tetraurelia strain d4-2 | hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
586728217 | N6-MTase | MT-A70 | TTHERM_00704040 | 428 | eukaryota>alveolata>ciliophora | Tetrahymena thermophila SB210 | MT-a70 family protein (macronuclear) [Tetrahymena thermophila SB210].
145474019 | N6-MTase | MT-A70 | GSPATT00000069001 | 251 | eukaryota>alveolata>ciliophora | Paramecium tetraurelia strain d4-2 | hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
145525104 | N6-MTase | MT-A70 | GSPATT00015836001 | 364 | eukaryota>alveolata>ciliophora | Paramecium tetraurelia strain d4-2 | hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
146142755 | N6-MTase | MT-A70 | TTHERM_00704040 | 372 | eukaryota>alveolata>ciliophora | Tetrahymena thermophila SB210 | MT-A70 family protein (macronuclear) [Tetrahymena thermophila SB210].
145499669 | N6-MTase | MT-A70 | GSPATT00037207001 | 319 | eukaryota>alveolata>ciliophora | Paramecium tetraurelia strain d4-2 | hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
403376498 | N6-MTase | MT-A70 | OXYTRI_18855 | 698 | eukaryota>alveolata>ciliophora | Oxytricha trifallax | MT-A70 family protein (macronuclear) [Oxytricha trifallax].
118378397 | N6-MTase | MT-A70 | TTHERM_00558100 | 392 | eukaryota>alveolata>ciliophora | Tetrahymena thermophila SB210 | MT-a70 family protein [Tetrahymena thermophila SB210].
145492063 | N6-MTase | MT-A70 | GSPATT00034109001 | 319 | eukaryota>alveolata>ciliophora | Paramecium tetraurelia strain d4-2 | hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
145476411 | N6-MTase | MT-A70 | GSPATT00027862001 | 364 | eukaryota>alveolata>ciliophora | Paramecium tetraurelia strain d4-2 | hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2].
221488029 | N6-MTase | MT-A70 | TGGT1_107030 | 525 | eukaryota>alveolata>apicomplexa | Toxoplasma gondii GT1 | N6-adenosine-methyltransferase subunit, putative [Toxoplasma gondii GT1].
156086210 | N6-MTase | MT-A70 | BBOV_IV005850 | 448 | eukaryota>alveolata>apicomplexa | Babesia bovis T2Bo | hypothetical protein [Babesia bovis T2Bo].
672286124 | N6-MTase | MT-A70 | TGFOU_268840 | 525 | eukaryota>alveolata>apicomplexa | Toxoplasma gondii FOU | putative N6-adenosine-methyltransferase [Toxoplasma gondii FOU].
514485079 | N6-MTase | MT-A70 | CAOG_04822 | 317 | eukaryota | Capsaspora owczarzaki ATCC 30864 | MT-A70 family protein [Capsaspora owczarzaki ATCC 30864].
320169965 | N6-MTase | MT-A70 | CAOG_04822 | 424 | eukaryota | Capsaspora owczarzaki ATCC 30864 | MT-A70 family protein [Capsaspora owczarzaki ATCC 30864].
# 45; METTL4
224046124 | alpha-helical+N6-MTase | MT-A70 | METTL4 | 481 | eukaryota>metazoa>chordata>vertebrata | Taeniopygia guttata | PREDICTED: methyltransferase-like protein 4 [Taeniopygia guttata].
327269895 | alpha-helical+N6-MTase | MT-A70 | mettl4 | 479 | eukaryota>metazoa>chordata>vertebrata | Anolis carolinensis | PREDICTED: methyltransferase-like protein 4 [Anolis carolinensis].
118086830 | alpha-helical+N6-MTase | MT-A70 | METTL4 | 475 | eukaryota>metazoa>chordata>vertebrata | Gallus gallus | PREDICTED: similar to Methyltransferase like 4 [Gallus gallus].
326917444 | alpha-helical+N6-MTase | MT-A70 | LOC100547828 | 474 | eukaryota>metazoa>chordata>vertebrata | Meleagris gallopavo | PREDICTED: methyltransferase-like protein 4-like [Meleagris gallopavo].
145275206 | alpha-helical+N6-MTase | MT-A70 | METTL4 | 472 | eukaryota>metazoa>chordata>vertebrata | Homo sapiens | methyltransferase-like protein 4 isoform 1 [Homo sapiens].
55647269 | alpha-helical+N6-MTase | MT-A70 | METTL4 | 472 | eukaryota>metazoa>chordata>vertebrata | Pan troglodytes | PREDICTED: methyltransferase-like protein 4 isoform X1 [Pan troglodytes].
109487662 | Nalpha-helical+N6-MTase | MT-A70 | RGD1306451_predicted | 471 | eukaryota>metazoa>chordata>vertebrata | Rattus norvegicus | PREDICTED: similar to methyltransferase like 4 [Rattus norvegicus].
74315949 | alpha-helical+N6-MTase | MT-A70 | Mettl4 | 471 | eukaryota>metazoa>chordata>vertebrata | Mus musculus | methyltransferase-like protein 4 [Mus musculus].
189522093 | alpha-helical+N6-MTase | MT-A70 | mettl4 | 450 | eukaryota>metazoa>chordata>vertebrata>actinopterygii | Danio rerio | PREDICTED: methyltransferase-like protein 4 isoform X1 [Danio rerio].
47217445 | alpha-helical+N6-MTase | MT-A70 | GSTEN:00031779:G:001 | 308 | eukaryota>metazoa>chordata>vertebrata>actinopterygii | Tetraodon nigroviridis | unnamed protein product, partial [Tetraodon nigroviridis].
340370562 | alpha-helical+N6-MTase | MT-A70+MT-A70 | LOC100633541 | 403 | eukaryota>metazoa | Amphimedon queenslandica | PREDICTED: methyltransferase-like protein 4 [Amphimedon queenslandica].
190582266 | alpha-helical+N6-MTase | MT-A70 | TRIADDRAFT_58838 | 331 | eukaryota>metazoa>placozoa | Trichoplax adhaerens | hypothetical protein TRIADDRAFT_58838 [Trichoplax adhaerens].
162690420 | N6-MTase | MT-A70 | PHYPADRAFT_206270 | 428 | eukaryota>viridiplantae | Physcomitrella patens | predicted protein [Physcomitrella patens].
168011388 | N6-MTase | MT-A70 | PHYPADRAFT_206270 | 428 | eukaryota>viridiplantae | Physcomitrella patens | predicted protein [Physcomitrella patens].
Mver1000006359 N6-MTase Methyltransf_26+MT-A70 Mver1000006359 466 eukaryota>fungi>zygomycete Mortierella verticillata Mortierella verticillata NRRL 6337 hypothetical protein (466 aa) Spun1000004502 N6-MTase MT-A70 Spun1000004502 418 eukaryota>fungi>chytridiomycota Spizellomyces punctatus Spizellomyces punctatus DAOM BR117 hypothetical protein (418 aa) 18394726 N6-MTase MT-A70 AT1G19340 414 eukaryota>viridiplantae Arabidopsis thaliana methyltransferase-like protein 2 [Arabidopsis thaliana]. Lhya1000004692 N6-MTase MT-A70 Lhya1000004692 398 eukaryota>fungi>mucoromycotina Lichtheimia hyalospora estExt_Genemark1.C_470043 198422905 N6-MTase MT-A70 LOC100186432 385 eukaryota>metazoa>chordata Ciona intestinalis PREDICTED: methyltransferase-like protein 4 [Ciona intestinalis]. 307172265 N6-MTase MT-A70 EAG_10107 385 eukaryota>metazoa>hexapoda Camponotus floridanus Methyltransferase-like protein 4 [Camponotus floridanus]. 727142779 N6-MTase MT-A70 RMATCC62417_10014 371 eukaryota>fungi Rhizopus microsporus hypothetical protein RMATCC62417_10014 [Rhizopus microsporus]. Bcir1000011661 N6-MTase Methyltransf_26+MT-A70 Bcir1000011661 371 eukaryota>fungi>mucoromycotina Backusella circina fgenesh1_pg.4_#_30 161078350 N6-MTase MT-A70 Dmel_CG14906 359 eukaryota>metazoa>hexapoda Drosophila melanogaster CG14906 [Drosophila melanogaster]. 24647514 N6-MTase MT-A70 Dmel_CG14906 359 eukaryota>metazoa>hexapoda Drosophila melanogaster CG14906 [Drosophila melanogaster]. 322796786 N6-MTase MT-A70 SINV_06005 356 eukaryota>metazoa>hexapoda Solenopsis invicta hypothetical protein SINV_06005, partial [Solenopsis invicta]. 221118051 N6-MTase MT-A70 LOC100203523 355 eukaryota>metazoa>cnidaria Hydra magnipapillata PREDICTED: similar to Methyltransferase-like protein 4 [Hydra magnipapillata]. 46123695 N6-MTase MT-A70 FG06225.1 333 eukaryota>fungi>ascomycota Fusarium graminearum PH-1 hypothetical protein FG06225.1 [Fusarium graminearum PH-1]. 
Ccor1000001300 N6-MTase Methyltransf_26+MT-A70 Ccor1000001300 308 eukaryota>fungi>entomophthoromycota Conidiobolus coronatus gm1.1428_g 384493544 N6-MTase MT-A70 RO3G_08740 305 eukaryota>fungi Rhizopus delemar RA 99-880 hypothetical protein RO3G_08740 [Rhizopus delemar RA 99-880]. 189238810 N6-MTase MT-A70 LOC655139 288 eukaryota>metazoa>hexapoda Tribolium castaneum PREDICTED: similar to Methyltransferase-like protein 4, partial [Tribolium castaneum]. 307197295 N6-MTase MT-A70 EAI_06255 287 eukaryota>metazoa>hexapoda Harpegnathos saltator Methyltransferase-like protein 4 [Harpegnathos saltator]. 210094981 N6-MTase MT-A70 BRAFLDRAFT_237642 244 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_237642, partial [Branchiostoma floridae]. Hrob1000005904 N6-MTase MT-A70 Hrob1000005904 243 eukaryota>metazoa>annelida Helobdella robusta 69400 Caps1000003296 N6-MTase MT-A70 Caps1000003296 236 eukaryota>metazoa>annelida Capitella spI e_gw1.295.23.1 156223537 N6-MTase MT-A70 NEMVEDRAFT_v1g95490 230 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein, partial [Nematostella vectensis]. 156393637 N6-MTase MT-A70 NEMVEDRAFT_v1g95490 230 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein, partial [Nematostella vectensis]. 158293422 N6-MTase MT-A70 AgaP_AGAP008665 225 eukaryota>metazoa>hexapoda Anopheles gambiae str. PEST AGAP008665-PA, partial [Anopheles gambiae str. PEST]. 321477348 N6-MTase MT-A70 DAPPUDRAFT_25900 216 eukaryota>metazoa>crustacea Daphnia pulex hypothetical protein DAPPUDRAFT_25900, partial [Daphnia pulex]. Lgig1000004573 N6-MTase MT-A70 Lgig1000004573 215 eukaryota>metazoa>mollusca Lottia gigantea e_gw1.35.28.1 302799717 N6-MTase MT-A70 SELMODRAFT_114834 207 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_114834, partial [Selaginella moellendorffii]. 
Uram1000004450 N6-MTase MT-A70 Uram1000004450 208 eukaryota>fungi>mucoromycotina Umbelopsis ramanniana fgenesh1_kg.22_#_168_#_combest_scaffold_22_46365 545366117 N6-MTase MT-A70 COCSUDRAFT_36424 197 eukaryota>viridiplantae>chlorophyta Coccomyxa subellipsoidea C-169 MT-A70 [Coccomyxa subellipsoidea C-169]. 470510758 N6-MTase MT-A70 ACA1_149840 196 eukaryota>amoebozoa>acanthamoebidae Acanthamoeba castellanii str. Neff MTA70 family [Acanthamoeba castellanii str. Neff]. Mcir1000000489 N6-MTase MT-A70 Mcir1000000489 195 eukaryota>fungi>basal Mucor circinelloides Mucci1.e_gw1.1.979.1 302759497 N6-MTase MT-A70 SELMODRAFT_80323 190 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_80323, partial [Selaginella moellendorffii]. Pbla1000000301 N6-MTase MT-A70 Pbla1000000301 185 eukaryota>fungi>basal Phycomyces blakesleeanus gw1.70.18.1 307104593 N6-MTase MT-A70 CHLNCDRAFT_36694 164 eukaryota>viridiplantae>chlorophyta Chlorella variabilis hypothetical protein CHLNCDRAFT_36694 [Chlorella variabilis]. # 11; 403348598 N6-MTase+ZZ+ZZ+ZZ+ZZ MT-A70+ZZ+ZZ+ZZ+ZZ OXYTRI_05013 822 eukaryota>alveolata>ciliophora Oxytricha trifallax MT-A70 family protein (macronuclear) [Oxytricha trifallax]. 145473723 N6-MTase+ZZ+ZZ+ZZ+ZZ MT-A70+ZZ+ZZ+ZZ GSPATT00027481001 712 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2]. 145486788 N6-MTase+ZZ+ZZ+ZZ+ZZ MT-A70+ZZ+ZZ+ZZ GSPATT00032234001 711 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2]. 145532196 N6-MTase+ZZ+ZZ+ZZ+ZZ MT-A70+ZZ+ZZ+ZZ GSPATT00018667001 707 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2]. 
544211235 N6-MTase+ZZ+ZZ+ZZ+ZZ MT-A70+ZZ+ZZ+ZZ CYME_CMH026C 626 eukaryota>rhodophyta Cyanidioschyzon merolae strain 10D similar to (N6-adenosine)-methyltransferase [Cyanidioschyzon merolae strain 10D]. Cmer1000000996 N6-MTase+ZZ+ZZ+ZZ+ZZ MT-A70+ZZ+ZZ+ZZ Cmer1000000996 627 eukaryota>rhodophyta Cyanidioschyzon merolae CMH026C similar to (N6-adenosine)-methyltransferase 545702233 N6-MTase+ZZ+ZZ+ZZ MT-A70+ZZ+ZZ+ZZ Gasu_54970 601 eukaryota>rhodophyta Galdieria sulphuraria mRNA (2'-O-methyladenosine-N6-)-methyltransferase [Galdieria sulphuraria]. 284095325 N6-MTase+ZZ+ZZ+ZZ MT-A70+ZZ+ZZ NAEGRDRAFT_30463 473 eukaryota>heterolobosea Naegleria gruberi predicted protein [Naegleria gruberi]. 290998263 N6-MTase+ZZ+ZZ+ZZ MT-A70+ZZ+ZZ NAEGRDRAFT_30463 473 eukaryota>heterolobosea Naegleria gruberi strain NEG-M predicted protein [Naegleria gruberi strain NEG-M]. 145512105 N6-MTase MT-A70 GSPATT00039029001 454 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein, partial (macronuclear) [Paramecium tetraurelia strain d4-2]. 145493475 N6-MTase+ZZ+ZZ+ZZ+ZZ MT-A70+ZZ+ZZ GSPATT00034815001 424 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2]. 146144702 N6-MTase+ZZ+ZZ MT-A70 TTHERM_00136470 540 eukaryota>alveolata>ciliophora Tetrahymena thermophila SB210 MT-A70 family protein (macronuclear) [Tetrahymena thermophila SB210]. 146175568 N6-MTase+ZZ+ZZ MT-A70 TTHERM_00136470 540 eukaryota>alveolata>ciliophora Tetrahymena thermophila MT-A70 family protein (macronuclear) [Tetrahymena thermophila]. Ttra1000006356 N6-MTase+ZZ+ZZ MT-A70+ZZ+ZZ Ttra1000006356 1237 eukaryota>apusozoa Thecamonas trahens Thecamonas trahens ATCC 50062 hypothetical protein (1237 aa) 470518935 N6-MTase+ZZ+ZZ+ZZ+ZZ MT-A70+ZZ+ZZ ACA1_074420 1067 eukaryota>amoebozoa>acanthamoebidae Acanthamoeba castellanii str. Neff Putative N6adenosine-methyltransferase [Acanthamoeba castellanii str. Neff]. 
89301120 N6-MTase+ZZ+ZZ+ZZ+ZZ MT-A70+ZZ+ZZ TTHERM_00388490 2070 eukaryota>alveolata>ciliophora Tetrahymena thermophila SB210 MT-A70 family protein (macronuclear) [Tetrahymena thermophila SB210]. # 5; 238665683 N6-MTase+BIR MT-A70+BIR Smp_172190.1 779 eukaryota>metazoa Schistosoma mansoni expressed protein [Schistosoma mansoni]. 238665684 N6-MTase MT-A70 Smp_172190.2 629 eukaryota>metazoa Schistosoma mansoni expressed protein [Schistosoma mansoni]. 674260189 N6-MTase MT-A70 EmuJ_000931900 618 eukaryota>metazoa Echinococcus multilocularis methyltransferase protein 14 [Echinococcus multilocularis]. 674562007 N6-MTase MT-A70 EgrG_000931900 618 eukaryota>metazoa Echinococcus granulosus methyltransferase protein 14 [Echinococcus granulosus]. 674588329 N6-MTase MT-A70 HmN_000449900 612 eukaryota>metazoa Hymenolepis microstoma methyltransferase protein 14 [Hymenolepis microstoma]. # 4; 693499469 N6-MTase MT-A70 OT_ostta06g01320 439 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri MT-A70-like [Ostreococcus tauri]. 255076305 N6-MTase - MICPUN_108123 323 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. 116058220 N6-MTase MT-A70 Ot06g01500 270 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri methyltransferase MT-A70, putative (ISS) [Ostreococcus tauri]. 303279795 N6-MTase - MICPUCDRAFT_58792 264 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. # 4; 321445585 N6-MTase MT-A70 DAPPUDRAFT_70851 101 eukaryota>metazoa>crustacea Daphnia pulex hypothetical protein DAPPUDRAFT_70851, partial [Daphnia pulex]. 115803226 N6-MTase MT-A70 LOC594744 96 eukaryota>metazoa>echinodermata Strongylocentrotus purpuratus PREDICTED: similar to Methyltransferase like 3, partial [Strongylocentrotus purpuratus]. 
Hrob1000012694 N6-MTase MT-A70 Hrob1000012694 75 eukaryota>metazoa>annelida Helobdella robusta 153237 321445586 N6-MTase MT-A70 DAPPUDRAFT_70850 61 eukaryota>metazoa>crustacea Daphnia pulex hypothetical protein DAPPUDRAFT_70850, partial [Daphnia pulex]. # 13; Eukaryotic subclade-6-related 672827354 N6-MTase MT-A70 MVEG_02535 470 eukaryota>fungi Mortierella verticillata NRRL 6337 hypothetical protein MVEG_02535 [Mortierella verticillata NRRL 6337]. Mver1000002542 N6-MTase MT-A70 Mver1000002542 471 eukaryota>fungi>zygomycete Mortierella verticillata Mortierella verticillata NRRL 6337 hypothetical protein (471 aa) Uram1000004716 N6-MTase MT-A70 Uram1000004716 457 eukaryota>fungi>mucoromycotina Umbelopsis ramanniana fgenesh1_kg.24_#_143_#_combest_scaffold_24_50604 Pbla1000012755 N6-MTase MT-A70 Pbla1000012755 454 eukaryota>fungi>basal Phycomyces blakesleeanus estExt_fgeneshPB_pg.C_40189 Spun1000005457 N6-MTase MT-A70 Spun1000005457 454 eukaryota>fungi>chytridiomycota Spizellomyces punctatus Spizellomyces punctatus DAOM BR117 hypothetical protein (454 aa) Lhya1000007007 N6-MTase MT-A70 Lhya1000007007 441 eukaryota>fungi>mucoromycotina Lichtheimia hyalospora estExt_Genewise1Plus.C_880061 727148790 N6-MTase GAD+MT-A70 RMATCC62417_05354 435 eukaryota>fungi Rhizopus microsporus hypothetical protein RMATCC62417_05354 [Rhizopus microsporus]. Mcir1000010337 N6-MTase MT-A70 Mcir1000010337 436 eukaryota>fungi>basal Mucor circinelloides fgenesh1_kg.09_#_17_#_987_1_CCIA_CCIB_EXTA Bcir1000006779 N6-MTase MT-A70 Bcir1000006779 435 eukaryota>fungi>mucoromycotina Backusella circina estExt_Genewise1.C_720022 384502026 N6-MTase MT-A70 RO3G_17115 426 eukaryota>fungi Rhizopus delemar RA 99-880 hypothetical protein RO3G_17115 [Rhizopus delemar RA 99-880]. 552937933 N6-MTase MT-A70 GLOINDRAFT_123982 426 eukaryota>fungi>glomeromycota Rhizophagus irregularis DAOM 181602 hypothetical protein GLOINDRAFT_123982 [Rhizophagus irregularis DAOM 181602]. 
Crev1000000518 N6-MTase MT-A70 Crev1000000518 423 eukaryota>fungi>kickxellomycotina Coemansia reversa e_gw1.2.97.1 470419934 N6-MTase MT-A70 ACA1_219460 387 eukaryota>amoebozoa>acanthamoebidae Acanthamoeba castellanii str. Neff MT-A70 protein [Acanthamoeba castellanii str. Neff]. Uram1000001485 N6-MTase MT-A70 Uram1000001485 829 eukaryota>fungi>mucoromycotina Umbelopsis ramanniana fgenesh1_kg.5_#_767_#_combest_scaffold_5_103034 Mver1000001135 N6-MTase MT-A70 Mver1000001135 786 eukaryota>fungi>zygomycete Mortierella verticillata Mortierella verticillata NRRL 6337 hypothetical protein (786 aa) Pbla1000005949 MYB+N6-MTase MT-A70 Pbla1000005949 761 eukaryota>fungi>basal Phycomyces blakesleeanus fgeneshPB_pg.6__310 Lhya1000007505 N6-MTase - Lhya1000007505 198 eukaryota>fungi>mucoromycotina Lichtheimia hyalospora e_gw1.100.12.1 Pisp1000009414 N6-MTase MT-A70 Pisp1000009414 197 eukaryota>fungi>neocallimastigomycota Piromyces sp gm1.11659_g 384494112 N6-MTase - RO3G_09313 164 eukaryota>fungi Rhizopus delemar RA 99-880 hypothetical protein RO3G_09313 [Rhizopus delemar RA 99-880]. Bcir1000007777 N6-MTase - Bcir1000007777 155 eukaryota>fungi>mucoromycotina Backusella circina fgenesh1_pm.3_#_58 # 2; 326428930 N6-MTase MT-A70+MT-A70 PTSG_05864 555 eukaryota>choanoflagellida Salpingoeca rosetta hypothetical protein PTSG_05864 [Salpingoeca rosetta]. 514690366 N6-MTase MT-A70+MT-A70 PTSG_05864 555 eukaryota>choanoflagellida Salpingoeca rosetta hypothetical protein PTSG_05864 [Salpingoeca rosetta]. # 2; 67901006 N6-MTase MT-A70 AN7490.2 546 eukaryota>fungi>ascomycota Aspergillus nidulans FGSC A4 hypothetical protein AN7490.2 [Aspergillus nidulans FGSC A4]. 70989679 N6-MTase MT-A70 AFUA_2G05600 455 eukaryota>fungi>ascomycota Aspergillus fumigatus Af293 MT-A70 family [Aspergillus fumigatus Af293]. # 2; 320165059 N6-MTase MT-A70 CAOG_07090 511 eukaryota Capsaspora owczarzaki ATCC 30864 hypothetical protein CAOG_07090 [Capsaspora owczarzaki ATCC 30864]. 
470293128 N6-MTase MT-A70 CAOG_07090 511 eukaryota Capsaspora owczarzaki ATCC 30864 hypothetical protein CAOG_07090 [Capsaspora owczarzaki ATCC 30864]. # 2; Chet1000009456 N6-MTase MT-A70 Chet1000009456 494 eukaryota>fungi>ascomycota Cochliobolus heterostrophus estExt_fgenesh1_pg.C_390006 111064233 N6-MTase MT-A70 SNOG_06702 472 eukaryota>fungi>ascomycota Phaeosphaeria nodorum SN15 hypothetical protein SNOG_06702 [Phaeosphaeria nodorum SN15]. # 2; 678336696 N6-MTase - STYLEM_4843 437 eukaryota>alveolata>ciliophora Stylonychia lemnae methyltransferase mt- [Stylonychia lemnae]. 403331225 N6-MTase MT-A70 OXYTRI_15421 384 eukaryota>alveolata>ciliophora Oxytricha trifallax methyltransferase MT-A70, putative (ISS) (macronuclear) [Oxytricha trifallax]. # 2; 671410008 N6-MTase MT-A70 BM_Bm2284d 398 eukaryota>metazoa>nematoda Brugia malayi Protein Bm2284, isoform d [Brugia malayi]. 170590806 N6-MTase MT-A70 Bm1_43505 338 eukaryota>metazoa>nematoda Brugia malayi MT-A70 family protein [Brugia malayi]. # 2; 123486533 N6-MTase - TVAG_138980 366 eukaryota>parabasalia Trichomonas vaginalis G3 hypothetical protein [Trichomonas vaginalis G3]. 123473707 N6-MTase - TVAG_312160 365 eukaryota>parabasalia Trichomonas vaginalis G3 hypothetical protein [Trichomonas vaginalis G3]. # 2; Adig1000003645 N6-MTase MT-A70+MT-A70 Adig1000003645 320 eukaryota>cnidaria Acropora digitifera adi_v1.21218 Adig1000021966 N6-MTase MT-A70+MT-A70 Adig1000021966 320 eukaryota>cnidaria Acropora digitifera adi_v1.16957 # 2; 485631354 N6-MTase MT-A70 EMIHUDRAFT_205550 311 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_205550 [Emiliania huxleyi CCMP1516]. 551588467 N6-MTase MT-A70 EMIHUDRAFT_205550 311 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_205550 [Emiliania huxleyi CCMP1516]. 
# 2; 145487402 N6-MTase - GSPATT00005554001 308 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2]. 145546436 N6-MTase - GSPATT00024232001 302 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2]. # 2; 115915952 N6-MTase MT-A70 LOC579669 237 eukaryota>metazoa>echinodermata Strongylocentrotus purpuratus PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus]. 115974039 N6-MTase MT-A70 LOC579669 196 eukaryota>metazoa>echinodermata Strongylocentrotus purpuratus PREDICTED: hypothetical protein, partial [Strongylocentrotus purpuratus]. # 2; 156201156 N6-MTase MT-A70 NEMVEDRAFT_v1g224635 197 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein, partial [Nematostella vectensis]. 156328704 N6-MTase MT-A70 NEMVEDRAFT_v1g224635 197 eukaryota>metazoa>cnidaria Nematostella vectensis hypothetical protein NEMVEDRAFT_v1g224635, partial [Nematostella vectensis]. # 2; 674560581 N6-MTase MT-A70 EgrG_000328700 69 eukaryota>metazoa Echinococcus granulosus n6 adenosine methyltransferase ime4 [Echinococcus granulosus]. 674580335 N6-MTase MT-A70 EmuJ_000328700 69 eukaryota>metazoa Echinococcus multilocularis n6 adenosine methyltransferase ime4 [Echinococcus multilocularis]. # 1; 299472858 N6-MTase+SWIB SWIB Esi_0052_0135 513 eukaryota>stramenopiles Ectocarpus siliculosus EsV-1-129 [Ectocarpus siliculosus]. # 1; 323453071 N6-MTase+ANK RCC1_2+Ank_2+Ank_2+MT-A70 AURANDRAFT_63473 2507 eukaryota>stramenopiles Aureococcus anophagefferens hypothetical protein AURANDRAFT_63473 [Aureococcus anophagefferens]. 300257334 N6-MTase MT-A70 VOLCADRAFT_98443 1121 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_98443 [Volvox carteri f. nagariensis]. 
145340055 N6-MTase MT-A70 AT4G09980 963 eukaryota>viridiplantae Arabidopsis thaliana methyltransferase-like protein 1 [Arabidopsis thaliana]. Pram1000002112 N6-MTase Glyco_hydro_1+MT-A70 Pram1000002112 930 eukaryota>stramenopiles Phytophthora ramorum 82533 307215508 N6-MTase MT-A70 EAI_03217 902 eukaryota>metazoa>hexapoda Harpegnathos saltator Methyltransferase-like protein KIAA1627-like protein [Harpegnathos saltator]. 124806530 N6-MTase MT-A70 PFL1715w 646 eukaryota>alveolata>apicomplexa Plasmodium falciparum 3D7 mRNA methyltransferase, putative [Plasmodium falciparum 3D7]. Smar1000010256 N6-MTase+DUF572 DUF572+MT-A70 Smar1000010256 640 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR004352-PA pep:novel scaffold:Smar1:JH431477:154003:157599:1 gene:SMAR004352 transcript:SMAR004352-RA 71013063 N6-MTase MT-A70 UM02405.1 630 eukaryota>fungi>basidiomycota Ustilago maydis 521 hypothetical protein UM02405.1 [Ustilago maydis 521]. Cmer1000000469 N6-MTase - Cmer1000000469 628 eukaryota>rhodophyta Cyanidioschyzon merolae CMD131C hypothetical protein 403366155 N6-MTase - OXYTRI_19511 588 eukaryota>alveolata>ciliophora Oxytricha trifallax methyltransferase MT-A70, putative (ISS) (macronuclear) [Oxytricha trifallax]. Ttra1000008696 N6-MTase MT-A70 Ttra1000008696 556 eukaryota>apusozoa Thecamonas trahens Thecamonas trahens ATCC 50062 hypothetical protein (556 aa) 307108462 N6-MTase - CHLNCDRAFT_144074 531 eukaryota>viridiplantae>chlorophyta Chlorella variabilis hypothetical protein CHLNCDRAFT_144074 [Chlorella variabilis]. 156207497 N6-MTase MT-A70 NEMVEDRAFT_v1g221917 497 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein, partial [Nematostella vectensis]. 89306372 N6-MTase - TTHERM_00301770 475 eukaryota>alveolata>ciliophora Tetrahymena thermophila SB210 hypothetical protein TTHERM_00301770 (macronuclear) [Tetrahymena thermophila SB210]. 
Ccor1000008813 N6-MTase MT-A70 Ccor1000008813 469 eukaryota>fungi>entomophthoromycota Conidiobolus coronatus estExt_fgenesh1_pg.C_3190005 545711291 N6-MTase MT-A70 Gasu_13930 449 eukaryota>rhodophyta Galdieria sulphuraria methyltransferase [Galdieria sulphuraria]. 470390089 N6-MTase MT-A70 ACA1_156200 440 eukaryota>amoebozoa>acanthamoebidae Acanthamoeba castellanii str. Neff MT-A70 protein [Acanthamoeba castellanii str. Neff]. 159480906 N6-MTase MT-A70 CHLREDRAFT_168079 430 eukaryota>viridiplantae>chlorophyta Chlamydomonas reinhardtii predicted protein, partial [Chlamydomonas reinhardtii]. 19113968 N6-MTase MT-A70 SPAC22G7.07c 413 eukaryota>fungi>ascomycota Schizosaccharomyces pombe 972h- mRNA (N6-adenosine)-methyltransferase (predicted) [Schizosaccharomyces pombe 972h-]. 570995458 N6-MTase MT-A70 F442_02656 382 eukaryota>stramenopiles Phytophthora parasitica P10297 hypothetical protein F442_02656 [Phytophthora parasitica P10297]. 17531953 N6-MTase MT-A70 CELE_C18A3.1 365 eukaryota>metazoa>nematoda Caenorhabditis elegans C18A3.1 [Caenorhabditis elegans]. 118354144 N6-MTase - TTHERM_01005150 342 eukaryota>alveolata>ciliophora Tetrahymena thermophila hypothetical protein TTHERM_01005150 (macronuclear) [Tetrahymena thermophila]. 313234377 N6-MTase MT-A70 GSOID_T00012466001 333 eukaryota>metazoa>chordata Oikopleura dioica unnamed protein product [Oikopleura dioica]. 85108254 N6-MTase MT-A70 NCU08328 322 eukaryota>fungi>ascomycota Neurospora crassa OR74A hypothetical protein NCU08328 [Neurospora crassa OR74A]. 641530415 N6-MTase MT-A70 SPRG_10355 286 eukaryota>stramenopiles Saprolegnia parasitica CBS 223.65 hypothetical protein SPRG_10355 [Saprolegnia parasitica CBS 223.65]. Aque1000011545 N6-MTase MT-A70 Aque1000011545 282 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.211813 307107027 N6-MTase - CHLNCDRAFT_134188 278 eukaryota>viridiplantae>chlorophyta Chlorella variabilis hypothetical protein CHLNCDRAFT_134188 [Chlorella variabilis]. 
145512411 N6-MTase - GSPATT00010893001 275 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2]. 569429466 N6-MTase MT-A70 RFI_04550 237 eukaryota>rhizaria Reticulomyxa filosa hypothetical protein RFI_04550, partial [Reticulomyxa filosa]. Psoj1000016503 N6-MTase MT-A70 Psoj1000016503 230 eukaryota>stramenopiles Phytophthora sojae 144700 Bden1000006054 N6-MTase MT-A70 Bden1000006054 183 eukaryota>fungi>chytridiomycota Batrachochytrium dendrobatidis Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (183 aa) Bnat1000020351 N6-MTase MT-A70 Bnat1000020351 144 eukaryota>rhizaria Bigelowiella natans fgenesh1_pg.85_#_83 Hrob1000005324 N6-MTase MT-A70+MT-A70 Hrob1000005324 142 eukaryota>metazoa>annelida Helobdella robusta 164465 159464034 N6-MTase MT-A70 CHLREDRAFT_146896 122 eukaryota>viridiplantae>chlorophyta Chlamydomonas reinhardtii predicted protein, partial [Chlamydomonas reinhardtii]. 115759069 N6-MTase MT-A70 LOC757579 49 eukaryota>metazoa>echinodermata Strongylocentrotus purpuratus PREDICTED: similar to n6-adenosine-methyltransferase ime4, partial [Strongylocentrotus purpuratus]. Prokaryotic homologs GI Operons Arch Pfam-Arch Gene name len phylogeny Species Genbank descriptions #; BglII-/REase-associated 503247195 N6-MTase*-><-MunI N6-MTase MT-A70 CENSYa_0595 216 archaea Cenarchaeum symbiosum transcriptional regulator [Cenarchaeum symbiosum]. <-118575782_?<-118575783_?<-118575784_?||118575785_?->118575786_?->118575787_?-><-118575788_?||503247195_N6-MTase*-><-118575790_MunI||118575791_?->118575792_?->118575793_?-><-118575794_?||118575795_?-><-118575796_? 553802969 BglII-><-N6-MTase* N6-MTase MT-A70 HMPREF0742_RS10030 283 bacteria>actinobacteria Rothia aeria MT-A70 protein [Rothia aeria]. 
<-739427288_?<-553802963_?<-553802964_?<-553802965_?||553802966_?->553802967_?->553802968_BglII-><-553802969_N6-MTase*<-553802970_?<-553802972_?||553802973_?->553802974_?-><-739427273_?<-553802977_?||739427276_?-> 503250901 BglII->N6-MTase*-> N6-MTase MT-A70 ETHHA_RS08455 222 bacteria>firmicutes Ethanoligenens harbinense S-adenosylmethionine-binding protein [Ethanoligenens harbinense]. 503250895_?->503250896_?-><-503250897_?<-503249903_?<-754032519_?<-503249901_?||503250900_BglII->503250901_N6-MTase*-><-503250902_?<-503250904_?<-503250905_?<-503250906_?||503250907_?-><-754031295_?||754032521_?-> 653315751 BglII-><-N6-MTase*||?->?-><-?||?->?->?->ParB-> N6-MTase MT-A70 T424_RS0114345 220 bacteria>proteobacteria>alphaproteobacteria Rhizobium undicola S-adenosylmethionine-binding protein [Rhizobium undicola]. 739204316_?->739204347_?->653315740_?->653315743_?->653315745_?->739204348_?->653315749_BglII-><-653315751_N6-MTase*||653315753_?->739204349_?-><-653315755_?||653315757_?->653315759_?->653315761_?->653315763_ParB-> 586601520 <-N6-MTase*<-BglII<-?<-?||?->HNH-> N6-MTase MT-A70 GbCGDNIH3_7033 215 bacteria>proteobacteria>alphaproteobacteria Granulibacter bethesdensis CGDNIH3 Adenine-specific methyltransferase [Granulibacter bethesdensis CGDNIH3]. 586601513_?->586601514_?->586601515_?->586601516_?->586601517_?->586601518_?-><-586601519_?<-586601520_N6-MTase*<-586601521_BglII<-586601522_?<-586601523_?||586601524_?->586601525_HNH->586601526_?-><-586601527_? 661268459 N6-MTase*-><-McrB-NTD+REase N6-MTase MT-A70 K291_RS0125225 189 bacteria>proteobacteria>alphaproteobacteria Ensifer sp. USDA 6670 MT-A70 family protein [Ensifer sp. USDA 6670]. 
696510181_?-><-661268448_?<-661268450_?||661268452_?->661268455_?->696510167_?->661268457_?->661268459_N6-MTase*-><-661268460_McrB-NTD+REase||661268462_?->661268464_?-><-661268618_?||661268466_?->696510168_?->661268470_?-> 515934135 BglII->N6-MTase*-> N6-MTase MT-A70 H156_RS0101780 216 bacteria>proteobacteria>gammaproteobacteria Methylococcus capsulatus S-adenosylmethionine-binding protein [Methylococcus capsulatus]. 651602654_?->651602655_?->515934136_BglII->515934135_N6-MTase*->515934134_?->499262072_?->515934133_?->499262074_?->499262075_?->515934131_?->499262077_?-> 653894579 ASCH->?->?->?->?->BglII->N6-MTase*-> N6-MTase MT-A70 F820_RS0109105 230 bacteria>proteobacteria>gammaproteobacteria Xylella fastidiosa S-adenosylmethionine-binding protein [Xylella fastidiosa]. 653894443_?->490185352_ASCH->490185644_?->544952717_?->490185356_?->490185357_?->653894557_BglII->653894579_N6-MTase*-><-740452319_?<-653894719_?<-653894722_? # 27; DCM/DAM associated 657200356 REase-><-?<-?<-N6-MTase*<-?<-DCM+DCM N6-MTase MT-A70 ON05_RS35435 186 bacteria>cyanobacteria Acaryochloris sp. CCMEE 5410 hypothetical protein [Acaryochloris sp. CCMEE 5410]. <-498167725_?||498167728_REase-><-748211640_?<-748211643_?<-657200356_N6-MTase*<-498167740_?<-498167742_DCM+DCM<-657200357_?<-498167747_?<-498167748_? 667917580 <-N6-MTase*<-DCM N6-MTase MT-A70 EL18_01388 190 bacteria>proteobacteria>alphaproteobacteria Nitratireductor basaltis Adenine-specific methyltransferase [Nitratireductor basaltis]. <-667917573_?<-667917574_?<-667917575_?<-667917576_?<-667917577_?<-667917578_?<-667917579_?<-667917580_N6-MTase*<-667917581_DCM<-667917582_?<-667917583_?||667917584_?-><-667917585_?<-667917586_?<-667917587_? 515104987 <-N6-MTase*<-Methylase<-?<-DCM N6-MTase MT-A70 RPHASCH2410_RS0109895 193 bacteria>proteobacteria>alphaproteobacteria Rhizobium phaseoli DNA methyltransferase [Rhizobium phaseoli]. 
<-515104976_?<-748177413_?<-515104978_?<-515104979_?<-515104981_?<-515104982_?<-515104984_?<-515104987_N6-MTase*<-515104989_Methylase<-515104990_?<-515104991_DCM<-657763447_?||515104993_?-><-657763450_?<-515104997_? 501064335 N6-MTase*->Methylase-> N6-MTase MT-A70 XAUT_RS18300 194 bacteria>proteobacteria>alphaproteobacteria Xanthobacter autotrophicus MT-A70 family protein [Xanthobacter autotrophicus]. <-753820203_?||501064331_?->501064332_?-><-753820205_?<-753820207_?||501064334_?->753820214_?->501064335_N6-MTase*->501064336_Methylase->753820216_?->753820218_?->501064339_?-><-753820221_?||753820227_?->501064341_?-> 674766351 DCM+DCM->N6-MTase*-><-?||?->?->?->?->Terminase_LS->Terminase_LS-> N6-MTase MT-A70 JP75_07920 190 bacteria>proteobacteria>alphaproteobacteria Devosia riboflavina DNA methyltransferase [Devosia riboflavina]. <-674766344_?||674766345_?-><-674766346_?||674766347_?->674766348_?->674766349_?->674766350_DCM+DCM->674766351_N6-MTase*-><-674766352_?||674766353_?->674766354_?->674766355_?->674766356_?->674766357_Terminase_LS->674766358_Terminase_LS-> # 27; 511283520 - N6-MTase MT-A70 279 bacteria>actinobacteria Mycobacterium abscessus adenine-specific DNA methyltransferase [Mycobacterium abscessus]. 496677811 - N6-MTase MT-A70 208 bacteria>firmicutes Lachnospiraceae bacterium 3_1_46FAA DNA methyltransferase [Lachnospiraceae bacterium 3_1_46FAA]. 636828153 RuvC->?->?->N6-MTase*-> N6-MTase MT-A70 H556_RS0109535 199 bacteria>proteobacteria>alphaproteobacteria Brevundimonas naejangsanensis hypothetical protein [Brevundimonas naejangsanensis]. <-737320074_?||636828147_?->636828148_?->636828149_?->636828150_RuvC->658447872_?->636828152_?->636828153_N6-MTase*->737320076_?-> 544667667 - N6-MTase MT-A70 212 bacteria>proteobacteria>alphaproteobacteria Thalassobacter arenae type II DNA modification methyltransferase [Thalassobacter arenae]. 496698392 - N6-MTase MT-A70 212 bacteria>proteobacteria>alphaproteobacteria Afipia sp. 
1NLS2 S-adenosylmethionine-binding protein [Afipia sp. 1NLS2]. 495609045 - N6-MTase MT-A70 217 bacteria>proteobacteria>alphaproteobacteria Maritimibacter alkaliphilus DNA methyltransferase [Maritimibacter alkaliphilus]. 516541036 ParB->?->N6-MTase*->?->VRR-NUC->VRR-NUC-> N6-MTase MT-A70 LOKHON_RS09140 173 bacteria>proteobacteria>alphaproteobacteria Loktanella hongkongensis hypothetical protein [Loktanella hongkongensis]. <-516541043_?<-702932437_?||702932435_?->516541040_?->516541039_?->702932433_ParB->516541037_?->516541036_N6-MTase*->648455747_?->702932431_VRR-NUC->702932430_VRR-NUC->702932429_?->516541031_?->516541030_?->516541029_?-> 575405212 <-N6-MTase* N6-MTase MT-A70 ETSY1_42765 198 bacteria>proteobacteria>deltaproteobacteria Candidatus Entotheonella sp. TSY1 S-adenosylmethionine-binding protein [Candidatus Entotheonella sp. TSY1]. 575405210_?-><-575405211_?<-575405212_N6-MTase* 640854256 <-N6-MTase*<-?<-?<-?<-NUMOD4 N6-MTase MT-A70 KPNIH27_19120 214 bacteria>proteobacteria>gammaproteobacteria Klebsiella pneumoniae subsp. pneumoniae KPNIH27 DNA methyltransferase [Klebsiella pneumoniae subsp. pneumoniae KPNIH27]. <-640854249_?<-640854250_?||640854251_?-><-640854252_?<-640854253_?<-640854254_?<-640854255_?<-640854256_N6-MTase*<-640854257_?<-640854258_?<-640854259_?<-640854260_NUMOD4<-640854261_?<-640854262_?||640854263_?-> 554729604 RecT->?->?->N6-MTase*->?-><-Phage_integrase<-?<-?||Exonuc_VII-> N6-MTase MT-A70 G966_02949 220 bacteria>proteobacteria>gammaproteobacteria Escherichia coli UMEA 3323-1 hypothetical protein G966_02949 [Escherichia coli UMEA 3323-1]. 554729597_?->554729598_?->554729599_?->554729600_?->554729601_RecT->554729602_?->554729603_?->554729604_N6-MTase*->554729605_?-><-554729606_Phage_integrase<-554729607_?<-554729608_?||554729609_Exonuc_VII-><-554729610_?<-554729611_? 666005014 N6-MTase*->?-><-?||Phage_integrase-> N6-MTase MT-A70 SS17_3321 208 bacteria>proteobacteria>gammaproteobacteria Escherichia coli O157:H7 str. 
SS17 Adenine DNA methyltransferase, phage-associated [Escherichia coli O157:H7 str. SS17]. 666005007_?->666005008_?->666005009_?->666005010_?->666005011_?->666005012_?->666005013_?->666005014_N6-MTase*->666005015_?-><-666005016_?||666005017_Phage_integrase->666005018_?-><-666005019_?<-666005020_?<-666005021_? 695800969 RecT->?->?->?->N6-MTase*->?-><-Phage_integrase<-?<-?||Exonuc_VII-> N6-MTase MT-A70 AF48_RS10595 197 bacteria>proteobacteria>gammaproteobacteria Enterobacter aerogenes adenine methylase [Enterobacter aerogenes]. 695800963_?->695801432_?->695800964_?->695800965_RecT->695800966_?->695801433_?->695800967_?->695800969_N6-MTase*->695800970_?-><-695800972_Phage_integrase<-505805395_?<-505183232_?||518922309_Exonuc_VII->695800973_?->505805397_?-> 556471807 N6-MTase*-> N6-MTase MT-A70 AGZ61752.1 214 viruses Phormidium phage MIS-PhV1A DNA Methyltransferase [Phormidium phage MIS-PhV1A]. 556471801_?->556471802_?->556471803_?->556471804_?->556471805_?->556471806_?->556471807_N6-MTase*->556471808_?->556471809_?->556471810_?->556471811_?->556471812_?->556471813_?->556471814_?-> 337731296 N6-MTase*-> N6-MTase MT-A70 gp5 211 viruses>dsdna viruses, no rna stage>caudovirales EBPR siphovirus 4 hypothetical protein [EBPR siphovirus 4]. 337731292_?->337731293_?->337731294_?->337731295_?->337731296_N6-MTase*->337731297_?->337731298_?->337731299_?->337731300_?->337731301_?->337731302_?->337731303_?-> # 1; 497433097 - N6-MTase SP+MT-A70 290 bacteria>actinobacteria Actinomyces sp. oral taxon 175 SAM-binding domain protein [Actinomyces sp. oral taxon 175].
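The gene-neighborhood strings in the prokaryotic table above can be read programmatically. The following is a minimal sketch, not the authors' pipeline, and it rests on assumptions that this supplement does not state explicitly: that `GI_label->` marks a gene on the forward strand, `<-GI_label` a gene on the reverse strand, `||` a boundary between divergently oriented genes, `?` an unannotated gene, and a trailing `*` the query methylase. The names `Gene`, `parse_neighborhood` and `same_strand_run` are hypothetical. Because the strings carry no coordinates, the sketch uses only the same-direction criterion and cannot check the intergenic-distance cutoff.

```python
import re
from typing import List, NamedTuple


class Gene(NamedTuple):
    gi: str         # identifier preceding the underscore
    label: str      # domain/architecture label, "?" if unannotated
    strand: str     # "+" for "GI_label->", "-" for "<-GI_label" (assumed meaning)
    is_query: bool  # True if the label carried a trailing "*"


# One token is either "<-GI_label" or "GI_label->". The lookahead stops the lazy
# label at the next token start, at a "||" separator, or at the end of the string.
_TOKEN = re.compile(r"(<-)?(\d+)_(.+?)(->)?(?=<-|\|\||\d+_|$)")


def parse_neighborhood(text: str) -> List[Gene]:
    """Split one gene-neighborhood string into an ordered list of genes."""
    genes = []
    for match in _TOKEN.finditer(text.strip()):
        rev, gi, label, fwd = match.groups()
        if bool(rev) == bool(fwd):
            # Either no arrow or two arrows: not a token this sketch understands.
            raise ValueError(f"cannot determine strand for {match.group(0)!r}")
        is_query = label.endswith("*")
        genes.append(Gene(gi, label.rstrip("*"), "-" if rev else "+", is_query))
    return genes


def same_strand_run(genes: List[Gene]) -> List[Gene]:
    """Return the contiguous same-strand block that contains the query gene."""
    idx = next(i for i, g in enumerate(genes) if g.is_query)
    lo = hi = idx
    while lo > 0 and genes[lo - 1].strand == genes[idx].strand:
        lo -= 1
    while hi + 1 < len(genes) and genes[hi + 1].strand == genes[idx].strand:
        hi += 1
    return genes[lo:hi + 1]


# Fragment of the Rothia aeria neighborhood listed in the table.
example = ("553802967_?->553802968_BglII-><-553802969_N6-MTase*"
           "<-553802970_?<-553802972_?||553802973_?->")
genes = parse_neighborhood(example)
run = same_strand_run(genes)
```

Under these assumptions, `run` contains the query methylase together with its two reverse-strand neighbors, while BglII is excluded because it lies on the opposite strand in a convergent arrangement.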
Boundaries and core MTase elements Str-3 Str-4 Str-5 Str-6 Str-7 Str-1 Str-2 FINAL -------------------------EEEHHHHHHHH--H--------------EEEE----------------HHHHH--HHHHHHHHHHHHHHH-----EEEEEEEEEEEE-------------HHHHHHHHHHHHHHHHH---EEEEEEEEEEE----EE--H---HHHHHHHHHHH-------------------------------HHHHHHHHHH------EEEEEE-----HHHHHHHHH---EEEEE--HHHHHHHHH-- ALIGN -------------------------EEEHHHHHHHH--H--------------EEEEE-------------HHHHHHH--HHHHHHHHHHHHHHH-----EEEEEEE-------------------HEHHHHHHHHHHHHHHHH-HH-HEEEEE------EEE--------EEEEEE-------------------------------HHHHHHHHHHHHH-----EEEEE------HHHHHHHH----EEEHHHHHHHHHHHH--- HMM --------------------E----EEEHHHHHHHH--HH---------EEEEEEEEE---EEEEEEE------HHHH--HHHHHHHHHHHHHHHH----EEEEEEEEEEEE-------------HHHHHHHHHHHHHHHHH---HEEEEEEEEE-----EEEE------EEEEEEE----------------E------EEE----HHHHHHHHHHHHHH-----EEEEEE------HHHHHHHHH----EEE--HHHHHH----- FREQ ------------------------------HHHHHH--H--------EE-----EE----------------EEE-----HHH---HHHHHHHHH-----HHHHHHHHHHHHH------------EEEE--HHHHHHHHHHH---EEE-EEEEEEE-----HHHH---HHHHHHHHHHH--------------EE------------------H-EEEEE------EEEE--------HHHHHHHH-H--------HHHHHHHHH-- PSSM -------------------------EEEEHHHHHHH--H--------------EEEE----HHHHHH---------HH--HHHHHHHHHHHHHHH-----EEEEEEE-------------------HHHHHHHHHHHHHHHH----E----EEEEE------------HHHHHHHHHH--------------------------------HHHHHHHHHH-------EEEE------HHHHHHHHH---EE-EE--HHHHHH----- GUITHDRAFT_103022_Guillardia_theta_CCMP2712_551670651 STTE--KGWDVAEG---SSKR----TVVCMDALEWM--MQSENEGLKGGMFVGSVLTSLPDISELQFPQVSEGEKLER--YKGWFVDTAAMILNRIPAGQFAIFYQSDVRVCTKE--------GQVEDWIDKAALCYEASKRTSCKQLWHKYALTCSPGTRSVGR---PTLSHIVCFSNG-ATYKRDRFPAPDVFYR-GEMIWPRAIGLDACVLCLAFLRNL-GNVSTVIDPFCGRGTTLAVANALGMDAVGVELSPKRCRIATSLS Smin1000020133_Symbiodinium_minutum_Mf_105b01_Smin1000020133 
RTGG--RKMMPKEA--PNGRR----EVICEDALEWI-EKQGHFPS---G---SMVFTSLPDMSE--VVEFA-PR-FED--WEDFFMKAVRHILTALPYGSVAAFYQTDVRLP-TE--------GQ----VSKAFLVLKAAEAV--------------PEARGFGG---G------CV---------------------------KVMGVSATATVLKWATRRLAGLHTVIDPFCGAGTVLAMANAFGLDAIGVDLSPKRIKQAQRLD DICPUDRAFT_50950_Dictyostelium_purpureum_330842901 LQLN--KEKGLIDK--FGVYR----DVYCMDAVQWL-NNNAIDPN-------TSVITSLPDITE--VSGFT----LEQ--YKQWFTNTVQLIASKLSDNNVGIVYQTDIK---YHWKHDRSLIEE---YIDKGYLAMKGIEAAGCKVVWHKIMAASDLTKMIITK-NKSSFTHMICFAKQPTNIKYQD-NTPDINTR-GDMVWSRAMGLNACEISTSYVRG--IGSHTVLDPFCGKGSVLAVANVYGLNGIGIDLSTSKFRNSFNLQ DDB_G0285285_Dictyostelium_discoideum_AX4_66809181 LELN--KKNGILDR--VGVYR----DIYCMDAVQWL-KENEISPQ-------SSVITSLPDISE--VSSMN----LEQ--YKQWFTDTVSLITSKLDEKNVAIVYQTDIK---KKWKMDRGIIEE---YVDKGYLAQKGAEISGCKVIWHKIMTAHS----NISN-NKATFTHMICFSKNPTNIKYQE-NTPDIGGR-GSMVWSKAMGLNACVIAILYCRS--IGSTTIIDPFCGKGSVLAVANVYGLDSVGVDLSSGKTKNSFNLQ DFA_07250_Dictyostelium_fasciculatum_470249763 IKEN--KIHVPKDR--LKVTR----QVYCIDAIEWL-KNNELSPN-------TSVITSLPDIVE--MSGYT----LPQ--YKEWFVNAVRLITSKLSDNNVAIFYQTDIK---RKWKKDKSVTEE---YVDKGYMVQKGAELSECKVLFHKLMLSHPVETEVVTR-NKASFTHMICIAKNPSSIMHQD-NTPDVAPR-GAMVWPKAMGLNACLVAVRFIRG--VGSTSILDCFCGKGSVLGTANLLGLSSIGVDLSVAKCRNSHSLV SAMD00019534_083600_Acytostelium_subglobosum_LB1_735852337 IKEN--RTNASRIQ--KRVMR----DITCQDAVQWL-KDNTVASG-------TSVITSLPDIVE--MTGYS----LQQ--YRDWFVDTVELITSKLSDNNVAIFYQTDIR---RKVKGNKGIVDE---YLDKGYLCAKGAERSGCKMVWHKLMYSVAPELGKVSRGQTPGFSHMLCFAKKPGLLTYQE-QTPDIAPR-GGMVWKKAMGLNACMVALRYIRG--VGCNTVLDTFCGKGSVLAAANMLGLHAIGVDLSISKTRHSSNLV MXDZ_RS0208475_Myxococcus_xanthus_499869570 MVDE--QAGATGAA-----KR----TVYCEDALVWL-EARPVLEG-------SSAIASLPDWSE--FPSLS----LAE--WKAWFIRAAALILARVPPEGVAIFYQTDVK---DE--------GT---WVDKGYLVARAAEEVGVDLLWHKVVCRRPPGTVTFGR---PAYSHMLCFSRG-VRVDMAK-STPDVLPEAGEVTWTRGMGVEACLIACRFILEH-TRTRTVVDPFCGHGTVLAVANALGLDAVGVELSRKRARKARNLR LILAB_RS07805_Myxococcus_fulvus_760026550 
MVDE--REGAAGAA-----KR----TVYCEDALVWL-EARPALAG-------SSAIASLPDGSE--FPSLS----LAD--WKAWFIRAAALILARVPPEGVAIFYQTDVK---EE--------GT---WVDKGYLVSRAAEEVGVELLWHKVVCRRPPGTVTFGR---PAYSHMLCFSRG-VRVDLAK-STPDVLPEAGEVTWTRGMGVEACLIACRFILEH-TRTRTVVDPFCGHGTVLAVANALGLDSVGVELSRKRARKARNLR MXAN_RS00775_Myxococcus_xanthus_763416965 MVDE--QAGATGAA-----KR----TVYCEDALVWL-EARPVLEG-------SSAIASLPDWSE--FPSLS----LAE--WKAWFIRAAALILARVPPEGVAIFYQTDVK---DE--------GT---WVDKGYLVARAAEEVGVDLLWHKVVCRRPPGTVTFGR---PAYSHMLCFSRG-VRVDMAK-STPDVLPEAGEVTWTRGMGVEACLIACRFILEH-TRTRTVVDPFCGHGTVLAVANALGLDAVGVELSRKRARKARNLR LILAB_07935_Myxococcus_fulvus_HW-1_337257340 MVDE--REGAAGAA-----KR----TVYCEDALVWL-EARPALAG-------SSAIASLPDGSE--FPSLS----LAD--WKAWFIRAAALILARVPPEGVAIFYQTDVK---EE--------GT---WVDKGYLVSRAAEEVGVELLWHKVVCRRPPGTVTFGR---PAYSHMLCFSRG-VRVDLAK-STPDVLPEAGEVTWTRGMGVEACLIACRFILEH-TRTRTVVDPFCGHGTVLAVANALGLDSVGVELSRKRARKARNLR A176_RS20440_Myxococcus_sp_(contaminant_ex_DSM_436)_488713785 MVDE--RDGDTGAA-----RREPQRTVDCEDALAWL-EARPVLEG-------SSAIASLPDWSE--FPTLS----LAD--WKAWFIRAAALILARVPPEGVAIFYQTDVK---EE--------GT---WVDKGYLVSRAAEEVGVDLLWHKVVCRRPPGTVTFGR---PAYSHMLCFSRG-VRVDMGK-STPDVLPEAGEVTWTRGMGVEACLIACRFILEH-TRTRTVVDPFCGHGTVLAVANALGLDAVGVELSRKRARKARNLR DB31_RS18015_Hyalangium_minutum_763330777 MTES--GAGGASER--PEGQR----TVECADARVWL-EGRQVLEG-------CSAITSLPDVSE--FPELS----LAE--WKQWFIRAAVLVMSKVPAQGVAIFYQTDVK---KD--------GA---WVDKGYLISKAAEEAGCELLWHKVVCRRPPGTVTFGR---PAYSHMLCFSRG-IRVDLGK-ATPDVLPDAGEVTWTRGMGLHACLAACRFILEH-TATRTVVDPFCGHGTVLAVANALGLDAVGVELSRKRAKKARVLR PPL_08445_Polysphondylium_pallidum_PN500_281204782 LKDN--K-NKDNVA--KSVYR----NLFCMDALEYI-KNNELEKT-------TSIITSLPDIVE--MSGYT----LDR--YKTWFVNAITLICSKLTDNNVAIFYQTDVK---RKVKGNKGVVDE---YLDKGYMCSKGAEIAGCKMVWHKMMTSSPPELGKVARGTKSSFSHMICIARTPSNLIYQE-QTPDIAPR-GAMTWPKAMGLNACMVAAKYIRG--IGSTTILDPFCGKGSVLAIANLIGLNSIGVDLAISKVRHSCNLL MFUL124B02_RS00860_Myxococcus_fulvus_819023527 
MLHT--TAG-----------R----TVHCEDALTWL-AAQPILTG-------CSAVASLPDASE--FPTLS----LAE--WKAWFIRAAALVMSRVPDDGVAIFYQTDVK---DE--------GL---WVDKGYLVSRAAEDSGMGLLWHKVVCRRAPGTVTFGR---PAYSHMLCFSRG-IRVDLGK-STADVLPDAGEVTWTRGMGVEACQLACRFILEH-TPTRTVVDPFCGHGTALAVANAMGLEAIGVELSRKRARKARNLR MYSTI_RS00680_Myxococcus_stipitatus_505158657 MADE--RDGTDGRA--LDARR----TVHCEDALTWL-AAQPVLTG-------CSAVASLPDASE--FPTLS----LAE--WKAWFTRAAALVMSRVPDDGVAIFYQTDVK---DE--------GL---WVDKGYLVSRAAEEAGLGMLWHKVVCRRAPGTVTFGR---PAYSHMLCFSKG-VRPDLAK-STADVLPEAGEVTWTRGMGVEACQLACRFILEQ-TSTRTVVDPFCGHGTALAVANAMGLQAVGVELSRKRARKARNLF Q664_RS19790_Cystobacter_violaceus_759680543 ---------MEQAP--SQGKR----TVHCADALAWL-EAQGVLAG-------CSLITSMPDVSE--FPSLS----LAQ--WKEWFVRTASLVLSRCPDDGVTIFYQTDIK---KD--------GT---WVDKGYLVQKAAEQLGHSLLWHKVVCRTPPGSITFGR---PAYSHMLCFSRG-LRAALSK-STADVLPQAGEVTWTRGMGVQACLVACRYVLEN-TPTRTIVDPFCGHGSVLAVANWLGLEAVGVELSRKRAKKARALQ D187_RS06910_Cystobacter_fuscus_488707209 MTSD--ERRAEREA--PQGER----TVHCADALAWL-EAQGVLEG-------CSLITSMPDVSE--FPTLT----LAE--WKDWFVRTAALVLSRCPDEGVTIFYQTDIK---KD--------GT---WVDKGYLVQKAAEQQGHALLWHKVVCRAPAGQTTFGR---PAYSHLLCFSRD-VRADLSR-STPDVLPQAGEVTWTRGMGVEACLAACRYVLEN-TSTRRIVDPFCGHGTVLAVANDLGLDAVGVELSRKRAKKARALR STAUR_RS01010_Stigmatella_aurantiaca_488687410 MAEH--GEAENTPP--PPGRR----TVECAEAVAWL-SGRGVLEG-------CSVITSLPDLSE--FPALS----LAE--WKQWFIRAAALVMAKVPPEGVALFYQTDVK---HE--------GT---WVDKGYLVSRAAEEAGQETLFHKVVCRRPPGTVTFGR---PAYSHLLGFSRG-VRLALSK-ATADVLPEAGEVTWTRGMGVRACLAACRFIQEH-TPTRTVVDPFCGHGTVLAVANALGLDAVGVELSRKRARKARALR COCOR_RS39470_Corallococcus_coralloides_504213579 MTDG--RSGTAAA------KR----TVYCEDALAWL-DARPVLEG-------CSMVASLPDVSE--FPQLT----VPQ--WKDWFVGAAAKVLSRVPEDGVAVFYQSDVK---KD--------GA---WVDKGYLVSKAAEAAGCDTLWHKVVCRRTPGTVTFGR---PAYSHLLCFSRG-LKADAAK-STADVLPDPGEVTWTRGMGLNACLVACRFILEQ-TRTRTVVDPFCGHGTALAVANALGLDAVGVELSRKRARRARNLQ Anae109_2889_Anaeromyxobacter_sp_Fw109-5_152029321 
------MPGSLLSRPISAPRR----AVHVGDGVAWL-EAGALPAD-------HALVTSLPDASE--LPALG----ADG--WRRWFLAAAERACRAVADEAVAIFYQTDVK---RD--------GA---WVDKAFLVQLAAERAGSALLWHKIVCRVAPGTTTVGR---PAYAHLLCVSRA-LRLAPGQ-SSPDVLPVAGAMTWPRAMPLEACAAAARFLVAH-TRCRTVVDPFCGLGSMLAVANAHGLDAVGVELSRQRAERARALA PSR1_03713_Anaeromyxobacter_sp_PSR-1_775300299 MADGRPQAGEVGAR---APRR----DVRCGDGVAFLREAAPLPPD-------HALVTSLPDASE--LPALG----AAG--WEAWFVDVAALACAAVDPGAPAVFYQTDVK---RD--------GA---WVDKAHLVALGAARAGARLLFHKIVCRVPPGTATFGR---PAYAHLLCCARA-LRLDPAR-ATPDVLPALGQMPWPRAMGAAACEAVARFLLAD-TACRTVVDPFCGLGTMLAVANAHGLDAIGVELSRRRADRARRLH ANAE109_RS14815_Anaeromyxobacter_sp_Fw109-5_752809907 ---------------MSAPRR----AVHVGDGVAWL-EAGALPAD-------HALVTSLPDASE--LPALG----ADG--WRRWFLAAAERACRAVADEAVAIFYQTDVK---RD--------GA---WVDKAFLVQLAAERAGSALLWHKIVCRVAPGTTTVGR---PAYAHLLCVSRA-LRLAPGQ-SSPDVLPVAGAMTWPRAMPLEACAAAARFLVAH-TRCRTVVDPFCGLGSMLAVANAHGLDAVGVELSRQRAERARALA Adeh_4079_Anaeromyxobacter_dehalogenans_2CP-C_85777006 MADGRPQAGEVGAR---APRR----DVRCGDGVAFLREAAPLPPD-------HALVTSLPDASE--LPALG----AAG--WEAWFVDVAALACAAVDAGAPAIFYQTDVK---RD--------GA---WVDKAHLVALGAARAGARLRFHKIVCRVPPGTATFGR---PAYAHLLCCSRA-LRLDPAR-ATPDVLPALGQMPWPRAMGAAACEAVARFLLAE-TGCRTVVDPFCGLGTMLAVANAHGLDAVGVELSRRRADRARRLQ A2CP1_RS21285_Anaeromyxobacter_dehalogenans_506415536 MSDGR-QAGGVGAR---APRR----EVRCGDGVSFLREAAPLPPD-------HALVTSLPDASE--LPALG----AEG--WEAWFVDVAALACAAVAPGAPAIFYQTDVK---RD--------GA---WVDKAQLVARGAARAGARLLFHKIVCRVPPGTATFGR---PAYAHLLCCSRA-LRLDPAR-ATPDVLPALGQMPWPRAMGAAACEAVARFLLAE-TGCRTVVDPFCGLGTMLAVANAHGLDALGVELSRRRADRARRLH ANAEK_RS21100_Anaeromyxobacter_sp_K_501520222 MSDGR-QAGGVGAR---APRR----EVRCGDGVAFLREAAPLPPD-------HALVTSLPDASE--LPALG----AEG--WEAWFVDVAALACAAVAPGAPAIFYQTDVK---RD--------GA---WVDKAHLVALGAARAGARLLFHKIVCRVPAGTATFGR---PAYAHLLCCSRA-LRLDPAR-ATPDVLPALGQMPWPRAMGAAACQAVARFLLAE-TGCRTVVDPFCGLGTMLAVANAHGLDGVGVELSRRRADRARRLQ ADEH_RS21070_Anaeromyxobacter_dehalogenans_752814769 
----------MGAR---APRR----DVRCGDGVAFLREAAPLPPD-------HALVTSLPDASE--LPALG----AAG--WEAWFVDVAALACAAVDAGAPAIFYQTDVK---RD--------GA---WVDKAHLVALGAARAGARLRFHKIVCRVPPGTATFGR---PAYAHLLCCSRA-LRLDPAR-ATPDVLPALGQMPWPRAMGAAACEAVARFLLAE-TGCRTVVDPFCGLGTMLAVANAHGLDAVGVELSRRRADRARRLQ M201_gp12_Halovirus_HCTV-2_509139482 -------------------MS--C-DIYQADGIEWC-RENPNA---------GAVVTSLPDPENIIFPEGS-A--YEGRPW-AWFREAIDACAEATHPNAPLVLRQTDRR---DN--------GT-KSKAALAFDVLLENQDDDWRCLWHKIVLHQDPETTNIHR---PTYSHLLAFGRP--QVGPGS-RTPDVLRP-GDKLYANGMGLATAERAVQFAG---SAHEVIVDPFCGRGTVPVMADALGYSAIGVDLDPEQVQHARGLT consensus/100% ....................p.....l...-u..ah..................hhsShPD..p..hs..s....h....a..aF..sh..h..........hhbQoD.+....p..............hsbu..h.b........................h.....s......sh...........................pshsh.ss..s..ah.......p.llDsFCG.Go..s.As..GhpulGl-Ls..b.p.u..L. consensus/95% ....................R.....l.s.Dul.ah.................uhlsShPD.sE..hs.hs....h....ac.WF..sh..h...hs..ssslhYQoDl+...pc.............blsKu.hh.bus........aHKhh...s.....hsp...sshsHhlshup........p..ssDl....G...a.+uMsh.As..sh.ah.......psllDsFCG.GohLuhANh.GhpulGV-Lu..+.cpu..L. consensus/90% ...................bR....pl.s.Dul.al........s.......puhlsSLPD.sE..hs.hs....h....ac.WF..sh.bh...hs..ssslhYQTDl+...cc.............alDKuahh.buu..ss....aHKhh...ss....hs+...ssasHhlChu+...ph...p..osDl....G.h.Ws+uMGh.AC.hshbah......spollDPFCG.GohLAhANh.GLpulGV-LS.p+scpup.L. consensus/85% ...................bR....sV.C.Dul.aL.ps.sh..s.......puhlsSLPDhoE..hs.hs....h....ac.WFhpsh.hhhs.lss.ssAlFYQTDlK...cc........s....alDKuaLh.buA..sG..hlaHKlhhp.ss.p.shuR...suauHhlChu+s..ph...c.sTsDlhs..G.hsWs+uMGh.AC.hsh+alb...s.spTllDPFCG.GohLAlANh.GLsulGV-LS.p+scpup.L. consensus/80% ...................bR....sV.C.Dul.aL.cs.sl..s.......puhlsSLPDhSE..hsshs....h....ac.WFlpsh.hhhu.lss.ssAIFYQTDlK...cc........G....aVDKuaLl.buAbbsG..hlaHKlhhp.sP.s.shuR...PuauHhlChu+s..ph...c.sTsDVhP..G.hsWs+uMGl.ACbhsh+alb...s.scTllDPFCG.GohLAVANh.GLculGV-LS.p+sc+Ap.L. 
consensus/75% b..................bR....sV.C.Dul.WL.cs.sl..s.......puhlsSLPDhSE..hPshu....h....Wc.WFlcshsLhhu.lss.usAIFYQTDlK...c-........G....aVDKuaLV.buAcbsGhphlaHKlhhp.sPsosshGR...PuauHhLChu+s..pls.sc.sTPDVhP..GphsWsRuMGlpACbhshRalb...s.s+TVlDPFCG.GohLAVANh.GL-ulGV-LSbpRsc+A+sLp consensus/70% h.p...b...........scR....sV.C.DAlsWL.cupsl..s.......suhlTSLPDhSE..hPsLu....hsp..W+sWFlcsAsLlhu.lsspuVAIFYQTDVK...c-........G....WVDKuaLV.+uAEcsGhclLWHKlVCR.sPGTsThGR...PAYoHhLChSRu.l+ls.uc.uTPDVLP..GphsWsRuMGlpACbhAhRFlb.p.TssRTVVDPFCG+GThLAVANAhGLDAlGVELSR+RA++ARsLp
General notes
In eukaryotes, this clade of MTases is found in Guillardia, dinoflagellates and amoebozoans. The Dictyostelium DICPUDRAFT_50950 protein is the prototype of this family. Interestingly, these proteins do not possess the strand-4 motifs frequently observed in DAM methylases, and instead have an SLPD motif. This type of motif is present only in this clade of methylases and not outside of it. The characteristic sequence features of this family include a DxxC motif after strand-1, D/E in strand-2, D after strand-3, SLPD instead of DPPY after strand-4, DxK after strand-5, HK in strand-6, and H and R flanking strand-7. The methylase fused to NTF2 in the dinoflagellate seems to be a fragment, but the Guillardia protein (gi|551670651) is full-length and was used to reconstitute the dinoflagellate methylase, which is divided between two proteins, Smin1000020134 and Smin1000020133. Smin1000020134 is the N-terminal part and has PPR repeats at its N-terminus, whereas Smin1000020133 is fused to an NTF2 domain at the C-terminus. This fusion is like that in the Ot12g00270 clade, where the two predicted dinoflagellate RNA methylases are fused to PPR repeats, although they have been independently derived in eukaryotes.
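As an illustration only, the ordered motifs listed in these notes can be checked against a row of the alignment above with a few regular expressions. This is a minimal sketch and not the method used to build the alignment: the patterns (with "x" read as any residue), the function name find_ordered_motifs and the constant ROW are assumptions of the sketch, and only four of the listed features (DxxC, SLPD, DxK, HK) are covered. ROW is the leading part of the aligned DICPUDRAFT_50950 row.

```python
import re

# Ordered motifs taken from the notes; the regexes are this sketch's own
# simplification ("x" = any residue), not the authors' procedure.
MOTIFS = [
    ("DxxC", r"D..C"),   # after strand-1
    ("SLPD", r"SLPD"),   # after strand-4, in place of DPPY
    ("DxK", r"D.K"),     # after strand-5
    ("HK", r"HK"),       # in strand-6
]

# Leading part of the aligned DICPUDRAFT_50950 row (gaps shown as "-").
ROW = (
    "LQLN--KEKGLIDK--FGVYR----DVYCMDAVQWL-NNNAIDPN-------TSVITSLPDITE--"
    "VSGFT----LEQ--YKQWFTNTVQLIASKLSDNNVGIVYQTDIK---YHWKHDRSLIEE---"
    "YIDKGYLAMKGIEAAGCKVVWHKIMAASD"
)


def find_ordered_motifs(aligned_row):
    """Return {motif name: 0-based start in the ungapped sequence, or None}.

    Each motif is searched for only downstream of the previous hit, so the
    reported positions respect the N- to C-terminal order given in the notes.
    """
    seq = aligned_row.replace("-", "").upper()
    hits = {}
    start = 0
    for name, pattern in MOTIFS:
        match = re.compile(pattern).search(seq, start)
        if match is None:
            hits[name] = None
            continue
        hits[name] = match.start()
        start = match.end()
    return hits


if __name__ == "__main__":
    for name, pos in find_ordered_motifs(ROW).items():
        print(name, pos)
```

A row in which any of the four positions comes back as None would be a candidate for closer manual inspection rather than an automatic exclusion, since short patterns such as HK can also match by chance.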
GI Operons/Domain architectures Arch Pfam-Arch Gene name len phylogeny Species Genbank descriptions #; Eukaryotic versions 66809181 <-DAM* DAM DDB_G0285285 440 eukaryota>amoebozoa>mycetozoa>dictyosteliida Dictyostelium discoideum AX4 hypothetical protein DDB_G0285285 [Dictyostelium discoideum AX4]. <-66809167_?||66809169_?-><-66809171_?<-66809173_?||66809175_?->66809177_?-><-66809179_?<-66809181_DAM*<-111226459_?<-66809185_?<-66809187_?||66809189_?-><-66809191_?<-66809193_?||66809195_?-> 470249763 <-DAM* DAM DFA_07250 372 eukaryota>amoebozoa>mycetozoa>dictyosteliida Dictyostelium fasciculatum hypothetical protein DFA_07250 [Dictyostelium fasciculatum]. <-470249749_?||470249751_?->470249753_?-><-470249755_?||470249757_?-><-470249759_?||470249761_?-><-470249763_DAM*<-470249765_?<-470249767_?||470249769_?-><-470249771_?||470249773_?-><-470249775_?<-470249777_? 281204782 DAM*-> DAM PPL_08445 361 eukaryota>amoebozoa>mycetozoa>dictyosteliida Polysphondylium pallidum PN500 hypothetical protein PPL_08445 [Polysphondylium pallidum PN500]. <-281204775_?||281204776_?->281204777_?-><-281204778_?<-281204779_?<-281204780_?<-281204781_?||281204782_DAM*-><-281204783_?<-281204784_?||281204785_?->281204786_?-><-281204787_?<-281204788_?||281204789_?-> 735852337 <-DAM* DAM N6_N4_Mtase SAMD00019534_083600 341 eukaryota>amoebozoa>mycetozoa>dictyosteliida Acytostelium subglobosum LB1 hypothetical protein SAMD00019534_083600, partial [Acytostelium subglobosum LB1]. <-735852330_?<-735852331_?<-735852332_?||735852333_?-><-735852334_?<-735852335_?||735852336_?-><-735852337_DAM*<-735852338_?<-735852339_?<-735852340_?<-735852341_?<-735852342_?<-735852343_?||735852344_?-> 551670651 <-DAM* DAM SP+N6_N4_Mtase GUITHDRAFT_103022 318 eukaryota>cryptophyta Guillardia theta CCMP2712 hypothetical protein GUITHDRAFT_103022 [Guillardia theta CCMP2712]. 
<-551670637_?||551670639_?-><-551670641_?||551670643_?->551670645_?->551670647_?-><-551670649_?<-551670651_DAM*<-551670653_?<-551670655_?||551670657_?->551670659_?-><-551670661_?||551670663_?->551670665_?-> 330842901 DAM*-> DAM N6_N4_Mtase DICPUDRAFT_50950 259 eukaryota>amoebozoa>mycetozoa>dictyosteliida Dictyostelium purpureum hypothetical protein DICPUDRAFT_50950, partial [Dictyostelium purpureum]. 330842901_DAM*-><-330842915_?||330842903_?->330842905_?->330842907_?->330842909_?-><-330842917_?||330842911_?-> Smin1000020133\ DAM Smin1000020133 193 eukaryota>alveolata>dinophyceae Symbiodinium minutum Mf 1.05b.01 Smin1000020134/ DAM Smin1000020134 633 eukaryota>alveolata>dinophyceae Symbiodinium minutum Mf 1.05b.01 # 25; Prokaryotic homologs 488687410 <-METHYLASE<-?<-5xTM+HISKIN<-?<-DAM*||?->?->SUKH->?-><-DOC DAM N6_N4_Mtase STAUR_RS01010 231 bacteria>proteobacteria>deltaproteobacteria Stigmatella aurantiaca hypothetical protein [Stigmatella aurantiaca]. <-739729589_?<-488687416_?<-488687441_?<-488687447_METHYLASE<-488687432_?<-503139379_5xTM+HISKIN<-488687400_?<-488687410_DAM*||488687402_?->488687445_?->488687427_SUKH->488687389_?-><-739729599_DOC||503139381_?->739729588_?-> 760026550 <-HISKIN||3xTM->?-><-?<-ABHYDROLASE3||DAM*-><-?<-?||?->5xTM+HISKIN->?->METHYLASE-> DAM N6_N4_Mtase LILAB_RS07805 264 bacteria>proteobacteria>deltaproteobacteria Myxococcus fulvus hypothetical protein [Myxococcus fulvus]. 503702586_?->503702587_?-><-503702588_HISKIN||760026545_3xTM->503702590_?-><-503702591_?<-760026547_ABHYDROLASE3||760026550_DAM*-><-760029538_?<-760026553_?||760029540_?->503702597_5xTM+HISKIN->503702598_?->760026556_METHYLASE->503702600_?-> 819023527 <-METHYLASE<-?<-5xTM+HISKIN<-?||?->ABHYDROLASE3->?-><-DAM*||ABHYDROLASE3-> DAM N6_N4_Mtase MFUL124B02_RS00860 237 bacteria>proteobacteria>deltaproteobacteria Myxococcus fulvus hypothetical protein [Myxococcus fulvus]. 
<-819023518_METHYLASE<-819023520_?<-819023522_5xTM+HISKIN<-819023524_?||819023525_?->819038976_ABHYDROLASE3->819023526_?-><-819023527_DAM*||819023528_ABHYDROLASE3->819023531_?->819038978_?-><-819023533_?<-819023534_?||819023535_?->819023536_?-> 505158657 <-METHYLASE<-?<-5xTM+HISKIN<-?||?->ABHYDROLASE3->?-><-DAM*||?->ABHYDROLASE3->?-><-?<-?<-3xTM DAM N6_N4_Mtase MYSTI_RS00680 245 bacteria>proteobacteria>deltaproteobacteria Myxococcus stipitatus hypothetical protein [Myxococcus stipitatus]. <-505158650_METHYLASE<-505158651_?<-505158652_5xTM+HISKIN<-505158653_?||505158654_?->763426938_ABHYDROLASE3->505158656_?-><-505158657_DAM*||505158658_?->505158659_ABHYDROLASE3->505158660_?-><-505158661_?<-505158662_?<-505158663_3xTM||505158664_?-> 488713785 3xTM->?-><-?<-ABHYDROLASE3<-?||DAM*-><-?<-?||?->5xTM+HISKIN->?->METHYLASE-> DAM N6_N4_Mtase A176_RS20440 250 bacteria>proteobacteria>deltaproteobacteria Myxococcus sp. (contaminant ex DSM 436) hypothetical protein [Myxococcus sp. (contaminant ex DSM 436)]. <-488713778_?<-488713779_?||768721037_3xTM->768721038_?-><-488713782_?<-488713783_ABHYDROLASE3<-488713784_?||488713785_DAM*-><-488713786_?<-768721088_?||768721089_?->768721039_5xTM+HISKIN->488713790_?->768721040_METHYLASE->488713792_?-> 763416965 <-METHYLASE<-?<-5xTM+HISKIN<-?||?->?-><-DAM*||ABHYDROLASE3->?-><-?<-3xTM DAM N6_N4_Mtase MXAN_RS00775 246 bacteria>proteobacteria>deltaproteobacteria Myxococcus xanthus hypothetical protein [Myxococcus xanthus]. <-521966331_?<-499869564_METHYLASE<-499869565_?<-499869566_5xTM+HISKIN<-521966329_?||499869568_?->759926767_?-><-763416965_DAM*||499869571_ABHYDROLASE3->499869572_?-><-499869573_?<-521966328_3xTM||499869575_?->521966327_?->499869577_?-> 337257340 <-HISKIN||3xTM->?-><-?<-ABHYDROLASE3||DAM*-><-?<-?||?->5xTM+HISKIN->?->METHYLASE-> DAM N6_N4_Mtase LILAB_07935 246 bacteria>proteobacteria>deltaproteobacteria Myxococcus fulvus HW-1 hypothetical protein LILAB_07935 [Myxococcus fulvus HW-1]. 
337257333_?->337257334_?-><-337257335_HISKIN||337257336_3xTM->337257337_?-><-337257338_?<-337257339_ABHYDROLASE3||337257340_DAM*-><-337257341_?<-337257342_?||337257343_?->337257344_5xTM+HISKIN->337257345_?->337257346_METHYLASE->337257347_?-> 499869570 3xTM->?-><-?<-ABHYDROLASE3||DAM*-><-?<-?||?->5xTM+HISKIN->?->METHYLASE-> DAM N6_N4_Mtase MXDZ_RS0208475 266 bacteria>proteobacteria>deltaproteobacteria Myxococcus xanthus hypothetical protein [Myxococcus xanthus]. <-499869577_?<-521966327_?<-499869575_?||521966328_3xTM->499869573_?-><-499869572_?<-499869571_ABHYDROLASE3||499869570_DAM*-><-759926767_?<-499869568_?||521966329_?->521966330_5xTM+HISKIN->499869565_?->499869564_METHYLASE->521966331_?-> 763330777 DAM*->?->5xTM+HISKIN->?->METHYLASE-> DAM N6_N4_Mtase DB31_RS18015 234 bacteria>proteobacteria>deltaproteobacteria Hyalangium minutum hypothetical protein [Hyalangium minutum]. 763331669_?-><-763330769_?<-763331672_?<-763330771_?<-763331675_?<-763330773_?||763330775_?->763330777_DAM*->763330780_?->763330783_5xTM+HISKIN->763330787_?->763330788_METHYLASE->763331677_?->763331678_?-><-763330791_? 759680543 <-5xTM+HISKIN<-?<-?<-?<-?||?->ABHYDROLASE3-><-DAM*<-?<-?<-?||?-><-?||ABHYDROLASE3-> DAM N6_N4_Mtase Q664_RS19790 237 bacteria>proteobacteria>deltaproteobacteria Cystobacter violaceus hypothetical protein [Cystobacter violaceus]. <-759680531_5xTM+HISKIN<-759680612_?<-759680533_?<-759680534_?<-759680537_?||759680540_?->759680613_ABHYDROLASE3-><-759680543_DAM*<-759680545_?<-759680548_?<-759680552_?||759680555_?-><-759680557_?||759680560_ABHYDROLASE3->759680563_?-> 488707209 <-DAM*<-?||?-><-?||?->STYKIN->STYKIN-> DAM N6_N4_Mtase D187_RS06910 240 bacteria>proteobacteria>deltaproteobacteria Cystobacter fuscus hypothetical protein [Cystobacter fuscus]. 
<-759717612_?<-488707202_?<-488707203_?||488707204_?-><-488707206_?||759717613_?-><-759717277_?<-488707209_DAM*<-488707210_?||759717614_?-><-759717615_?||759717616_?->759717617_STYKIN->759717619_STYKIN->759717279_?-> 504213579 3xTM->?-><-?<-ABHYDROLASE3||DAM*-><-?||5xTM+HISKIN->METHYLASE-> DAM N6_N4_Mtase COCOR_RS39470 243 bacteria>proteobacteria>deltaproteobacteria Corallococcus coralloides hypothetical protein [Corallococcus coralloides]. <-759606359_?<-504213574_?<-759604248_?||504213575_3xTM->504213576_?-><-504213577_?<-504213578_ABHYDROLASE3||504213579_DAM*-><-504213580_?||504213581_5xTM+HISKIN->759606361_METHYLASE-><-759606363_?||759604249_?->504213586_?->504213587_?-> 85777006 <-Glyoxalase_4<-TRANSGLUTAMINASE||?->?-><-Acyl-ACP_TE||DAM*->METHYLASE-><-DnaJ||Ferredoxin-RRM-> DAM N6_N4_Mtase Adeh_4079 252 bacteria>proteobacteria>deltaproteobacteria Anaeromyxobacter dehalogenans 2CP-C conserved hypothetical protein [Anaeromyxobacter dehalogenans 2CP-C]. 85776999_?-><-85777000_?<-85777001_Glyoxalase_4<-85777002_TRANSGLUTAMINASE||85777003_?->85777004_?-><-85777005_Acyl-ACP_TE||85777006_DAM*->85777007_METHYLASE-><-85777008_DnaJ||85777009_Ferredoxin-RRM->85777010_?->85777011_?-><-85777012_?||85777013_?-> 501520222 <-Glyoxalase_4<-TRANSGLUTAMINASE||?->?-><-Acyl-ACP_TE||DAM*->METHYLASE-><-DnaJ||Ferredoxin-RRM-> DAM N6_N4_Mtase ANAEK_RS21100 247 bacteria>proteobacteria>deltaproteobacteria Anaeromyxobacter sp. K hypothetical protein [Anaeromyxobacter sp. K]. 
501520215_?-><-501520216_?<-501520217_Glyoxalase_4<-501520218_TRANSGLUTAMINASE||501520219_?->501520220_?-><-501520221_Acyl-ACP_TE||501520222_DAM*->501520223_METHYLASE-><-501520224_DnaJ||501520225_Ferredoxin-RRM->501520226_?->501520227_?-><-501520228_?||501520229_?-> 506415536 <-Glyoxalase_4<-TRANSGLUTAMINASE||?->?-><-Acyl-ACP_TE||DAM*->METHYLASE-><-DnaJ||Ferredoxin-RRM-> DAM N6_N4_Mtase A2CP1_RS21285 247 bacteria>proteobacteria>deltaproteobacteria Anaeromyxobacter dehalogenans hypothetical protein [Anaeromyxobacter dehalogenans]. 506415530_?-><-506415531_?<-506415532_Glyoxalase_4<-506415533_TRANSGLUTAMINASE||506415534_?->501520220_?-><-506415535_Acyl-ACP_TE||506415536_DAM*->506415537_METHYLASE-><-506415538_DnaJ||506415539_Ferredoxin-RRM->506415540_?->506415541_?-><-506415542_?||506415543_?-> 775300299 <-Glyoxalase_4<-TRANSGLUTAMINASE||?->?-><-Acyl-ACP_TE||DAM*->METHYLASE-><-DnaJ DAM N6_N4_Mtase PSR1_03713 244 bacteria>proteobacteria>deltaproteobacteria Anaeromyxobacter sp. PSR-1 hypothetical protein PSR1_03713 [Anaeromyxobacter sp. PSR-1]. 775300292_?-><-775300293_?<-775300294_Glyoxalase_4<-775300295_TRANSGLUTAMINASE||775300296_?->775300297_?-><-775300298_Acyl-ACP_TE||775300299_DAM*->775300300_METHYLASE-><-775300301_DnaJ 152029321 <-Endonuclease_5<-?<-TIMbarrel_redox||?-><-?||DAM*-><-?<-URI<-Patatin DAM N6_N4_Mtase Anae109_2889 225 bacteria>proteobacteria>deltaproteobacteria Anaeromyxobacter sp. Fw109-5 conserved hypothetical protein [Anaeromyxobacter sp. Fw109-5]. <-152029314_?||152029315_?-><-152029316_Endonuclease_5<-152029317_?<-152029318_TIMbarrel_redox||152029319_?-><-152029320_?||152029321_DAM*-><-152029322_?<-152029323_URI<-152029324_Patatin||152029325_?-><-152029326_?||152029327_?-><-152029328_? 752809907 <-Endonuclease_5<-?<-TIMbarrel_redox<-?||DAM*-><-?<-URI<-Patatin DAM SP+N6_N4_Mtase ANAE109_RS14815 216 bacteria>proteobacteria>deltaproteobacteria Anaeromyxobacter sp. Fw109-5 hypothetical protein [Anaeromyxobacter sp. Fw109-5]. 
752809903_?-><-752809904_?||752809905_?-><-752809906_Endonuclease_5<-752809043_?<-501045893_TIMbarrel_redox<-501045895_?||752809907_DAM*-><-501045897_?<-501045898_URI<-501045899_Patatin||752809908_?->501045902_?-><-501045903_?<-501045904_? 752814769 <-Glyoxalase_4<-TRANSGLUTAMINASE||?->?-><-Acyl-ACP_TE||DAM*->METHYLASE-><-DnaJ||Ferredoxin-RRM-> DAM SP+N6_N4_Mtase ADEH_RS21070 215 bacteria>proteobacteria>deltaproteobacteria Anaeromyxobacter dehalogenans hypothetical protein, partial [Anaeromyxobacter dehalogenans]. 499742384_?-><-499742385_?<-499742386_Glyoxalase_4<-499742387_TRANSGLUTAMINASE||499742388_?->499742389_?-><-499742390_Acyl-ACP_TE||752814769_DAM*->499742392_METHYLASE-><-499742393_DnaJ||499742394_Ferredoxin-RRM->499742395_?->499742396_?-><-752814525_?||499742398_?-> # 1; 509139482 DnaJ->?->?->?->?->?->ParB+DAM->DAM*->?->?->?->?->?->DAM-> DAM UPF0020 M201_gp12 213 viruses>dsdna viruses, no rna stage Halovirus HCTV-2 AdoMet-MTase [Halovirus HCTV-2]. 509139477_DnaJ->509139478_?->509139479_?->509139480_?->509139548_?->509139549_?->509139481_ParB+DAM->509139482_DAM*->509139483_?->509139484_?->509139485_?->509139550_?->509139486_?->509139487_DAM->509139488_?->
Str-3 Str-4 Str-5 Str-6 Str 7 Synapomorphic strand Str-1 Str-2 ALIGN ------HH-HH-----H------------EEEEEE--------------------------------------------------HH-HHHHH-HHH----------------HEEEE----EEEEEE-------HHHHHHH----EEE------EEEEE--------------------HHEEEE----------E--------------------------------------------------------------------------------------------EEEE----E-------------------------HHHHHHHHHHEE-----EEEEEE-----EHHEEHH---EEEEE--HHHHH-HHHHHHHHHHHHHHH-----HHHHHHH---------HHHHHH----------------EEE----------------HHHHHHHHH-------------------- HMM --HH-HHH-HHHHHHHHHHH-------EEEEEEEE--------------------------E--E--E-------E-E------HHH-HHHHH-HH-----------------HEEE-----EEEEEEE------HHHHHHH--HHHHHH-HHHHHEEEE-------HHHHHHHHHHH--EEEEEE--------------------EEEEE---------------------------------------------E-----E-EE------------EE-------EEEHEH----H--HH---------------------HHHHHHHHHHHH----EEEEEE----HHHHHHHHH---EEEEEE--HHHH-HHHHHHHHHHHHH-----HEEHEEHHH--------HHHHHHHH--------E-----EEEEE-----EEHHHHHHHHHHHHHHHHEEE-----EEHHHHEEEE--- FREQ ---------------HHHHH---------EEEEE--------------------------------------------------HHH-HHHHH-HHH----------------HHH------EEEEEEE--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PSSM 
-----HHH-HHHHH-HHHHHH--------EEEEE------------------------------------------------------HHHHH-HHH----------------HHH------EEEEEEE-------HHHHHHH----H-------EEEEEE--------------------EEEEE-------------------------------------------------------------------------------------------------------EEE-------------------------------HHHHHHHHHH-------EEE-------HHHHHHHH----EEEE---HHHH-HHHHHHHHHHHHH------HHEE------------HHHHHHH-------EEE-----EEEEE-----EEE------HHHHHHHHHH-------------EEEE--- FINAL --HH-HHH-HHHHHHHHHHH---------EEEEEE-----------------------------E-------------------HHH-HHHHH-HHH----------------HHH------EEEEEEE-------HHHHHH-------------EEEEEE-------------------EEEEEE-------------------------------------------------------------------------------E------------E---------EEEE-------------------------------HHHHHHHHHH-------EEE------HHHHHHHHH----EEEE---HHHH-HHHHHHHHHHHH--------EE-------------HHHHHHH---------------EEEEE-----EE-------HHHHHHHHH--------------EEEE--- NAEGRDRAFT_76461_Naegleria_gruberi_strain_NEG-M_290971699 ERNG-YSA-ILLYGDVLELANI-FPKKVSYDLMICDLPY--------------------G-V--F--KD-SP-YDV-LFT----DDQ-LKEFI-DNL----------------YQITPDTNFPTWIFFG-----EYKQIVKLQELIELKK-GNAVICIWVK----NGRQFGANFGTYKYESFLLCF----------PNKQV-----NIKPQ---------------------------------------------GFSLSPF-SI------------CF------PSETNFL----K--DS-NS---------RE-CNKGQKPLSLISWLVYQFSNVDGIVLDLCSGTATTAVAAVSYGRNSISLESNHGQF-EHAAERLKLSEFEKVNFEPLVVCTVSEDKEKKEKKKSTKRKTAPTKK-TPKKASKKKKTETPLKGSKMQSKKAEEIAMKYIDSEAVEMASDEENVSEKDEEEDDEE---------------------- ADA73_RS21195_Bacteroides_fragilis_695344547 
IETD-----RIYLMDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-VN-WDR-QIP--------LAALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----AYHPQ--M--TPC---PPERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EIAERQIQEELECRNKVQEEKEI------REEKQNQ-------------------------------------------------------------------------------------------- LEP1GSC165_RS0218310_Leptospira_santarosai_696345163 -----MATHHLYYGDCLENLPK-IPDS-SVDLILCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SQPFTNYLINSNPKNFRY-E----LIWYK-TKASGFLNAKSRPNKSHEN-ILIF----------YGKQP-----VYNPI--K--YVI-D-ERYKR--K-G-K-T-LGNGN--QSTVFTI---R-GEKSENY-QY--LD-D-GS---RY------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLKLLRYLIKTYSNPGDTVLDNCMGHGTTGIAATELGRNFIGMERDKEYF-NKAQRKIQMAETRTQLELNFD----------------------------------------------------------------------------------------------------------- LEP1GSC071_RS16130_Leptospira_santarosai_696349061 -----MATHHLYYGDCLENLPK-IPDS-SVDLILCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SQPFTNYLINSNPKNFRY-E----LIWYK-TKASGFLNAKSRPNKSHEN-ILIF----------YGKQP-----VYNPI--K--YVI-D-ERYKR--K-G-K-T-LGNGN--QSTVFTI---R-GEKSENY-QY--LD-D-GS---RY------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLKLLRYLIKTYSNPGDTVLDNCMGHGTTGIAATELGRNFIGMERDKEYF-NKAQRKIQMAETRTQLGLNFES---------------------------------------------------------------------------------------------------------- LEP1GSC076_RS08120_Leptospira_696229311 
-----MATHHLYYGDCLENLPK-IPDS-SVDLILCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SQPFTNYLINSNPKNFRY-E----LIWYK-TKASGFLNAKSRPNKSHEN-ILIF----------YGKQP-----VYNPI--K--YVI-D-ERYKR--K-G-K-T-LGNGN--QSTVFTI---R-GEKSENY-QY--LD-D-GS---RY------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLKLLRYLIRTYSNPGDTVLDNCMGHGTTGIAATELGRNFIGMERDKEYF-NKAQRKIQMAETRTQLELNFES---------------------------------------------------------------------------------------------------------- LEP1GSC068_2949_Leptospira_sp_Fiocruz_LV3954_410015573 -----MATHHLYYGDCLENLPK-IPDS-SVDLILCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SQPFTNYLINSNPKNFRY-E----LIWYK-TKASGFLNAKSRPNKSHEN-ILIF----------YGKQP-----VYNPI--K--YVI-D-ERYKR--K-G-K-T-LGNGN--QSTVFTI---R-GEKSENY-QY--LD-D-GS---RY------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLKLLRYLIRTYSNPGDTVLDNCMGHGTTGIAATELGRNFIGMERDKEYF-NKAQRKIQMAETRTQLELNFES---------------------------------------------------------------------------------------------------------- LEP1GSC071_3962_Leptospira_santarosai_str_JET_410804029 -----MATHHLYYGDCLENLPK-IPDS-SVDLILCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SQPFTNYLINSNPKNFRY-E----LIWYK-TKASGFLNAKSRPNKSHEN-ILIF----------YGKQP-----VYNPI--K--YVI-D-ERYKR--K-G-K-T-LGNGN--QSTVFTI---R-GEKSENY-QY--LD-D-GS---RY------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLKLLRYLIKTYSNPGDTVLDNCMGHGTTGIAATELGRNFIGMERDKEYF-NKAQRKIQMAETRTQLGLNFES---------------------------------------------------------------------------------------------------------- T343_RS0105895_Leptospira_licerasiae_495865040 
-----MST-ELHLGDCLKILPK-IPDS-SIDLIFCDLPY--------------------G-T-----TD-CP-WDK-IIP--------MEKLW-PEY----------------ERISKENT--PIILTA---SQPFTTYLINSNPKNFRY-E----LIWYK-TKASGFLLAKKRPNKSHEN-ILVF----------YKKQP-----VYNPI--K--YEI-D-ERYRR--K-G-K-T-LGNGN--QSTVFRI---T-GEKSKNY-QY--LD-S-GS---RY------PDSVLCF-P--S--ES-EV--G---------MHPTQKPTRLLRFLIKSFSNPGDLVLDNCMGHGTTGIAAVELGRNFIGIEKERSYF-KKAESKIRMAEKRYSLGLDFET---------------------------------------------------------------------------------------------------------- LEP1GSC132_RS14950_Leptospira_kirschneri_490906211 -----MAT-LLYHGDCLNHLPK-IPDA-SVDLIFCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SQPFTNYLINSNPKKFRY-E----LIWYK-TKASGFLNAKSRPNKSHEN-ILVF----------YGKQP-----VYNPI--K--YVI-D-ERYKR--K-G-K-T-LGNGN--QSTVFAI---R-GEKSENY-QY--LD-D-GS---RF------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLRLLRYLIRTYSNPGDTVLDNCMGHGTTGIAAVELARDFIGMEMDKEYF-EKAKRKIQMAETRTQLELNFES---------------------------------------------------------------------------------------------------------- LEP2GSC066_RS0103865_Leptospira_santarosai_648272077 -----MATHHIYHGDCLKILPK-ISDA-SVDLIFCDLPY--------------------G-T-----TD-CA-WDI-IIP--------MEKLW-PEY----------------ERISKKKT--PIILTG---SQPFTNYLINSNPKNFRY-E----LIWYK-TKASGFLNAKTRPNKSHEN-ILVF----------YKKQP-----IYNPI--K--YEI-D-ERYRR--K-G-K-T-LGNGN--QSTVFSI---R-GEKSENY-QY--LD-D-GS---RY------PDSVLCF-P--S--ES-ET--G---------MHPTQKPLRLLRYLIKTFSNPGDTVLDNCMGHGTTGIASIELGRNFIGIERDKDYF-QKAKSKIKMAETRTQLGLNFES---------------------------------------------------------------------------------------------------------- LEP1GSC193_RS01970_Leptospira_alstonii_738085263 
-----MAT-HLYHGDCLDNLPK-IPDA-SVDLIFCDLPY--------------------G-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKERT--PIILTG---SHPFTNYLINSNPKNFRYYE----LIWYK-TKASGFLNAKTRPNKSHEN-ILVF----------YKNQP-----VYNPI--K--YQI-D-ERYKR--K-G-K-T-LGNGN--QSTVFTI---R-GEKSENY-QY--LD-D-GF---RY------PDSVLCF-P--S--ES-EI--G---------MHPTQKPLRLLRYLIKTYSNVGDTVLDNCMGHGTTGIAAIELARDFIGMEMDKEYF-EKAKRKIQMAETRIQLELNFES---------------------------------------------------------------------------------------------------------- AAY48_RS15275_Leptospira_interrogans_446543012 -----MAT-HLYHGDCLSHLPK-IPDT-SVDLIFCDLPY--------------------A-T-----TD-CS-WDV-IIP--------MEKLW-PEY----------------ERISKEKT--PIILTG---SHPFTNYLINSNPKNFRYYE----LIWYK-TKASGFLNAKTRPNKSHEN-ILIF----------YKNQP-----VYNPI--K--YEI-D-KRYKR--K-G-K-I-QGKGH--QSTVFTI---S-GEKSENY-QY--LD-D-GF---RY------PDSVLCF-P--S--EF-EI--G---------MHPTQKPLRLLRYLIKTYSHVGDTVLDNCMGHGTTGIAAVELARNFIGMEMDKEYF-EKAKRKIQMAETRTQLELNFES---------------------------------------------------------------------------------------------------------- BAPNAU_RS08760_Bacillus_amyloliquefaciens_549781736 LELN-----RIYQMDCLEGMKL-IPDN-SIDMILCDLPY--------------------G-Q-----TS-NS-WDS-VLP--------LDKLW-KQY----------------NRIIKDNG--AIVLTA---KGKFKINLINSNFKNYRY-E----WVWDK-NKGANFPHVKRMPLNVHEY-VVVF----------YNQQP-----VYNPQ--M--TEG-K-PYKQI--R-------KEESL--K---GIA---D-NINR----KT-TIS-N-GK---RY------PKSIIRV-EGIA-----QR---------NI-CHPTQKPVELFEYLIKTYSNEGDIVLDNCMGSGTTAVACEKLNRKWIGFEIVKEYI-AIANKRLDYF---HSISES------------------------------------------------------------------------------------------------------------- JO84_gp345_Aureococcus_anophagefferens_virus_672551258 
---M-----NILEGDCLFHMKN-ISDK-SVDMILCDLPY--------------------G-T-----TK-NK-WDS-VIP--------FYDLW-ENY----------------NRIIKDNG--AIVLFG---SQPFTTKLISSNMKDFRY-C----LVWEK-NKFSDFLNAKRKPMKTNED-ICIF----------YKKQP-----TYNPQ--Y--TYS-T-PYTRW--N-------TQSAV--D---KQT---N-YGGHKQN-IS--KS-D-GK---RL------PTTVLKF-N--R-----IE---------RP-DHPTQKPIDLLEWLIKTYSNENELILDNCMGVGSTGIAAKNTNRRFIGIEKDQNYF-KKATENLI------------------------------------------------------------------------------------------------------------------------ BCPBV781_gp10_Burkholderia_phage_Bcep781_23752321 HVNY-----ELYWGDCLDLMRL-LPDA-SVDMVMCDLPY--------------------G-T-----TA-CA-WDS-VLP--------FDALW-AQY----------------RRIVKSRG--AVVLTA---AQPFTSALVASNFEWFKY-D----WVWAK-NRPTNFAHAKNKPMPKHES-VLVFSPGTTVHASQSKLRM-----TYNPQG-L--TRI-E-PRKMK----------TYNTD--A---MFS---K-RGSHGEY-----TQ-E-FT---NY------PHSLLEF-S--T-----DQ-----LN-----LHPTAKPVALMEYLIRTYTSEGDTVLDNCMGSGTTGVACINTGRRFIGMEKDADYA-LIATGRMR---EAIDRD----------IPIDLC----------------------------------------------------------------------------------------------- BCPBV43_gp10_Burkholderia_phage_Bcep43_41057660 HVNY-----ELYWGDCLDLMRL-LPDA-SVDMVMCDLPY--------------------G-T-----TA-CA-WDS-VLP--------FDALW-AQY----------------RRIVKSRG--AVVLTA---AQPFTSALVASNFEWFKY-D----WVWAK-NRPTNFAHAKNKPMPKHES-VLVFSPGTTVHASQFKLRM-----TYNPQG-L--TRI-E-PRKMK----------TYNTD--A---MFS---K-RGSHGEY-----TQ-E-FT---NY------PHSLLEF-S--T-----DQ-----LN-----LHPTAKPVALMEYLIRTYTSEGDTVLDNCMGSGTTGVACINTGRRFIGMEKDADYA-LIATGRMR---EAIDRD----------IPIDLC----------------------------------------------------------------------------------------------- HMPREF9022_RS14035_Erysipelotrichaceae_bacterium_2_2_44A_496003735 
-MES-----YIKHGDCLEVMKD-IPDK-SIDMILCDLPY--------------------G-T-----TQ-CK-WDV-VIP--------FDKLW-EQY----------------CRVAKDNA--AIVLFG---AEPFSSRLRLSNVQMYKY-D----WIWDK-VKGTGFLNAKKQPLRNHEV-ICVF----------YKSQC-----TYNPQ--M--TSG-Q-RKVSY--R-------RKGLQ--T---DVY---G-QADEDYI-----YD-S-AA---RY------PRSIQVF-S--A--DT-QK---------CS-LQPTQKPIALLEYLIRTYTNDYDIVLDNCMGSGSTCIAAQNTNRKYIGIESEESIF-NTAKDRIK----------------------------MNKTQL-----QLF------------------------------------------------------------------------------ LF41_RS05185_Lysobacter_dokdonensis_738211128 --MI-----DLYQGDCLEVMGR-LPSN-SVDLILCDLPY--------------------G-T-----TS-CK-WDS-VIP--------FDALW-SQY----------------RRIAKRNA--AIVLTA---NQPFTTALIASNLCEFRY-T----WVWDKVNRPTGFLNAKLRPLRAFED-VCVF----------YRAQP-----TYNPQK-W---RG-E-PYKTT----------HGSSG--E---AYH---R-TETRTQV-----CA-D-GM---RY------PQDLIRI-K--A-----DN-----RGVEGR-VHPTQKPVALMEYLVKTYSNEGDTVLDNCMGSGTTGVACANTGRRFIGIERDADYF-TIASKRVG---VAGAKP------R-VWVPLAVSGD--------------------------------------------------------------------------------------------- B4072_RS12970_Bacillus_subtilis_752704809 LKKK-----RIYQMDCLEGMPL-IPDK-SIDMILCDLPY--------------------G-T-----TR-NK-WDS-IIP--------FDKLW-EQY----------------KRIIKDNG--AIVLTA---AQPFTSALIMSNVKDFKY-E----WIWKK-SNGTGHLNAKRMPMKDHES-ILVF----------YKKQP-----TYNPQ-------GIV-PYNRV--T-------RRGGN--G---GNY---N-SSNT----SN--FQ-E-YT---NY------PRTIQQF-A--Y-----DK---------KK-YHPTQKPVALFEYLIKTYSNEGDTVLDNCMGSGTTAVACENLNRKWIGFETESKYI-EIANNRLKEL-HSISNF--------------------------------------------------------------------------------------------------------------- M089_3211_Bacteroides_ovatus_str_3725_D9_iii_649530658 
-------------MDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-AN-WDR-QIP--------LTALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----VYHPQ--M--TPC---PSERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EITERRIQ---EELEC--------------------RNKAQEEKEI-------------------REEKQINKSMTWTKKES---------------------------------------------- B4069_2083_Bacillus_subtilis_751875430 LKKK-----RIYQMDCLEGMPL-IPDK-SIDMILCDLPY--------------------G-T-----TR-NK-WDS-IIP--------FDKLW-EQY----------------KRIIKDNG--AIVLTA---AQPFTSALIMSNVKDFKY-E----WIWKK-SNGTGHLNAKRMPMKDHES-ILVF----------YKKQP-----TYNPQ-------GIV-PYNRV--T-------RRGGN--G---GNY---N-SSNT----SN--FQ-E-YT---NY------PRTIQQF-A--Y-----DK---------KK-YHPTQKPVALFEYLIKTYSNEGDTVLDNCMGSGTTAVACENLNRKWIGFETESKYI-EIANNRLKEL-HSISNF--------------------------------------------------------------------------------------------------------------- BAT_RS05535_Bacillus_pumilus_489305499 LELN-----RIYQMDCLEGMKL-IPDE-SVDLILCDLPY--------------------G-T-----TD-VKRWDK-IIP--------IEKLW-EQY----------------KRIIKETG--NVVLFG---SQPFTSYLVNSNPSMFRY-E----WIWDK-TKGANFLNSNHQPLKVHEN-ILVF--SKLPASPNKKGTA-----TYFPQK-T--EGK-E-YKVKR----------SSHKG--E---IFN---G-GSLRDNF-----EKVN-EG---RH------PVSIQTF-L--K-----DK---------DN-IHPTQKPVEMCEYLIRTYTDQSDIVLDNCMGSGTTAVASIISQRKWIGFETDPTFY-QLANKRLE---QVQLGD------D-L-ASYQ------------------------------------------------------------------------------------------------- JI66_RS07000_Lactobacillus_kunkeei_736516671 
MNKA-----QLQKGDCIKLMHE-LPDK-SVDMILCDLPY--------------------G-I-----TN-HK-WDS-IIP--------YDDLW-TEY----------------ERIIKDNG--AIVLFG---AEPFSTKLRMSNIKLYRY-D----WVWLK-SRATLFQMSHKRPMNKHEL-ISVF----------YKHLP-----TYNPQ--M--SKG-K-PYKTN--G-------RRERK--A---SGF---L-SSGMVNI-PR--NN-K-GT---RY------PTTILDFPN--S-----NA---------KR-YHPTEKPINILSYLIKTYTNENEVVLDNCMGSGSTGVACIRNNRKFIGFELNGHYF-EVAQKRIN---NELDSL--------------------------------------------------------------------------------------------------------------- BAT_RS05555_Bacillus_pumilus_489305329 LELN-----RIYQRDCIEGMRM-LPDK-SIDMILCDLPY--------------------G-T-----TR-NK-WDI-VIP--------LDSLW-EQY----------------ERVVKDNG--AIVLTA---AQPFTSLLVSSNPKLFRY-D----ITWDK-KQITGFLNAKRMPLRKHED-ILIF----------YKKPP-----TYNPQ--F--TFG-D-SYEV---R-------RKHST--S---NYG---S-QNEN----ET--KS-D-GR---RY------PTSIIEI-P--QIR----E---------KG-GHPTQKPVKLFEWLIKTFTNEGDIILDSCIGSGTTAVAATQLNRNFIGFEIETEYA-KRANQRLD---SSVRGSSIKEAPHDK------------------------------------------------------------------------------------------------------ HMPREF9505_RS14705_Enterococcus_faecalis_488335929 MELN-----KIYNEDCLEGMKR-ISDK-SIDMILCDLPY--------------------G-T-----TD-NK-WDV-IIP--------FDKLW-EQY----------------ERIIKDSG--AIVLTG---SQPFTTDIIMSNRKLFRY-E----WIWNK-NQASNFFMANKMPLKVHEN-ILVF----------YKKLP-----TYNKQ--MIPRTN-P-SVAI---A-------QERGY--V---YDG---A-KSDNYNISTV--KM-S-PK---GYDKNWKNPISILNI-N--QLKNNSNE---------RC-GHPTQKPVALFEHLIKTYTNEGEIVLDNCIGSGTTAVAAINTNRQFIGFEKEKEYF-DVAIERIK---KASEEDDSKV----------------------------------------------------------------------------------------------------------- H581_RS0105900_Paenibacillus_harenae_655162666 
--LN-----TIIHGDCLDVMEE-IDAA-SIDMILCDLPY--------------------G-T-----TQ-ND-WDS-VIP--------LEKLW-TQY----------------KRIIKDNG--AIVLTA---QTPFDKVLGCSNLSMLRY-E----WIWEK-TSATGHLNANRMPLKAHEN-ILVF----------YKSLP-----DYNPQ--M--TTGHK-PVNSY--T-------KHQDD--G---SNY---G-KTKIGI--SG--GG-R-TD---RY------PRSVICF-P--T--DK-QI---------EA-LHPTQKPIELFKYLIETYSNVGDTVLDNCIGSGTTAAAALSCGRNFIGIEKEWKYV-QIARNRMEYV-QPVINF--------------------------------------------------------------------------------------------------------------- LPP122_RS10265_Lactobacillus_paracasei_695862061 -------------------MAE-LPTA-SIDMILCDLPY--------------------G------TTA-NT-WDK-IIP--------FASLW-GQY----------------ERLIKPQG--AIVLTA---NERFSADVVQSNPALYRY-K----WVWVK-NTVTNFVNAKNRPLSRFEE-ILVF---SKSGTANYGNSPDTIGMNYFPQG-L--MPY---------------------NK--TVTSRKY---EQSNQLHPW-NA--PD-TYTQEWTNY------PSDVLNY----K--SD-RT--G---------WHPTQKPVELFAYLIKTYSQPNDLVLDNCMGSGTTAIAAIDTDRHFIGYEISHEYW-QRANDRIA---NHHNT--------------------QTALF--------------------------------------------------------------------------------------- C171_RS19470_Paenibacillus_sp_FSL_H8-237_738793084 --LN-----QIINADCFDVFPN-IKTG-SVDMILCDLPY--------------------G-T-----TQ-SP-WDS-ILP--------FDQLW-MAY----------------ERIIKDNG--AIVLFA---KAPFDKALAASNMKLFKY-E----WIWEK-NKATGHLNKSLMPLQAHEN-ILVF----------YKRPP-----TYNAQ--M--SQGHK-PMNAA--T-------NNHKS--S---V-Y---G-DGIPWS--NE--AG-K-TE---RL------PRSVLYY-PVVN--ND-DP---------ER-IHPNQKPVELCEYFIRTYTNPGETVLDNCAGSCTTAVAASRTGRNYIAIERDKRHA-ADGTQRLKNM-QLTLF---------------------------------------------------------------------------------------------------------------- EH55_RS12840_Synergistes_jonesii_740126887 
NPNG-----KLYHGDCLEIMKD-IPDG-SVDMVLCDLPY--------------------G-M-----TA-CD-WDV-VIP--------FEPLW-EHY----------------NRICKRNA--AVVLFS---QQPFTTDIINSNRKKFRY-E----IIYRK-TMKMGFLNAHKMPLKGHEN-ICVF----------YKALP-----TYNPQKTQ--SRN-R-PRIRV----------QEDAR--C---RIY---S-KFKGGVM-----VD-D-GS---RY------PESVIDF-S--N-----FNGGGFVKNRERT-KHPTQKPVPLLEYLIRTYSNEKETILDNCIGSGSTAVAAENTGRRWIGIEKEEHFC-EVAKKRIA---EAAAQG------R-LLLLQG------------------------------------------------------------------------------------------------- BCPBV1_gp10_Burkholderia_phage_Bcep1_38638617 -----------MFGDCLLAMHE-LPAQ-SVDLVLCDLPY--------------------G-T-----TR-NR-WDT-PLD--------LSRLW-VAY----------------RHVCKPGA--PVLLFA---QTPFDKVLGASNLPELRY-E----WIWEK-TNATGFLNAKRAPLKAHEN-ILVF----------CDRAP-----TYRPI--K--TSG-H-VRKTS--T-------RLG-Y--S---SNY---G-AQAVSS------YD-S-TE---RY------PRSVLRF-A--S--DK-QR---------SK-LHPTQKPVALLEYLIRTHAAPGAVVLDNCMGCASTALAAMQAGCAFIGIENDVEHF-ETAQRRVR------------------------DY--QP------------------------------------------------------------------------------------------ EH55_RS07365_Synergistes_jonesii_740127826 NPNG-----KLYHGDCLEIMKD-IPDG-SVDMVLCDLPY--------------------G-M-----TA-CD-WDV-VIP--------FEPLW-EHY----------------NRICKRNA--AVVLFS---QQPFTTDIINSNRKKFRY-E----IIYRK-TMKMGFLNAHKMPLKGHEN-ICVF----------YKALP-----TYNPQKTQ--SRN-R-PRIRV----------QEDAR--C---RIY---S-KFKGGVM-----VD-D-GS---RY------PESVIDF-S--N-----FNGGGFVKNRERT-KHPTQKPVPLLEYLIRTYSNEKETILDNCIGSGSTAVAAENTGRRWIGIEKEERFC-EVAKKRIA---EAAAQG------R---LLQG------------------------------------------------------------------------------------------------- HMPREF1981_RS00510_Bacteroides_pyogenes_545404645 
IRID-----EIYNEDCLEGMKR-IADR-SIDAIVCDLPY--------------------G-MLNRRNRY-AA-WDR-LIA--------LEPLW-EQY----------------RRIIKPDS--PVILFA---QGMFTARLLMSQPRLWRY-N----LVWYK-DRASGHLNANRMPLRKHED-ILVF----------YEHLP-----VYHPQ--M--IPC---DKSQR--N-H---G-RRTRQ--TFTNRCY---G-DMRMTEV-RI--AD-D------KY------PTSVIQI-A--K--EH-KN--G--AF-----YHPTQKPVALVEYLIRTYTDKGDVVLDNCMGSGTTAIAAIRSGRHYIGFETDAGYC-RIAGERIK---AE-KL--------------------TETENENNPI------NNENNA---------------------------------------------------------------------- BcepNY3gene09_Burkholderia_phage_BcepNY3_149882909 ANRC-----ELMFGDCLLAMHE-LPAQ-SVDLVLCDLPY--------------------G-T-----TR-NR-WDT-PLD--------LSRLW-VAY----------------RHVCKPGA--PVLLFA---QTPFDKVLGASNLPELRY-E----WIWEK-TNATGFLNAKRAPLKAHEN-ILVF----------CDRAP-----TYRPI--K--TSG-H-VRKTS--T-------RLG-Y--S---SNY---G-AQAVSS------YD-S-TE---RY------PRSVLRF-A--S--DK-QR---------SK-LHPTQKPVALLEYLIRTHAAPGAVVLDNCMGCASTALAAMQAGCAFIGIENDVEHF-ETAQRRVR------------------------DY--RS------------------------------------------------------------------------------------------ N355_gp092_Cellulophaga_phage_phi13:2_526178238 MKRN-----EIYLGDCLELMPKHVEDK-SIDMIFCDLPY--------------------G-T-----TQ-CK-WDS-IID--------LDKLW-NEY----------------RRVIKDNG--VIVLFA---SQPFTSILTSSNLKMFKY-S----YTWDK-ITKTNHLNAKKQPLRQVED-ICVF----------YKKQP-----TYKPQ--G--LIE-C-EVSNF--RPN---HFKYKKG--E---KVY---G-EQKEHGN-----KS-T-YT---NY------PSNLIQY-S-----NG-NH---------NS-LHPTQKPLDLIEYMIKTYTNEGDLILDNTCGSGTTGLGAKNLGRNFIMMEQDPKYY-DVACKRVLT----------------------------------------------------------------------------------------------------------------------- P667_3626_Acinetobacter_baumannii_691080530 
-MTF-----KLHHGDCLEIMAN-IPDQ-SIDMILCDLPY--------------------G-T-----TC-CA-WDT-VIS--------FNPLW-AHY----------------ERIIKPNG--AIVLFA---ANPFAAVLATSNLKLFRY-E----MIWEK-PAATGFLNAKKQPLRAHEN-ILVF----------YKSQP-----TYNPQ--K--TTG-H-KRKTA--K-------RKDIG--S---EHY---G-KQLNIKD-----YD-S-TE---RY------PRSVQLF-S--S--DK-QK---------SN-LHPTQKPVALCEYLIRTYTNVGEVVLDNCMGSGTTGIACINTDRKFIGIEKEAKYF-EIAKKRLA------------------------DA--VEIKQT-----ELFSEVV-------------------------------------------------------------------------- LSJ_RS10550_Lactobacillus_salivarius_763125951 MDLK-----K---GDCLELLGG-VQDM-SIDLILCDLPY--------------------G-T-----TR-NK-WDK-IID--------LDKLW-EHY----------------NRIIKDNG--AIVLFS---QQPFSSKLIESNPKMFRY-E----RIWTK-GLATGHLNAKKMPLKKHEN-ILVF----------YKKLP-----TYNPQ--W--WYS-T-PYKV---K-------QGRSK--S---SNY---D-KQRPYTP-SE--SK-D-GR---RY------PVDIIEF-K--H-----DG---------KK-LHPTQKPVALLEYLIKTYTNEGDTVLDNCMGSGSTGVACANTNRNFIGIELSSEYY-NIAKDRIE---KAVAK---------------------------------------------------------------------------------------------------------------- LSJ_3100c_Lactobacillus_salivarius_690349817 --MK-----K---GDCLELLGG-VQDM-SIDLILCDLPY--------------------G-T-----TR-NK-WDK-IID--------LDKLW-EHY----------------NRIIKDNG--AIVLFS---QQPFSSKLIESNPKMFRY-E----RIWTK-GLATGHLNAKKMPLKKHEN-ILVF----------YKKLP-----TYNPQ--W--WYS-T-PYKV---K-------QGRSK--S---SNY---D-KQRPYTP-SE--SK-D-GR---RY------PVDIIEF-K--H-----DG---------KK-LHPTQKPVALLEYLIKTYTNEGDTVLDNCMGSGSTGVACANTNRNFIGIELSSEYY-NIAKDRIE---KAVAK---------------------------------------------------------------------------------------------------------------- B4145_RS14775_Bacillus_subtilis_516293791 
IQLN-----KAYQLDCLEGMKL-IPDK-SVDMILCDLPY--------------------G-T-----TQ-NK-WDS-IIP--------LDKLW-EQY----------------ERIIKDNG--AIVLTA---QTPFDKVLGGSNLKLLKY-E----WIWEK-NRGTGHLNAKKMPMKNHEN-ILVF----------YKKLP-----TYNPQ--M--REG-E-PYQRLNCS-------KNALN--K---GNY---G-KTKD----SHSTVS-D-GK---RY------PLSVLDF-A--V-----VE---------RT-IHPTQKPVELFEYLIKTYTNEGEIVLDNCLGSGTTAIACELNNRKWIGFETEQQYI-ELINKRLDSIQLNYNLENLNGLT--------------------------------------------------------------------------------------------------------- LF41_2421_Lysobacter_dokdonensis_DS-58_702087568 -------------------MGR-LPSN-SVDLILCDLPY--------------------G-T-----TS-CK-WDS-VIP--------FDALW-SQY----------------RRIAKRNA--AIVLTA---NQPFTTALIASNLCEFRY-T----WVWDKVNRPTGFLNAKLRPLRAFED-VCVF----------YRAQP-----TYNPQK-W---RG-E-PYKTT----------HGSSG--E---AYH---R-TETRTQV-----CA-D-GM---RY------PQDLIRI-K--A-----DN-----RGVEGR-VHPTQKPVALMEYLVKTYSNEGDTVLDNCMGSGTTGVACANTGRRFIGIERDADYF-TIASKRVG-----VAGAKP------R-VWVPLAVSGD------------------------------------------------------------------------------------------- PLA107_32876_Pseudomonas_amygdali_pv_lachrymans_str_M301315_330989854 KEEI-----QLYKGDCLELMKS-IPDA-SVDMILCDLPY--------------------G-T-----TQ-NK-WDC-PID--------LSRLW-PEY----------------WRICKPSA--AIILTA---QTPFDKILGASQIGHLKY-E----WIWEK-TAATGFLNAKKSPLKAHEN-VLVF----------YRKQP-----TYNPA--M--TAG-HTIKRTN--A-------SYANH--G---ANY---G-KSSSVRA----PYE-S-TE---RY------PRSVQKL-P--K--DN-RL---------KN-QHPTQKPVALMEYLIRTYTNEGDIVLDNCMGSGTTGVACIHSGRRFIGIERDEKIF-GTASDRIASAIALRNTPVPQIELFGTA----------------------------------------------------------------------------------------------------- AAY85_RS20710_Pseudomonas_amygdali_763469483 
KEEI-----QLYKGDCLELMKS-IPDA-SVDMILCDLPY--------------------G-T-----TQ-NK-WDC-PID--------LSRLW-PEY----------------WRICKPSA--AIILTA---QTPFDKILGASQIGHLKY-E----WIWEK-TAATGFLNAKKSPLKAHEN-VLVF----------YRKQP-----TYNPA--M--TAG-HTIKRTN--A-------SYANH--G---ANY---G-KSSSVRA----PYE-S-TE---RY------PRSVQKL-P--K--DN-RL---------KN-QHPTQKPVALMEYLIRTYTNEGDIVLDNCMGSGTTGVACIHSGRRFIGIERDEKIF-GTASDRIASAIALRNTPVPQIELFGTA----------------------------------------------------------------------------------------------------- LEP1GSC108_RS06340_Leptospira_weilii_490637745 -----MRT-DLHYANCFKIFPT-IPDK-SIHLILCDLPY--------------------G-T-----TD-CE-WDI-LLP--------FEALW-KEY----------------ERIITDNG--AIILTA---SQPFTTKLINSNPKLFRY-E----LIWYK-SKASGFLNANKMPNKSHEN-ILIF----------YKKLP-----TYNPQ--K--YQI-D-PKFQR--K-G---K-SKKKP--QSSLFNI---R-GKKSESY-QY--FD-N-GL---RH------PDSVLCF-P--S--EM-RK--G---------MHPTQKPVALMKFLVSSYSNVGDTVLDNCMGSGTTGIACVELDRNFIGIEQEEEFF-ELASRRIATANKIRRLESIESAFSKQKKETEKNL---------------------------------------------------------------------------------------------- LSS_RS02805_Leptospira_santarosai_490593464 NQKIEPSI-QLFNDDCFNRLPQ-IPDK-SIKMILCDLPY--------------------G-T-----TD-CS-WDT-ILP--------FKPLW-EQY----------------NRVIVENG--AIILTA---SQPFTTALINSNPKHFRY-E----LIWYK-TKASGFLNANKMPNKSHEN-ILIF----------YKKLP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-STQKN--YSKLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPTALMNFLISSYSNAGDTVLDNCMGSGTTGVACVQTDRNFIGIEKEVEYF-ELAERRIEIAIKIRKLKTITSLFSEKENTND------------------------------------------------------------------------------------------------- LEP1GSC187_RS02085_Leptospira_santarosai_490621523 
TQEIKSSI-QLFNDDCFNRLPQ-IPDK-SINMILCDLPY--------------------G-T-----TD-CS-WDT-ILP--------FKPLW-EHY----------------NRVIVENG--AIILTA---SQPFTTALINSNPKHFRY-E----LIWYK-TKASGFLNANKMPNKSHEN-ILIF----------YKRLP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-SAKKN--YSNLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPIALMNFLISSYSNAGDTILDNCMGSGTTGVACIQTDRNFIGIEKEEEYF-ELAKRRIEIAIKIRKLKTITSLFSEKENTND------------------------------------------------------------------------------------------------- LEP1GSC070_RS05690_Leptospira_santarosai_490626771 KQKIEPSI-QLFNDDCFNRLPQ-IPDK-SIKMILCDLPY--------------------G-T-----TD-CS-WDT-ILP--------FKPLW-EQY----------------NRVIVENG--AIILTA---SQPFTTALINSNPKHFRY-E----LIWYK-TKASGFLNANKRPNKSHEN-ILIF----------YKKLP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-STQKN--YSKLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPTALMNFLIKSYSNTGDTVLDNCMGSGTTGVACVQTDRNFIGIEKEVEYF-ELAERRIEIAIKIRKLKTITSLFSEKENTND------------------------------------------------------------------------------------------------- LEP2GSC171_RS25015_Leptospira_weilii_757125221 ------------------MFPA-IPDK-SIHLILCDLPY--------------------G-T-----TD-CE-WDI-ILP--------FEALW-KEY----------------ERIITDNG--AIILTA---SQPFTTKLINSNPKLFRY-E----LIWYK-SKASGFLNANKMPNKSHEN-ILIF----------YKKLP-----TYNPQ--K--YQI-D-PKFQR--K-G---K-SKKKP--QSSLFNI---R-GKKSESY-QY--FD-N-GL---RH------PDSVLCF-P--S--EM-RK--G---------MHPTQKPVALMKFLVSSYSNVGDTVLDNCMGSGTTGVACAELDRNFIGIEQEEEFF-ELASRRIATANKIRRLESLESAFSKQKKETEKNL---------------------------------------------------------------------------------------------- LEP1GSC086_RS09945_Leptospira_weilii_490633882 
NKNTEPSI-QLFNDDCFNIFPQ-IPDK-SVNLVLCDLPY--------------------G-T-----TD-CS-WDK-VLP--------FKELW-EQY----------------NRMIVENG--AVILTA---SQPFTTALINSNPKNFRY-E----LIWYK-TKASGFLNANKMPNKSHEN-ILIF----------YKKLP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-SSKKN--YSKLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPTALMNFLISSYSNVGDTILDNCMGSGTTGVACIQTDRNFIGIEKEEEYF-ELAQRRIEIAKKIRRLKTLPSIFSEKEKTDE------------------------------------------------------------------------------------------------- LEP1GSC133_0802_Leptospira_borgpetersenii_serovar_Pomona_str_200901868_464395818 LERAETSI-QLFNDDCFNIFPK-IPDK-SINLVLCDLPY--------------------G-T-----TD-CS-WDK-ILP--------FKELW-EQY----------------NRMIVENG--AVILTA---SQPFTTALINSNPKNFRY-E----LIWYK-TKASGFLNANKMPNKSHEN-ILIF----------YKKRP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-SSKKN--YSKLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPTALMNFLIRSYSNVGDTILDNCMGSGTTGVACVKTDRNFIGVEKEEEYF-DLANRRIEIAKKVRRLKALPSIFSEKEKTDE------------------------------------------------------------------------------------------------- LSS_RS12690_Leptospira_santarosai_490596716 NQKIEPSI-QLFNDDCFNIFPQ-IPDK-SINLILCDLPY--------------------G-T-----TD-CS-WDT-ILP--------FKPLW-EQY----------------NRVIVENG--AIILTA---SQPFTTALINSNPKHFRY-E----LIWYK-TKASGFLNANKMPNKSHEN-ILIF----------YKKLP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-STKKN--YSKLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPTALMNFLISSYSNVGDRILDNCMGSGTTGVACVQTDRNFIGIEKEVEYF-ELAERRIEIAKKVRRLKILPSIFSEKENKDE------------------------------------------------------------------------------------------------- LEP1GSC133_RS04220_Leptospira_borgpetersenii_763222313 
MERAETSI-QLFNDDCFNIFPK-IPDK-SINLVLCDLPY--------------------G-T-----TD-CS-WDK-ILP--------FKELW-EQY----------------NRMIVENG--AVILTA---SQPFTTALINSNPKNFRY-E----LIWYK-TKASGFLNANKMPNKSHEN-ILIF----------YKKRP-----VYNPQ--K--YQI-D-PKFQR--K-G---K-SSKKN--YSKLFNV---R-GPKSETY-QY--LD-L-GQ---RH------PDSVLCF-P--S--ES-GK--G---------IHPTQKPTALMNFLIRSYSNVGDTILDNCMGSGTTGVACVKTDRNFIGVEKEEEYF-DLANRRIEIAKKVRRLKALPSIFSEKEKTDE------------------------------------------------------------------------------------------------- BF33_RS19405_Bacillus_cereus_756411756 --LN-----EIHNMDCLEGMKL-LQSK-SIDMILCDLPY--------------------GVT-----AR-NK-WDV-IIP--------FDKLW-EQY----------------ERIIKDNG--AIVLTA---TQPFASKLIMSNPDLFRY-D----WIWEK-TLATGHLNAKKMPMRAHES-ILVF----------YKKLP-----TYNPM--K--TKGHA-PVNSY--T-------KHQDD--G---TNY---G-KTKVGI--SG--GG-S-TE---RY------PRSVQRF-S--A--DK-QK---------EA-IHPTQKPVALFEYLIKTYTNEGETILDNCMGSGTTAVAAINTNRNFVGFEKDLEIH-AAANQRINNL---QQVLQ------------V--------I---------------------------------------------------------------------------------------- BCEP1808_RS36475_Burkholderia_vietnamiensis_500205672 KNNP-----VLMQGDCLELLET-IPDN-SIDMVCCDMPY--------------------G-T-----TN-CR-WDA-TLD--------LRRLW-AQY----------------RRVTTENA--AIVLFA---QTPFDKVLGVSNLEWLRY-E----LIWQK-THATGHLNAKKMPMKAHEN-ILVF----------YNKLP-----TYNPQ--K--TTG-H-IRKTS--V-------KRRDN--T---SVY---G-EQNFVEL----SYE-S-TD---RH------PRSVLTF-P--K--DT-QR---------IA-LHPTQKPLALIEWLVSTFTNEGDAVLDNCMGSGTTGEACQRLGRRFVGMELDESHF-AVASSRILSGGVPALRNAA------------------------------------------------------------------------------------------------------------- BACPLE_RS11710_Bacteroides_plebeius_494836062 
AKDI-----TLYKADCLEVMPF-LPES-SIDLVLCDPPF--------------------G-I-----TA-SQ-WDK-IIP--------FPEMW-KEI----------------RRVRKENA--PTVLFG---SEPFSSLLRCGNLEEFKY-D----WVWEK-SKASNFLLAKKQPLKAHEL-ISVF----------GKGRT-----PYYPI--M--EEG-E-PYGNR--T-------KRGSN--W---TGI---G-KVPNPTF-RN--EN-R-GT---RY------PRSVIYF-K--T--AE-SE--G------KT-IHVNQKPIALLQYLIRTYTKEGDTVLDFASGSMSTAIACIYTHRKCICIEKDETHF-SQGEKRVR-----NEYQY---LRL-------------------------------------------------------------------------------------------------------- HMPREF1007_RS17585_Bacteroides_sp_4_1_36_495941257 AKDI-----TLYKADCLEVMPL-LPES-SIDLVLCDPPF--------------------G-T-----TA-SQ-WDK-IIP--------FPEMW-KEI----------------RRVRKENA--PTALFG---SEPFSSLLRYSNLDEFKY-D----WVWEK-SKASNFLLAKKQPLKAHEL-ISVF----------CKGKT-----PYYPI--M--EEG-E-PYGNR--T-------KRGSN--W---TGI---N-NVPNPAF-RN--EN-R-GI---RY------PRSVKYF-K--T--AE-SE--G------KT-IHVNQKPIALLQYLIKTYTKEGDTVLDFASGSMSTAIACIYTNRKCICIEKDEKYF-SQGEGRVR-----NEYQHTAGLRFLNKVICK------------------------------------------------------------------------------------------------- ADC57_RS01530_Bacteroides_490416978 AKDI-----TLYKADCLEVMPF-LPES-SIDLVLCDPPF--------------------G-I-----TA-SQ-WDK-IIP--------FPEMW-KEI----------------RRVRKENA--PTVLFG---SEPFSSLLRCGNLEEFKY-D----WVWEK-SKASNFLLAKKQPLKAHEL-ISVF----------GKGRT-----PYYPI--M--EEG-E-PYGNR--T-------KRGSN--W---TGI---N-NVPNPTF-RN--EN-K-GT---RY------PRSVIYF-K--T--AE-SE--G------KT-IHVNQKPIALLQYLIRTYTKEGDTVLDFASGSMSTAIACIYTHRKCICIEKDETHF-SQGEKRVR-----NEYQY---LRL-------------------------------------------------------------------------------------------------------- M099_RS00805_Bacteroides_vulgatus_696379522 
AKDI-----TLYKADCLEVMPF-LPES-SIDLVLCDPPF--------------------G-I-----TA-SQ-WDK-IIP--------FPEMW-KEI----------------RRVRKENA--PTVLFG---SEPFSSLLRCGNLEEFKY-D----WVWEK-SKASNFLLAKKQPLKAHEL-ISVF----------GKGRT-----PYYPI--M--EEG-E-PYGNR--T-------KRGSN--W---TGI---G-KVPNPTF-RN--EN-R-GT---RY------PRSVIYF-K--A--AE-SE--G------KT-IHVNQKPIALLQYLIRTYTKEGDTVLDFASGSMSTAIACIYTHRKCICIEKDETHF-SQGEKRVR-----NEYQY---LRL-------------------------------------------------------------------------------------------------------- AMCSP14_RS14090_Streptococcus_pneumoniae_446323973 MEID-----KIIKKDVLEFMET-IPDN-KIDLIVTDPPYLINYKT--------------N---WRK-EK-HK-FSN-VIKNDNNPEL-IKEYI-KEC----------------YRILKDDT--AIYIFC---SFDKVDFFKKEIEKYFSV-K-NI-IIWRK-NNHT-AGDLEAQFGKQYEM-IILA----------NKGRK--------------------------------------KFN--------------GERLTDV-----WD---FK---RV--------------S--S-----DK------L-----LHQNQKPIELIKRCIVKHSDVGDTVFDGFMGSGTTALAALETDRHFIGTEIDEYYF-GIAEERIK-----NHNAQ---LSLFDEV---------------------------------------------------------------------------------------------------- EL26_10775_Tumebacillus_flagellatus_660643078 MTNI-----TITNEDCMCLLRR-TESE-SVDLVLTDPPY--------------------G-I-EF--RA-TR-GSK-VAT--------AKGIL-NDH-KDNIGFLESVAVEL-YRVLKPNS--HLYWFT-R-WDKVEEQLPMLRRCGFRP-K-NA-MIWIK-GGHGMSDTLG-AYAPEYEV-VLFC----------HKGRR--------------------------------------LLN------------E-------------VD---GR---KR------HTDVLRF----S--KI-AP--G------SL-VHSHQKPTALLEFLIQKSSNAGDLVLDPFLGSGSTALAARNTGRSFVGCELSEEIF-QIAQQQLAA----------------------------------------------------------------------------------------------------------------------- OBV_RS14990_Oscillibacter_valericigenes_503885074 
TDGL-----HLMDG--IKGLLS-LPQH-SVDMLLTDPPY--------------------G-T-----TR-NF-WDV-PLP--------LPKLW-EAV----------------RWAVKPEG--AVLLFA---QCPYDKVLGASNLPMLRY-E----WIWYK-ERGTGFLNANRAPLKKSEN-ILVF---------YQKSP------VYHPQ--F--TYG-EPYRKTF--P-------RSG-T--S---SNY---G-KFERTAS----VSN-D-GR---RY------PGNVLFI-P--T-----VT---------GG-VHPTQKPVELCEYLIKTYTDEGAVVADICAGSGTTAVAALNTGRHFVCFEIAPAFY-SSATGRLEQAR-LAVERGEKGV---------------------------------------------------------------------------------------------------------- N007_RS30730_Alicyclobacillus_acidoterrestris_665851256 MELN-----RIYQMDVLDGVKL-VADN-SVDLVVTDPPYLMNYRS--------------N---RRV-VR-NK-FDY-IHNDQSSYDL-IATFI-DEC----------------YRVMKDNT--AIYMFC---SWHHIDYFKQQFERKFKL-K-NL-IVWNK-NNHG-SGDLKGAYAPKHEL-ILFG----------HKG--------------------------------------RSLLQ--------------HKRIPDV-----ID---CD---KI--------------P--S-----AK------L-----THPTEKPVELLTIFILNSSQPGDVVLDGFIGTGATAVACVNTGRNFIGFETEPQYI-EIANKRLE-----GLL---------------------------------------------------------------------------------------------------------------- N007_05790_Alicyclobacillus_acidoterrestris_ATCC_49025_529047177 MELN-----RIYQMDVLDGVKL-VADN-SVDLVVTDPPYLMNYRS--------------N---RRV-VR-NK-FDY-IHNDQSSYDL-IATFI-DEC----------------YRVMKDNT--AIYMFC---SWHHIDYFKQQFERKFKL-K-NL-IVWNK-NNHG-SGDLKGAYAPKHEL-ILFG----------HKG--------------------------------------RSLLQ--------------HKRIPDV-----ID---CD---KI--------------P--S-----AK------L-----THPTEKPVELLTIFILNSSQPGDVVLDGFIGTGATAVACVNTGRNFIGFETEPQYI-EIANKRLE-----GLL---------------------------------------------------------------------------------------------------------------- CDQ29993.1_Streptococcus_pneumoniae_698840876 
MEID-----KIIKKDVLEVMAT-IPDN-KIDLIVTDPPYLINYKT--------------N---WRK-EK-HK-FSN-VIKNDNNPEL-IKEYI-KEC----------------YRILKDDT--AIYIFC---SFDKVDFFKKEIEKYFSV-K-NI-IIWRK-NNHT-AGDLEAQFGKQYEM-IILA----------NKG--------------------------------------RKKFN--------------GERLTDV-----WD---FK---RV--------------S--S-----DK------L-----LHQNQKPIELIKRCIVKHSDVGDIVFDGFMGSGTTALAALETDRHFIGAEIDGYYF-GIAEERIK-----NHNAQ---LSLFDEV---------------------------------------------------------------------------------------------------- SEP9_059_Staphylococcus_phage_vB_SepS_SEP9_589891490 MELN-----KIYNEDCVAGMKN-MESG-SVDLVVTDPPYLVNYKT--------------G---RRK-DKTHR-FNK-VILNDDNEQL-IINYI-NEC----------------YRILKNNS--AMYMFC---SSDKVDFFKQQLEKKFKI-K-NM-IIWVK-NNHT-AGDLKGSFGRKYEI-IFLV----------VKG--------------------------------------KKHFN--------------GKRLTDI-----WG---FD---KV--------------S--G-----KN------Q-----LHQNQKPLDLIKQCIEKHSDKGDLVFDGFAGSGTTAIACKELERNFIGFELDKGYF-DIAIKRLE-----DYKGE-------------------------------------------------------------------------------------------------------------- G500_RS21745_Flexibacter_roseolus_737788178 KNVE-IKN-QLFLGDCLEILKA-IPSN-SIDCLITDPPYNISGYDHKKQI---------G---WLK-SN-DF-WKK-QKA----FKK-IDENW-DKFSDDDYESFTIEWLSEIKRIVKPNG--NIAIFGSY-HNIYKIGYLIEKLDLKTI-N-S--IIWYK-RNAFPNVTQ-RMFCESTEQ-IIWC-------VNESKKNA-KNW-TFNYK--I-----------------------MKELN------------G-GVQMRNL-----FD-----------V----PLTK----Q--S--ER-EF--G---------KHPSQKPLEVLNNLMLALTNEGDVVLDCFLGSGTTAVSALQHKRNFVGIEQNYDYL-QIAQRRLENIESVIFNKTEI------------------------------------------------------------------------------------------------------------ HPS42_RS08645_[Haemophilus]_parasuis_737515587 
-----MNI-NLMQGDCLELLRD-IPDA-AVDMILTDPSY--------------------S-V-GM--TS-NS-IKS-SFNELSMVKPFFSQLF-KEF----------------KRVLKSDG--VAYIFTDWRTISFIQPILDAELGVKNV------LVWDK-AGRMSSSYG-----FYYEL-ILFA--------GNNKR---------------------------------------------------------KIHKKNI-----LK---AP---SF------ASNARKT----N---------G------EK-LHNAQKPIELLQELIINSSDEGDVVLDCFMGSGSTGVACLNTNRKFIGFEIDDKYF-HIAKDRIG-L-H-NRVSV-------------------------------------------------------------------------------------------------------------- EL26_RS10195_Tumebacillus_flagellatus_740246844 ----------------MCLLRR-TESE-SVDLVLTDPPY--------------------G-I-EF--RA-TR-GSK-VAT--------AKGIL-NDH-KDNIGFLESVAVEL-YRVLKPNS--HLYWFT-R-WDKVEEQLPMLRRCGFRP-K-NA-MIWIK-GGHGMSDTLG-AYAPEYEV-VLFC----------HKGRR--------------------------------------LLN------------E-------------VD---GR---KR------HTDVLRF----S--KI-AP--G------SL-VHSHQKPTALLEFLIQKSSNAGDLVLDPFLGSGSTALAARNTGRSFVGCELSEEIF-QIAQQQLAA----------------------------------------------------------------------------------------------------------------------- LEP1GSC194_RS08595_Leptospira_alstonii_523636963 -----LNT-KLFYDDCFNVLPK-IPDK-SVDLILSDLPY--------------------G-T-----TD-CF-WDK-ILP--------LDLLW-KEY----------------ERIIKDNG--AIILTS---CQPLTTRLICSNQKLFRY-E----LVWYK-SKPSGFLNAKKMPNKSHEN-ILIF----------YKRLP-----TYNPQ--K--FRI-D-PKFQK--K-G---K-SSKAG--I-NVFKV---S-GPKSENY-QY--LD-E-GL---RY------PDSVLCF-P--S--EF-AK--G---------MHPTQKPVSLMKFLVQSYSNVGDLVLDNCMGAGTTGVACVESDRNFIGIEKEKIYF-DLAKTRISNAKKLKSQNL-FVS---------------------------------------------------------------------------------------------------------- LEP1GSC172_RS05255_Leptospira_interrogans_488105867 
-----MDI-RLYNRDCFKVLPK-IKDK-SVHLIFSDLPY--------------------G-K-----TD-CK-WDK-VLS--------LENLW-KEY----------------NRILIDNG--VVIFTG---NQPFTTQIIQSNPKLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKRLP-----TYNLQ--K--YCI-D-PKFQV--K-G---K-SSLQT--T-KFINI---S-GKKTLSY-QY--LD-E-GT---RY------PDSVLCF-P--S--DS-NK--G---------MHPTQKPLSLLNFLILSYTNEFDTVLDHCMGSGTTGVACVKSNRRFIGIEKDKGYF-DLSKSRISKAKKENEGYL-FSDIALSS----------------------------------------------------------------------------------------------------- LEP2GSC076_RS0118050_Leptospira_interrogans_446276826 -----MDI-RLYNRDCFKVLPK-IGDK-SVHLIFSDLPY--------------------G-K-----TV-CK-WDQ-ILS--------LENLW-KEY----------------NRILIDNG--VVIFTG---NQPFTTQIIQSNPRLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKKLP-----TYNPQ--K--YSI-D-PKFHV--K-G---K-HSLQT--A-NFINI---K-GTKPLNY-QY--LE-D-GT---RY------PDSVLCF-P--S--ES-SK--G---------MHPTQKPVSLLNFLILSYTNKFDTVLDHCMGSGTTGVSCVKTERRFIGIEKDKGYF-KLAKSRISKAQKEKVETL-FSDLALSS----------------------------------------------------------------------------------------------------- IQ65_RS20335_Leptospira_interrogans_516471781 -----MDI-RLYNRDCFKVLPK-IGDK-SVHLIFSDLPY--------------------G-K-----TV-CK-WDQ-ILS--------LENLW-KEY----------------NRILIDNG--VVIFTG---NQPFTTQIIQSNPRLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKKLP-----TYNPQ--K--YSI-D-PKFHV--K-G---K-HSLQT--A-NFINI---K-GTKLLNY-QY--LE-D-GT---RY------PDSVLCF-P--S--ES-SK--G---------MHPTQKPVSLLNFLILSYTNKFDTVLDHCMGSGTTGVSCVKTERRFIGIEKDKGYF-KLAKSRISKAQKEKVETL-FSDLALSS----------------------------------------------------------------------------------------------------- LEP1GSC041_RS17345_Leptospira_noguchii_490560754 
-----MDI-RLYNRDCFKVLPK-IKDK-SVHLIFSDLPY--------------------G-K-----TD-CK-WDK-VLS--------LENLW-KEY----------------NRILIDNG--VVIFTG---NQPFTTQIIQSNPKLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKRLP-----TYNPQ--K--YCI-D-PKFQV--K-G---K-RSLQT--I-KFINI---S-GKKTLNY-QY--LD-E-GT---RY------PDSVLCF-P--S--DS-NK--G---------MHPTQKPLSLLNFLILSYTNEFDTVLDHCMGSGTTGVACVKSNRRFIGIEKDKGYF-DLSKSRISKAKKEKEENL-FSDIALSS----------------------------------------------------------------------------------------------------- LEP1GSC084_RS211440_Leptospira_interrogans_446276825 -----MDI-RLYNRDCFKVLPK-IGDK-SVHLIFSDLPY--------------------G-K-----TV-CK-WDQ-ILS--------LENLW-KEY----------------NRILIDNG--VVIFTG---NQPFTTQIIQSNPRLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKKLP-----TYNPQ--K--YSI-D-PKFHV--K-G---K-HSLQT--A-NFINI---K-GTKLLNY-QY--LE-D-GT---RY------PDSVLCF-P--S--ES-SK--G---------MHPTQKPVSLLNFLILSYTNKFDTVLDHCMGSGTTGVSCVKTERKFIGIEKDKGYF-KLAKSRISKAQKEKVETL-FSDLALSS----------------------------------------------------------------------------------------------------- LEP2GSC066_RS0113465_Leptospira_santarosai_490627908 -----MDI-RLYNEDCFKVLPT-IKDK-SVHLIFSDLPY--------------------G-T-----ID-CV-WDR-VLP--------FDNLW-KEY----------------NRILIDNG--VILFTG---SQPFTTKIILSNPKHFRY-E----LIWYK-SKASGFLNAKLMPNKSHEN-ILVF----------YKKLP-----TYNPQ--K--YSI-D-LKFRA--K-G---K-FNKQT--S-KFINI---T-GPKNLNY-QY--ID-E-GL---RY------PDSVLCF-P--S--ES-QK--G---------MHPTQKPVSLLNFLILSYTNEFNTVLDHCMGSGTTGVSCVNTNRRFIGIEKDKGYF-DLAQSRISKAKNSISLDL-FSKVNLNS----------------------------------------------------------------------------------------------------- LEP1GSC186_RS16950_Leptospira_noguchii_490575676 
-----MDI-RLYNRDCFKVLPK-IKDK-SVHLIFSDLPY--------------------G-K-----TD-CK-WDK-VLS--------LENLW-KEY----------------NRILIDNG--VVIFTG---NQPFTTQIIQSNPKLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKRLP-----TYNPQ--K--YCI-D-PKFQV--K-G---K-RSLQT--I-KFINI---S-GKKTLNY-QY--LD-E-GT---RY------PDSVLCF-P--S--DS-NK--G---------MHPTQKPLSLLNFLILSYTNEFDTVLDHCMGSGTTGVACVKSNRRFIGIEKDKGYF-DLSKSRISKAMKEKEENL-FSDIALSS----------------------------------------------------------------------------------------------------- _Ruminococcus_sp_SR1/5_505338314 MSEA-----TLLQGDCLELMNR-IPDS-SIDMVLSDLPY--------------------G-T-----TR-CR-WDA-PIN--------LQELW-EQY----------------RRVVKENG--AIALFS---AQPFTTELISSNKAMYRY-E----WIWRK-TQPSGFMNAKKMPLRTHEN-IEIF----------YRKPP-----TYNPQ--M--THG-H-QRKTA--TAY---GTRESDG--S---SCY---G-REERNYT-----YD-S-TD---RY------PVDVLQY-S-----TG-DK---------SKRLHPTQKPVDLLEYLVKTYTNPGETVLDNCMGAGSTGVACLNTGREFVGIELDPEYY-QIAKERIE-----QHVEN------------I--------F---------------------------------------------------------------------------------------- LEP1GSC059_0080_Leptospira_phage_vB_LnoZ_CZ214-LE1_529283433 -----MDI-RLYNRDCFKVLPK-IKDK-SVHLIFSDLPY--------------------G-K-----TD-CK-WDK-VLS--------LENLW-KEY----------------NRILIENG--VVIFTG---NQPFTTQIIQSNPKLFRY-E----LIWYK-TKATGFMSAKIMPNRSHEN-ILVF----------YKRLP-----TYNPQ--K--YCI-D-PKFQV--K-G---K-RSLQT--I-KFINI---S-GKKTLNY-QY--LD-E-GT---RY------PDSVLCF-P--S--DS-NK--G---------MHPTQKPLSLLNFLILSYTNEFDTVLDHCMGSGTTGVACVKSNRRFIGIEKDKGYF-DLSKSRISKAKKEKEENL-FSDIALSS----------------------------------------------------------------------------------------------------- M123_RS17125_Bacteroides_fragilis_492201466 
AKDI-----TLYKADCLEVMPF-LPES-SIDLVLCDPPF--------------------G-I-----TA-SQ-WDK-IIP--------FPEMW-KEI----------------RRVRKENA--PTVLFG---SEPFSSLLRCGNLEEFKY-D----WVWEK-SKASNFLLAKKQPLKAHEL-ISVF----------GKGRI-----PYYPI--M--EEG-E-PYGNR--T-------KRGSN--W---TGI---N-NVPNPTF-RN--EN-K-GT---RY------PRSVIYF-K--T--AE-SE--G------KT-IHVNQKPIALLQYLIRTYTKEGDTVLDFASGSMSTAIACIYTHRKCICIEKDETHF-SQGEKRVR-----NEYQY---LRL-------------------------------------------------------------------------------------------------------- HMPREF1033_RS05085_Tannerella_sp_6_1_58FAA_CT1_748669634 IEQD-----KIYNMDCLEGMKG-IADR-SIDAVIADLPY--------------------G-VLNRQNGA-AR-WDN-RIP--------LKPLW-EQY----------------LRITKPDS--PIILFA---QGMFSAELVLSQPKLWRY-N----LVWHK-DRVSGHLNANRMPLRQHED-ILVF----------YRKLP-----VYHPQ--M--IPC---PPEKR--N-H---G-RRKTE--GFTNRCY---G-GMKLAPV-RI--AD-D------KY------PTSVISV-P--K--EH-RK--G--TF-----YHPTQKPVALIEYLIRTYTDEGDTVLDNCIGSGTTAVAALRTGRHYIGFETDSGYC-GIAERRIREEIDQRERKDNEKNQ--------------------------------------------------------------------------------------------------------- M099_RS00920_Bacteroides_494836074 IEAD-----RIYLMDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-VN-WDR-QIP--------FAALW-EQY----------------QRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----VYHPQ--M--TPC---PPERR--Y-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EIAERRIQEELECRNKAQEEKEIREEKQNQ-------------------------------------------------------------------------------------------------- HMPREF1070_RS16045_Bacteroides_ovatus_490454463 
IETD-----KIYHMDCIEGMRL-MADG-SVDAVIADLPY--------------------G-MLNHKNKA-AR-WDR-QIP--------LEPLW-EQY----------------LRVTKPES--PIILFA---QGMFTAELLLSQPRIWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-IIVF----------YKRQP-----VYHPQ--M--TPC---LPERR--N-H---G-RRKTE--GFTNRCY---G-AMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KR--G--AF-----YHPTQKPVALMEYLIRTYTDKGAVVLDNCIGSGTTAVAAIRTGRHYIGFETEKVYC-EIAERRIREEIERRNEAK-------------------------------------------------------------------------------------------------------------- EE52_RS20200_Bacteroides_494843167 IETD-----RIYLMDCMEGMKQ-IADS-SVDAIIADLPY--------------------G-VLNRSNPS-VN-WDR-QIP--------FAALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----VYHPQ--M--TPC---PPERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQVYC-EIAERRIQEELECRNKAQEEKEIREEKQNQ-------------------------------------------------------------------------------------------------- BACCOPRO_RS07720_Bacteroides_coprophilus_495417764 IKKD-----QIYHMDCLKGMKQ-MADR-SVDAIIADLPY--------------------G-VLNNRNTS-AG-WDK-QLP--------LEKLW-EEY----------------LRISKPES--PVILFG---QGMFTARLVLSQPKIWRY-N----LVWHK-DRVTGHLNANRMPLRQHED-IIVF----------YRKQP-----VYHPQ--M--KPC---PAEQR--N-H---G-RSKTR--GFTNRCY---G-QMNLTPI-RI--AD-D------KY------PTSVIAI-A--K--EH-CK--G--CF-----YHPTQKPVALLEYLIRTYTNEGDTVLDSCIGSGTTMVAAIRTGRHFIGFETEQSYF-ETALLRIAEETE-QNHQTTEINIQ-------------------------------------------------------------------------------------------------------- M082_RS10705_Bacteroides_490422204 
IETD-----RIYLMDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-AN-WDR-QIP--------LTALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----VYHPQ--M--TPC---PSERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EITERRIQEELECRNKAQEEKEIREEKQINKSMTWTKKES---------------------------------------------------------------------------------------- ADC57_RS01565_Bacteroides_494743555 IETD-----RIYLMDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-VN-WDR-QIP--------LAALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----AYHPQ--M--TPC---PPERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EIAERRIQEELECRNKVQEEKEIREEKQNQ-------------------------------------------------------------------------------------------------- BSFG_RS19480_Bacteroides_490416986 IETD-----RIYLMDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-VN-WDR-QIP--------LAALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPFRQHED-ILVF----------YKKQP-----AYHPQ--M--TPC---PPERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EIAERRIQEELECRNKVQEEKEIREEKQNQ-------------------------------------------------------------------------------------------------- BFAG_00704_Bacteroides_fragilis_3_1_12_313134650 
-------------MDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-VN-WDR-QIP--------LAALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----AYHPQ--M--TPC---PPERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EIAERQIQEELECRNKVQEEKEIREEKQNQ-------------------------------------------------------------------------------------------------- HMPREF1062_RS27355_Bacteroides_cellulosilyticus_494418810 IETD-----RIYLMDCMEGMKQ-IADG-SVDAIIADLPY--------------------G-VLNRSNPS-AN-WDR-QIP--------LTALW-EQY----------------RRITKPDS--PIILFG---QGLFSAWLMLSQPRLWRY-N----LVWQK-DRVTGHLNAKRMPLRQHED-ILVF----------YKKQP-----AYHPQ--M--TPC---PSERR--N-H---G-RRKTE--GFTNRCY---G-TMKLSPV-RI--AD-D------KY------PTSVIFM-P--K--EH-KK--G--AF-----YHPTQKPVALMEYLIRTYTDEGDVVLDNCIGSGTTAVAAIRTGRHYIGFEIEQAYC-EIAERRIQEELECRNKVQEEKEIREEKQNQ-------------------------------------------------------------------------------------------------- IQ65_RS13625_Leptospira_interrogans_446127730 -----ITT-DLYLDDCLDRLPK-IPDE-SIRLILADLPY--------------------G-T-----TR-CK-WDK-ALP--------LEFLW-REY----------------ERIIIDNG--AIILTA---SQPFTTALINSNPRLFRY-E----LIWYK-SKASGFLNAKKMPQKSHEN-ILIF----------YKKPP-----VYNPQ--T--YKI-N-PIYQR--K-GVKLR-KSHKP--E-SLFKL---S-NSDMNQY-RY--ID-D-GT---RL------PDSVLCF-A--S--EF-QK--G---------MHPTQKPVALMDFLIRSYSNISDTVLDNCMGSGTTGVACIRAGRNFVGIEKDKDIF-DVASRRIEIAHTIHKLNSLPSLFR-------------------------------------------------------------------------------------------------------- LEP2GSC168_RS0121290_Leptospira_interrogans_446127731 
-----ITT-DLYLDDCLDRLPK-IPDE-SIRLILADLPY--------------------G-T-----TR-CK-WDK-VLP--------LEFLW-REY----------------ERIIIDNG--AIILTA---SQPFTTALINSNPRLFRY-E----LIWYK-SKASGFLNAKKKPQKSHEN-ILIF----------YKKPP-----VYNPQ--T--YKI-N-PIYQR--K-GVKLR-KSHKP--E-SLFKL---S-NSDMNQY-RY--ID-D-GT---RL------PDSVLCF-A--S--EF-QK--G---------MHPTQKPVALMDFLIRSYSNISDTVLDNCMGSGTTGVACIRAGRNFVGIEKDKDIF-DVASRRIEIAHTIHKLNSLPSLFR-------------------------------------------------------------------------------------------------------- AZ40_RS07215_Aeromonas_jandaei_752537274 ---M-----KLLQGDCLSLLPS-LPDN-SIDMVLADPPY--------------------G-T-----TQ-CK-WDS-VID--------LAAMW-REL----------------ERVCKPNS--AIVMTA---AQPFTAQLVCSNIGMFKY-E----IIWEK-GNATGFLNAKKQPLRAHES-VLVF----------YRQQP-----TYNPQ--M--TSG-H-ARKTS--K-------RKTVN--S---ECY---G-KALSLTE-----YD-S-TE---RY------PRSVQFF-S--S--DK-QR---------GS-YHATQKPVALMEWLIRSFSNPADVVLDFCMGSGTTGVACLNTGREFIGMEMDTEIF-KVATSRID----SLINKEAA------------------------------------------------------------------------------------------------------------ HMPREF1019_RS05155_Campylobacter_sp_10_1_50_496651971 ---------IILQGNSLEIIKG-IPTN-SIDLIFADPPYWMRVDG--V-LKRPEGKEFDG-------CN-DE-WDNTFLN-NDDYVD-FTRKWLNEC----------------KRVLKQNG--SIWVIGGM-QCIYTIGGIMQELGFWFI-N-D--VIWQK-SNPTPNFMGTRL-NNSHET-LIWA----------TKSKKSK--FTFNYK------------------T-------AKELNTENIDINLFEKGE-RRQLGSV-------------W-RF------SVCSGNE-R--L--KD-EN--G------NK-LHSTQKPESLLYRVIAISSKIGDIVLDPFGGTMTTAAMAKKLGRNYISIEQNDKYI-KFGKKRVN-D-IVFEDS--DIAHAK-FDKKPLKVNLDQMIDANFLNLGERFYLKNSDEFAILKRGSRLEYNNI-LYDIHSLAAKLKS-AKSERL-NGFKFWHVMRDNKKILLDDIRSHFREINA---- DV59_RS09130_Helicobacter_pylori_446268888 
---------TIIEGDCLEKLKD-FPNK-SVDFIFADPPYFMQTEG--E-LKRFEGTKFQG-------VE-DH-WDK-FGS-FEEYDT-FCLVWLKEC----------------QRILKDNG--SICVIGSF-QNIFRIGFHLQNLGFWIL-N-D--IIWHK-SNPVPNFAGKRL-CNAHET-LIWC----------AKHKNSK--VTFNYK------------------T-------MKYLN------------N-DKQEKSV-------------W-QI------PICMGNE-R--L--KD-AQ--G------KK-VHSTQKPEALLKKIILSTTKPKDIVLDPFFGTGTTGAVAKSMNRYFIGIEKDSFYI-KEAAKRLN-N--TRDKS--DFITNLELETKPPKIPMSLLISKQLLKIGDFLYSPNKEKICQVLENGQVRDNENYETSIHKMSAKYLN-K--TNH-NGWKFFYAYYQNQFLLLDELRYICQRDS----- QT55_RS02770_Helicobacter_pylori_727328309 ---------TIIEGDCLEKLKD-FPDK-SIDFIFADPPYFMQTEG--E-LKRFEGTKFQG-------VE-DH-WDK-FGS-FKEYDT-FCLGWLKEC----------------QRILKDNG--SICVIGSF-QNIFRIGFHLQNLGFWIL-N-D--IIWHK-SNPVPNFAGKRL-CNAHET-LIWC----------AKHKNSK--VTFNYK------------------T-------MKYLN------------N-DKQEKSV-------------W-QI------PICIGNE-R--L--KD-AQ--G------KK-VHSTQKPEALLKKIILSATKPKDIVLDPFFGTGTTGAVAKSMNRHFIGIEKDSFYI-KEATKRLN-N--TMDKS--DFITNLNLETKPPKIPMSLLISKQLLKIGDFLYSPNKERICQVLENGQVRDNENYETSIHKMSAKYLN-K--TNH-NGWKFFHAYYQNQFLLLDELRYICQKEF----- GZ76_RS01670_Helicobacter_pylori_726979196 ---------AIIEGDCLEKLKD-FPNK-SVDFIFADPPYFMQTEG--E-LKRFEGTKFQG-------VE-DH-WDK-FGS-FKEYDT-FCLGWLKEC----------------QRILKDNG--SICVIGSF-QNIFRIGFHLQNLGFWIL-N-D--IIWHK-SNPVPNFAGKRL-CNAHET-LIWC----------AKHKNSK--VTFNYK------------------T-------MKYLN------------N-DKQEKSV-------------W-QI------PICMGNE-R--L--KD-AQ--G------KK-VHSTQKPEALLKKIILSATKPKDIVLDPFFGTGTTGAVAKSMNRHFIGIEKDSFYI-KEAAKRLN-N--TRDKS--DFITNLELETKPPKIPMSLLISKQLLKIGDFLYSSNKEKICQVLENGQVRDNENYETSIHKMSAKYLN-K--TNH-NGWKFFYAYYQNQFLLLDELRYICQRDS----- NPL7_RS01720_Mycoplasma_hyosynoviae_738495747 
---------QILLGDNIELFKQ-IPDN-SIDLIFADPPYNMNLQK--D-LIRYDGSKFDG-------VD-DE-WDK-YES-LEEYDK-ECKLWLAEC----------------LRVLKKDG--SLWVIGSF-QNIHRLGYILQDMSAWII-N-E--IVWEK-ANPVPNFGGTRF-VNAQET-MLWV----------TKNSKAK--FTFNYK------------------T-------MKHMN------------G-GTQMKSV-------------W-KL------PICTGSE-R--L--KD-ED--G------KK-IHSTQKPLALLERIIIACSKPNDIVLDPFSGTATTAHAAKMLGRNYIGFEKDKIYY-EQSILRLN-T-VRKDESKNDLINAI-YDAKPQKVDFIDLINNNYISTTDKLRIITKDYELHFNKNGDIDFEGE-SLTPNKLCRKIFN-K--PT--NAWDVIMVND----MKLSEIREKYRAEN----- NPL1_02345_Mycoplasma_hyosynoviae_635203155 ---------QILLGDNIELFKQ-IPDN-SIDLIFADPPYNMNLQK--D-LIRYDGSKFDG-------VD-DE-WDK-YES-LEEYDK-ECKLWLAEC----------------LRVLKKDG--SLWVIGSF-QNIHRLGYILQDMSAWII-N-E--IVWEK-ANPVPNFGGTRF-VNAQET-MLWV----------TKNSKAK--FTFNYK------------------T-------MKHMN------------G-GTQMKSV-------------W-KL------PICTGSE-R--L--KD-ED--G------KK-IHSTQKPLALLERIIIACSKPNDIVLDPFSGTATTAHAAKMLGRNYIGFEKDKTYY-EQSILRLN-T-VRKDESKNDLINAI-YDAKPQKVDFIDLINNNYISTTDKLRIITKDYELHFNKNGDIDFEGE-SLTPNKLCRKIFN-K--PT--NAWDVIMVND----MKLSEIREKYRAEN----- NPL7_01825_Mycoplasma_hyosynoviae_635202114 ---------QILLGDNIELFKQ-IPDN-SIDLIFADPPYNMNLQK--D-LIRYDGSKFDG-------VD-DE-WDK-YES-LEEYDK-ECKLWLAEC----------------LRVLKKDG--SLWVIGSF-QNIHRLGYILQDMSAWII-N-E--IVWEK-ANPVPNFGGTRF-VNAQET-MLWV----------TKNSKAK--FTFNYK------------------T-------MKHMN------------G-GTQMKSV-------------W-KL------PICTGSE-R--L--KD-ED--G------KK-IHSTQKPLALLERIIIACSKPNDIVLDPFSGTATTAHAAKMLGRNYIGFEKDKIYY-EQSILRLN-T-VRKDESKNDLINAI-YDAKPQKVDFIDLINNNYISTTDKLRIITKDYELHFNKNGDIDFEGE-SLTPNKLCRKIFN-K--PT--NAWDVIMVND----MKLSEIREKYRAEN----- JF44_RS0106430_Thalassospira_australica_696534926 
---------KVLVGDCIELMNS-LPEK-SVDLIFADPPYNLQLGG--D-LLRPNNSKVDA-------VD-DH-WDQ-FDS-FRHYDD-FTRDWLTAA----------------RRVLKDTG--AIWVIGSY-HNIYRVGNTLQDIGFWIL-N-D--IVWRK-TNPMPNFRGKRF-CNAHET-LLWC----------SKSEEQK-AITFNYE------------------A-------MKQLN------------E-GLQMRSD-------------W-LM------PICSGSE-R--L--KD-DK--G------KK-VHPTQKPEALLQRVLMATTRQGDVVLDPFFGTGTTGAAARRLGRHFIGLEREEGYA-EAARDRIA-K-VQMLDG--DSLELTESKRSLPRIPFGAVIERGLLAPGDKIYDNRGNVAAMVRADGSISHKDN-AGSIHQVGAHVQG-A--QAC-NGWTYWHYKCDGRLVSIDNLRSQLRKELGQVPA QU01_RS03135_Helicobacter_pylori_727305172 ---------TIIEGDCLEKLKD-FPDK-SIDFIFADPPYFMQTEG--E-LKRFEGTKFQG-------VE-DH-WDK-FGS-FKEYDT-FCLGWLKEC----------------QRILKDNG--SICVIGSF-QNIFRIGFHLQNLGFWIL-N-D--IIWHK-SNPVPNFAGKRL-CNAHET-LLWC----------AKHKNSK--VTFNYK------------------T-------MKYLN------------N-DKQEKSV-------------W-QI------PICIGNE-R--L--KD-AQ--G------KK-VHSTQKPEALLKKIILSATKPKDIVLDPFFGTGTTGAVAKSMNRHFIGIEKDSFYI-KEAAKRLN-N--TMDKS--DFITNLNLETKPPKIPMSLLISKQLLKIGDFLYSPNKERICQVLENGQVRDNENYETSIHKMSAKYLN-K--TNH-NGWKFFHAYYQNQFLLLDELRYICQKEF----- _Candidatus_Hepatoplasma_crinochetorum_740676991 ---------KIKLGNCLEELKK-IPSK-SIDLIFADPPYFMQTGT--GTLYRINGNKYNG-------VD-DE-WDK-FDS-YKEYDK-FTRKWLTQC----------------RRILKDKG--SIWVMGTF-HNIYRLGYIIQDLNFWII-N-D--ITWEK-TNPTPNFRGTKF-VNSNEN-LIWF----------TKSQNSK--FTFNYK------------------T-------MKNEN------------K-KKQMGSV-------------W-KF------SICSGKE-R--L--KD-NN--G------NK-LHNTQKPEDLLRRIILASTKINDVILDPFFGTGTTGAVAKKLHRNFIGIENNEKYINYANDRIKNVDKISKNDPFFDYIKAK-FDEKIKYPKILDLIRENKIKSKYLYNL--KEDKVYLNSEGKIKINNV-KYSIHK-ATEIFE-N--GRYLNGWKYWYIKEGNNYISIDDIRKDVK-------- MALK_RS00800_Mycoplasma_alkalescens_488970128 
---------KILYGDCIENLKK-IPDE-TFDFCFADPPYFMQIERGKK-LFRVDGTEFNG-------CD-DE-WDK-FES-ITAYKK-WTKQWLTEV----------------HRVLKKDG--SICVIAGM-QSIFEIGSILREIGYWVI-N-D--IIWHK-SNPTPNFGGTRL-NNSHET-LIWA----------TKTKKSK--FTFNYK------------------T-------GKFLN------------G-GKQMGSI-------------W-KF------SVCSGNE-R--L--KD-YN--G------KK-VHNTQKPEALLYRIITLFTKKDDLILDPFGGTMTTAYVAKKTGRNYTMIERDPNYI-KHGQKRID-S-AIPSIG--DVENAI-FDLKPPKVQFSKMVEANYFNIGEPFYTKNKEKALLNSKNGHLKYNAE-INSMHEIAGKMIG-LD-RRV-NAFNYLYVIRDDELISINQIRNKYRAKLKEDI- NPL4_RS01685_Mycoplasma_hyosynoviae_738491218 ---------QILLGDNIELFKQ-IPDN-SIDLIFADPPYNMNLQK--D-LIRYDGSKFDG-------VD-DE-WDK-YES-LEEYDK-ECKLWLAEC----------------LRVLKKDG--SLWVIGSF-QNIHRLGYILQDMSAWII-N-E--IVWEK-ANPVPNFGGTRF-VNAQET-MLWV----------TKNSKAK--FTFNYK------------------T-------MKHMN------------G-GTQMKSV-------------W-KL------PICTGSE-R--L--KD-ED--G------KK-IHSTQKPLALLERIIIACSKPNDIVLDPFSGTATTAHAAKMLGRNYIGFEKDKTYY-EQSILRLN-T-VRKDESKNDLINAI-YDAKPQKVDFIDLINNNYISTTDKLRIITKDYELHFNKNGDIDFEGE-SLTPNKLCRKIFN-K--PT--NAWDVIMVND----MKLSEIREKYRAEN----- K355_RS0107980_Thalassospira_lucentensis_550983501 ---------QVLVGDCIEMMNS-LPEK-SVDLIFADPPYNLQLGG--D-LLRPNNSKVDA-------VD-DH-WDQ-FDS-FRHYDE-FTRDWLTAA----------------RRVLKDTG--AIWVIGSY-HNIYRVGNTLQDIGYWIL-N-D--IVWRK-TNPMPNFRGKRF-CNAHET-LLWC----------SKSEEQK-AITFNYE------------------A-------MKQLN------------E-GLQMRSD-------------W-LM------PICSGSE-R--L--KD-DN--G------KK-VHPTQKPEALLQRVLLATTRQGDVVLDPFFGTGTTGAAARRLGRHFIGLEREETYA-EAARERIA-K-VQMLDG--DSLEVTESKRSLPRIPFGAVIERGLLSPGEKIYDNRGNVAAMVRADGSISHKDN-AGSIHQIGARVQG-A--EAC-NGWTYWHYKCDGRLVSIDNLRSQLRKEMGQVPA consensus/100% 
...................h.........hphhhsD.sa....................s..............p.................h......................h....s....hh.s..................b..........a.K..................E..h.h.....................................................................................................................................p...KP..hh..hh...s.....lhD...G..sT...s....p.....E..........................<---RAMA domain----------------------------------------------------------------------------------------------> consensus/95% ..............ssh..h...h.p..SlchlhsD.PY....................u..........s..ac....s........h...h...h.................Rl...ps..shhh.s..........h...p...hbh.p....hlW.K.s........p.......E..l.hh............................................................................................p.........p.............................HssQKP..Lhpbhl...op..p.lLD.h.Gp.oTuhss....RphlshE.p...h...u..bl......................................................................................................................... consensus/90% ..........l...Dshp.h...l.s..SlchlhsD.PY....................G.......s..s..WD....s........h..hW..ph.................Rl...ps..slhh.u......ap..h..pp...hbh.p....hlW.K.s..ss...up....p.aE..lhhh...........c........sa..b...................................................................p.......s.s...b........................hHssQKP..Lhpbll.s.op..-.lLD.h.GsuoTuhus....RpaIuhE.p..ah...u.p+l......................................................................................................................... 
consensus/85% .........plb..DChp.h...lss..SlchlhsD.PY....................G.......s..sp.WD...hs........h..hW..ph.................Rl.b.pu..slhl.u...p..ap..h..pp...abh.p....hlW.K.spsss.h.up.b..p.aEs.llhh...........+........sap.b...............................................p...................ph......P.s...b........................hHsTQKP..LhpblIbshop.s-.VLD.hhGoGoTulAs.p.sRpaIGhEb-..ah.p.upp+l......................................................................................................................... consensus/80% .........plh..DChc.h...ls-..SlDhlhsD.PY....................G.......sp.sp.WD...ls........hp.hW..ph................pRl.b.su..slhl.u...p..Fs..l..pp.p.a+h.p....hlW.K.spsss.hsuppb..p.HEs.lllh...........+pbs.....sap.b..............................p................p...................+h......P.slb.h...........p............hHsTQKPl.LhpaLIboaop.sD.VLDshhGoGTTulAs.p.sRpaIGhEb-..ah.p.AppRlp........................................................................................................................ consensus/75% .........plh..DChc.h.p.lsDp.SlDhlhsD.PY....................G.......sp.sp.WDp.hls........hp.LW.pph................pRl.Kpsu..slll.u...pp.Fs..l..up.p.a+Y.p....hlWbK.spsosahsuppbP.c.HEs.IllF..........hKpbs.....sYpPb..............................p................p..sh......p........+a......P.slb.h....p.....pp............hHPTQKPlsLhpaLIboaos.uD.VLDshhGoGTTulAs.p.sRpaIGhEb-p.Yh.pbAppRlp........................................................................................................................ 
consensus/70% .........plh..DChc.h.p.lsDp.SlDhIlsDLPY....................G.......sp.sp.WDp.hls........hp.LW.ppY................pRlhKcsu..slll.u...pp.Fss.lh.Ss.cba+Y.p....hlWbK.spsosahsAcpbP.+.HEs.ILlF..........aKpbs.....sYpPb..b...............p.......pp..p........h.......p..sh......s.p......+a......Ppolbph.s..p..p..pp............hHPTQKPlsLhpaLIboYos.GDsVLDsChGoGTTulAs.p.sRpaIGhEb-p.Yh.pbApcRlp......p.................................................................................................................
General notes
The NAEGRDRAFT_76461 N6-MTase in Naegleria is present in a single copy and has been confirmed to be part of the Naegleria genome (i.e. it is not a contaminant). It is a large protein with the DAM domain at the C-terminus. Prokaryotic homologs of this N6-MTase can be distinguished by their unique strand-4 signature with a DLPY motif. Additionally, members of the family share a K between strand-1 and helix-1, a D at the beginning of strand-2 (and the universal D** at the end of strand-2), R and E** flanking strand-3, a D in the helix before strand-4, D and R** in the helix between strands 4 and 5, a K at the end of strand-6, and E** and K** flanking strand-7. Gene neighborhoods suggest that these homologs are part of Type IV secretion systems or are present in phages. At least one version is part of an R-M system. Phylogenetic analysis groups the Naegleria DAM with DAMs from Bacteroidetes species that are part of Type IV secretion systems.
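As an illustration of the strand-4 signature described above, the following minimal Python sketch removes gap characters from an aligned row and searches for the DLPY motif. The function names `ungap` and `find_motif` are hypothetical and are not part of the original analysis. The relaxed pattern `D[LP]P[YF]` is an assumption based on alignment rows that show DPPY or DPPF at the same position; it is not a definition taken from the notes.

```python
import re

# Strict motif as named in the notes.
STRICT_MOTIF = re.compile(r"DLPY")
# Relaxed variant: an assumption, covering rows in the alignment
# that carry DPPY or DPPF at the equivalent position.
RELAXED_MOTIF = re.compile(r"D[LP]P[YF]")


def ungap(aligned_row):
    """Remove alignment gap characters ('-' and '.') from an aligned row."""
    return aligned_row.replace("-", "").replace(".", "")


def find_motif(aligned_row, pattern=STRICT_MOTIF):
    """Return the 0-based start of the first motif hit in the ungapped
    sequence, or None if the motif is absent."""
    match = pattern.search(ungap(aligned_row))
    return match.start() if match else None


if __name__ == "__main__":
    # N-terminal fragment of a Leptospira row from the alignment above.
    leptospira = "-----MDI-RLYNRDCFKVLPK-IKDK-SVHLIFSDLPY----"
    # N-terminal fragment of a Helicobacter pylori row (DPPY variant).
    helicobacter = "---------TIIEGDCLEKLKD-FPNK-SVDFIFADPPYFMQTEG"
    print(find_motif(leptospira))
    print(find_motif(helicobacter))
    print(find_motif(helicobacter, RELAXED_MOTIF))
```

Positions are reported in ungapped sequence coordinates, so they are not comparable across rows without mapping back to alignment columns. A hit on the short pattern alone is only suggestive; the additional conserved residues listed in the notes would need to be checked in the context of the full alignment.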
# 80; Either phage or transposon associated GI Gene neighborhoods Arch Pfam arch Gene name Len Taxonomy Species Genbank 464395818 <-ASCH<-?<-N6-MTase*<-?<-?<-Gam-nuclease-inh<-Gam-nuclease-inh<-?<-?<-AAA N6-MTase N6_N4_Mtase LEP1GSC133_0802 307 bacteria>spirochaetes Leptospira borgpetersenii serovar Pomona str. 200901868 DNA methylase family protein [Leptospira borgpetersenii serovar Pomona str. 200901868]. 464395810_?-><-464395750_?<-464395824_?<-464395740_?<-464395671_?<-464395689_ASCH<-464395717_?<-464395818_N6-MTase*<-464395803_?<-464395746_?<-464395728_Gam-nuclease-inh<-464395805_Gam-nuclease-inh<-464395713_?<-464395797_?<-464395704_AAA 410804029 Gam-nuclease-inh->?->?->MuF->N6-MTase->multi-TM->MazG->N6-MTase*-> N6-MTase SP+N6_N4_Mtase LEP1GSC071_3962 288 bacteria>spirochaetes Leptospira santarosai str. JET putative uncharacterized adenine-specific methylase YhdJ [Leptospira santarosai str. JET]. 410804041_Gam-nuclease-inh->410804133_?->410804034_?->410804117_MuF->410804038_N6-MTase->410804175_multi-TM->410804054_MazG->410804029_N6-MTase*->410804125_?->410804108_?->410804019_?->410804198_?->410804067_?->410804142_?-><-410804048_? 446543012 Gam-nuclease-inh->?->?->MuF->N6-MTase->?->N6-MTase*-> N6-MTase SP+N6_N4_Mtase - 285 bacteria>spirochaetes Leptospira interrogans hypothetical protein [Leptospira interrogans]. 446148057_?->523650350_Gam-nuclease-inh->446043447_?->447063297_?->523650335_MuF->446545314_N6-MTase->446614561_?->446543012_N6-MTase*->446271796_?->446265643_?->446525580_?->446495114_?->447002706_?->516465345_?-><-523650344_? 696229311 <-N6-MTase*<-?<-?<-?<-N6-MTase<-MuF N6-MTase SP+N6_N4_Mtase - 285 bacteria>spirochaetes Leptospira MULTISPECIES: DNA methylase [Leptospira]. 495673486_?->696229310_?-><-495673641_?<-495673463_?<-490624138_?<-490624181_?<-495673482_?<-696229311_N6-MTase*<-696229312_?<-696229313_?<-495673450_?<-495673454_N6-MTase<-495673619_MuF<-495673617_?<-495673592_? 
490906211 MuF->?->N6-MTase->?->?->N6-MTase*-> N6-MTase SP+N6_N4_Mtase - 284 bacteria>spirochaetes Leptospira kirschneri putative adenine-specific methylase YhdJ [Leptospira kirschneri]. 490906210_?->490906225_?->490906216_MuF->642970670_?->490906226_N6-MTase->490906212_?->490906229_?->490906211_N6-MTase*->642970671_?-> 495865040 MuF->N6-MTase->?->?->?->N6-MTase*-> N6-MTase SP+N6_N4_Mtase - 284 bacteria>spirochaetes Leptospira licerasiae DNA methylase [Leptospira licerasiae]. 495866424_?->495865716_?->495867158_MuF->495866141_N6-MTase->495867192_?->495871841_?->495866482_?->495865040_N6-MTase*->495864971_?->495871835_?->495866397_?->498200535_?-><-495865257_?<-495867171_?||495865794_?-> 490422204 <-VirD4-like<-?<-?<-?<-TraK<-N6-MTase*<-?<-TraM<-?<-?<-MultiTM N6-MTase N6_N4_Mtase - 280 bacteria>bacteroidetes Bacteroides MULTISPECIES: DNA methylase [Bacteroides]. 490422198_?-><-492267499_?<-490422199_VirD4-like<-490422200_?<-490422201_?<-490422202_?<-490422203_TraK<-490422204_N6-MTase*<-490422205_?<-490422206_TraM<-490422207_?<-490422209_?<-494420144_MultiTM<-490422214_?<-490422215_? 696349061 Gam-nuclease-inh->?->?->MuF->N6-MTase->multi-TM->MazG->N6-MTase*-> N6-MTase SP+N6_N4_Mtase - 278 bacteria>spirochaetes Leptospira santarosai DNA methylase [Leptospira santarosai]. 696349045_Gam-nuclease-inh->490624197_?->490624147_?->490624187_MuF->490624149_N6-MTase->490624220_multi-TM->490624160_MazG->696349061_N6-MTase*->648272076_?->490624181_?->490624138_?->490624232_?->490624165_?->490624203_?-><-696349066_? 490593464 <-MazG<-?<-N6-MTase*<-?<-?<-?<-?<-Gam-nuclease-inh<-?<-AAA N6-MTase N6_N4_Mtase - 277 bacteria>spirochaetes Leptospira santarosai phage DNA methylase [Leptospira santarosai]. 
490593448_?-><-490601417_?<-490593451_?<-490626751_?<-696346268_?<-490593460_MazG<-490593462_?<-490593464_N6-MTase*<-490593466_?<-490593468_?<-490593470_?<-696346260_?<-514360257_Gam-nuclease-inh<-490593475_?<-490593476_AAA 490596716 <-N6-MTase*<-?<-?<-?<-?<-Gam-nuclease-inh<-?<-AAA N6-MTase N6_N4_Mtase - 277 bacteria>spirochaetes Leptospira santarosai phage DNA methylase [Leptospira santarosai]. 490598338_?-><-696346163_?<-490596721_?<-490596720_?<-696346179_?<-490596718_?<-490596717_?<-490596716_N6-MTase*<-490596715_?<-490596714_?<-490596713_?<-490596712_?<-490596711_Gam-nuclease-inh<-490596709_?<-490596708_AAA 490621523 DCM->?->?->?->N6-MTase*-> N6-MTase N6_N4_Mtase - 277 bacteria>spirochaetes Leptospira santarosai DNA methylase family protein [Leptospira santarosai]. 490614287_?->490621546_?->696347273_?->696347305_DCM->696347306_?->696347275_?->696347276_?->490621523_N6-MTase*->696347279_?->490621505_?->696347282_?->490621564_?->696347283_?->490621527_?->490621517_?-> 490626771 AAA->?->Gam-nuclease-inh->?->?->?->?->N6-MTase*-> N6-MTase N6_N4_Mtase - 277 bacteria>spirochaetes Leptospira santarosai DNA methylase family protein [Leptospira santarosai]. 490626833_AAA->490626754_?->490626820_Gam-nuclease-inh->696348306_?->490626759_?->490593468_?->490626829_?->490626771_N6-MTase*->490626767_?->490626844_?->696348307_?->696348327_?->490626751_?-><-696348308_?<-490602563_? 490633882 AAA->?->?->Gam-nuclease-inh->?->?->?->N6-MTase*-> N6-MTase N6_N4_Mtase - 277 bacteria>spirochaetes Leptospira weilii DNA methylase family protein [Leptospira weilii]. 490633900_AAA->490636543_?->490633845_?->490633859_Gam-nuclease-inh->490633887_?->490633842_?->738117417_?->490633882_N6-MTase*->490633893_?->490633841_?->490633904_?->490633890_?-><-738117420_?||490633762_?->738117406_?-> 696345163 Gam-nuclease-inh->?->?->MuF->N6-MTase->multi-TM->MazG->N6-MTase*-> N6-MTase SP+N6_N4_Mtase - 277 bacteria>spirochaetes Leptospira santarosai DNA methylase [Leptospira santarosai]. 
490613760_Gam-nuclease-inh->490613694_?->490613847_?->490613713_MuF->490613894_N6-MTase->490613846_multi-TM->490613768_MazG->696345163_N6-MTase*->696345164_?->490613748_?->490613729_?->490613809_?->490613876_?->490613815_?-><-696345092_? 649530658 <-VirD4-like<-?<-?<-?<-TraK<-N6-MTase*<-?<-TraM<-?<-?<-MultiTM N6-MTase N6_N4_Mtase M089_3211 271 bacteria>bacteroidetes Bacteroides ovatus str. 3725 D9 iii DNA methylase family protein [Bacteroides ovatus str. 3725 D9 iii]. <-649530652_?<-649530653_VirD4-like<-649530654_?<-649530655_?<-649530656_?<-649530657_TraK<-649530658_N6-MTase*<-649530659_?<-649530660_TraM<-649530661_?<-649530662_?<-649530663_MultiTM<-649530664_?<-649530665_? 490416986 N6-MTase->?->MultiTM->?->?->TraM->?->N6-MTase*->TraK->?->?->?->VirD4-like-> N6-MTase N6_N4_Mtase - 270 bacteria>bacteroidetes Bacteroides MULTISPECIES: hypothetical protein [Bacteroides]. 490416978_N6-MTase->490416979_?->495942219_MultiTM->490422209_?->490416983_?->490416984_TraM->490416985_?->490416986_N6-MTase*->490416987_TraK->490416988_?->490416989_?->490416990_?->495942226_VirD4-like-><-494418804_?<-490416993_? 494418810 <-VirD4-like<-?<-?<-TraK<-N6-MTase*<-?<-TraM<-?<-?<-MultiTM<-?<-N6-MTase N6-MTase N6_N4_Mtase - 270 bacteria>bacteroidetes Bacteroides cellulosilyticus hypothetical protein [Bacteroides cellulosilyticus]. 490416994_?->490416993_?->494418804_?-><-494418806_VirD4-like<-494418808_?<-490416988_?<-490416987_TraK<-494418810_N6-MTase*<-490422205_?<-494418815_TraM<-490422207_?<-490422209_?<-492201471_MultiTM<-492201468_?<-490416978_N6-MTase 494743555 TraM->?->N6-MTase*->TraK->?->?->?->VirD4-like-> N6-MTase N6_N4_Mtase - 270 bacteria>bacteroidetes Bacteroides MULTISPECIES: hypothetical protein [Bacteroides]. <-495124505_?<-495124515_?<-495124517_?||490422209_?->490416983_?->494743553_TraM->490416985_?->494743555_N6-MTase*->490416987_TraK->490416988_?->490416989_?->490416990_?->494418806_VirD4-like-><-494418804_?<-490416993_? 
494836074 <-VirD4-like<-?<-?<-?<-TraK<-N6-MTase*<-?<-TraM N6-MTase N6_N4_Mtase - 270 bacteria>bacteroidetes Bacteroides MULTISPECIES: hypothetical protein [Bacteroides]. <-494836086_?<-696379496_?<-696379521_VirD4-like<-494836080_?<-494836078_?<-494836077_?<-494836075_TraK<-494836074_N6-MTase*<-494836072_?<-494836070_TraM<-490422207_?<-494836068_?||490439559_?->490439558_?->490439557_?-> 494843167 N6-MTase->?->MultiTM->?->?->TraM->?->N6-MTase*->TraK->?->?->?->VirD4-like-> N6-MTase N6_N4_Mtase - 270 bacteria>bacteroidetes Bacteroides MULTISPECIES: hypothetical protein [Bacteroides]. 494843158_N6-MTase->494843161_?->495930520_MultiTM->763472918_?->763472920_?->494843165_TraM->494843166_?->494843167_N6-MTase*->763472922_TraK->763472924_?->494843175_?->494843179_?->763472926_VirD4-like->695476477_?->494843186_?-> 695344547 <-VirD4-like<-?<-?<-?<-TraK<-N6-MTase*<-?<-TraM<-?<-?<-MultiTM<-?<-N6-MTase N6-MTase N6_N4_Mtase - 270 bacteria>bacteroidetes Bacteroides fragilis DNA methylase [Bacteroides fragilis]. 490416993_?->494418804_?-><-495942226_VirD4-like<-490416990_?<-757748891_?<-490416988_?<-492201739_TraK<-695344547_N6-MTase*<-490416985_?<-490416984_TraM<-490416983_?<-490422209_?<-757748895_MultiTM<-492201468_?<-490416978_N6-MTase 763222313 <-ASCH<-?<-?<-N6-MTase*<-?<-?<-?<-AAA N6-MTase N6_N4_Mtase - 270 bacteria>spirochaetes Leptospira borgpetersenii cytosine methyltransferase [Leptospira borgpetersenii]. 763222312_?->488839006_?-><-488839031_?<-763222280_?<-488838822_ASCH<-488838861_?<-763222284_?<-763222313_N6-MTase*<-488838988_?<-763222285_?<-763222286_?<-488838843_AAA||763222314_?->763222289_?->488838960_?-> 410015573 <-N6-MTase*<-?<-?<-?<-N6-MTase<-MuF N6-MTase SP+N6_N4_Mtase LEP1GSC068_2949 269 bacteria>spirochaetes Leptospira sp. Fiocruz LV3954 putative uncharacterized adenine-specific methylase YhdJ [Leptospira sp. Fiocruz LV3954]. 
410015635_?->410015575_?-><-410015681_?<-410015622_?<-410015634_?<-410015655_?<-410015632_?<-410015573_N6-MTase*<-410015572_?<-410015584_?<-410015615_?<-410015617_N6-MTase<-410015662_MuF<-410015660_?<-410015643_? 490637745 AAA->?->Gam-nuclease-inh->?->?->?->?->N6-MTase*->?->ASCH-> N6-MTase N6_N4_Mtase - 268 bacteria>spirochaetes Leptospira weilii DNA methylase family protein [Leptospira weilii]. 490637739_AAA->490637823_?->490637758_Gam-nuclease-inh->490637820_?->490637784_?->738101833_?->490637799_?->490637745_N6-MTase*->738101836_?->490637751_ASCH->490637803_?->490637863_?->738101882_?->490637836_?->515129620_?-> 545404645 <-VirD4-like<-?<-?<-TraK<-N6-MTase*<-TraM<-?<-?<-MultiTM N6-MTase N6_N4_Mtase - 268 bacteria>bacteroidetes Bacteroides pyogenes DNA (cytosine-5-)-methyltransferase [Bacteroides pyogenes]. 748714185_?->545404639_?-><-545404640_?<-545404641_VirD4-like<-748714186_?<-545404643_?<-545404644_TraK<-545404645_N6-MTase*<-545404646_TraM<-545404647_?<-545404648_?<-545404649_MultiTM<-545404650_?<-545404651_?<-748714187_? 648272077 <-N6-MTase* N6-MTase N6_N4_Mtase - 268 bacteria>spirochaetes Leptospira santarosai DNA methylase, partial [Leptospira santarosai]. 515125024_?-><-515125025_?<-515125026_?<-515125027_?<-515125028_?<-515125029_?<-648272076_?<-648272077_N6-MTase* 489305499 <-N6-MTase*<-?<-URI<-N6-MTase N6-MTase N6_N4_Mtase - 265 bacteria>firmicutes Bacillus pumilus DNA-cytosine methyltransferase [Bacillus pumilus]. <-736611728_?<-736611801_?<-736611802_?<-736611803_?<-489305170_?<-736611729_?<-489305717_?<-489305499_N6-MTase*<-736611804_?<-489305926_URI<-489305329_N6-MTase<-736611730_?<-489306211_?<-736611731_?<-489305838_? 748669634 MultiTM->?->?->TraM->N6-MTase*->TraK->?->?->?->VirD4-like-> N6-MTase N6_N4_Mtase - 265 bacteria>bacteroidetes Tannerella sp. 6_1_58FAA_CT1 DNA methylase [Tannerella sp. 6_1_58FAA_CT1]. 
748669630_?->496674958_?->748669827_?->496674960_MultiTM->496674961_?->496674962_?->496674963_TraM->748669634_N6-MTase*->748669635_TraK->748669828_?->748669830_?->748669832_?->748669636_VirD4-like->496674969_?->748669833_?-> 330989854 N6-MTase*-> N6-MTase N6_N4_Mtase PLA107_32876 264 bacteria>proteobacteria>gammaproteobacteria Pseudomonas amygdali pv. lachrymans str. M301315 DNA methylase N-4/N-6 domain-containing protein, partial [Pseudomonas amygdali pv. lachrymans str. M301315]. 330989854_N6-MTase*-> 446127730 <-N6-MTase*<-?<-Collar N6-MTase N6_N4_Mtase - 264 bacteria>spirochaetes Leptospira interrogans hypothetical protein [Leptospira interrogans]. <-446127730_N6-MTase*<-447029443_?<-446808019_Collar<-757477342_?<-446127269_?<-447014100_?<-446555036_?<-642966717_? 446127731 Collar->?->N6-MTase*-> N6-MTase N6_N4_Mtase - 264 bacteria>spirochaetes Leptospira interrogans hypothetical protein [Leptospira interrogans]. 447082902_?->516465892_?->516465893_?->516465894_?->446272244_?->446808018_Collar->447029444_?->446127731_N6-MTase*->658829992_?-><-446558080_?<-446799003_?<-446767232_?<-447143376_?<-516465896_?<-446325284_? 495941257 <-MutS_I+N6-MTase<-DCM<-?<-?<-?<-?<-N6-MTase<-N6-MTase* N6-MTase Methyltransf_26+N6_N4_Mtase - 264 bacteria>bacteroidetes Bacteroides sp. 4_1_36 DNA methyltransferase [Bacteroides sp. 4_1_36]. <-736517283_MutS_I+N6-MTase<-495941252_DCM<-495941253_?<-736517241_?<-495941254_?<-736517242_?<-495941256_N6-MTase<-495941257_N6-MTase*<-495941258_?<-495941260_?<-495941261_?<-495941262_?<-495941263_?<-495941264_?<-495941265_? 495417764 MultiTM->?->?->?->N6-MTase->TraM->?->N6-MTase*->TraK->?->?->?->VirD4-like-> N6-MTase N6_N4_Mtase - 263 bacteria>bacteroidetes Bacteroides coprophilus hypothetical protein [Bacteroides coprophilus]. 
495417751_MultiTM->495417753_?->495417755_?->495417757_?->495417759_N6-MTase->495417760_TraM->495417762_?->495417764_N6-MTase*->749915300_TraK->495417768_?->749915273_?->495417771_?->495417773_VirD4-like->495417775_?-> 740126887 <-N6-MTase* N6-MTase N6_N4_Mtase - 263 bacteria>synergistetes Synergistes jonesii hypothetical protein [Synergistes jonesii]. <-740126887_N6-MTase*<-740130694_?<-740130686_?<-740130688_?<-740130690_? 23752321 <-N6-MTase*<-?<-?<-?<-?||?-><-MuF<-NUDIX N6-MTase N6_N4_Mtase BCPBV781_gp10 262 dsdna viruses, no rna stage>caudovirales Burkholderia phage Bcep781 gp10 [Burkholderia phage Bcep781]. <-23752315_?<-23752316_?<-47835036_?<-47835021_?<-47835022_?<-23752319_?<-23752320_?<-23752321_N6-MTase*<-23752322_?<-23752323_?<-23752324_?<-47835023_?||23752326_?-><-23752327_MuF<-23752328_NUDIX 41057660 <-N6-MTase*<-?<-?<-?<-?||?-><-MuF<-NUDIX N6-MTase N6_N4_Mtase BCPBV43_gp10 262 dsdna viruses, no rna stage>caudovirales Burkholderia phage Bcep43 gp10 [Burkholderia phage Bcep43]. <-41057653_?<-41057654_?<-41057655_?<-41057656_?<-41057657_?<-41057658_?<-41057659_?<-41057660_N6-MTase*<-41057662_?<-41057663_?<-41057664_?<-41057665_?||41057666_?-><-41057667_MuF<-41057668_NUDIX 503885074 <-N6-MTase* N6-MTase N6_N4_Mtase - 262 bacteria>firmicutes Oscillibacter valericigenes DNA-cytosine methyltransferase [Oscillibacter valericigenes]. <-503885062_?<-503885063_?<-503885068_?<-503885069_?<-503885070_?<-753860327_?<-503885072_?<-503885074_N6-MTase*<-503885075_?<-503885076_?<-753859673_?<-503885079_?||753859675_?-><-753859676_?<-503885081_? 313134650 N6-MTase->?->MultiTM->?->?->TraM->?->N6-MTase*->TraK->?->?->?->VirD4-like-> N6-MTase N6_N4_Mtase BFAG_00704 261 bacteria>bacteroidetes Bacteroides fragilis 3_1_12 DNA (cytosine-5-)-methyltransferase [Bacteroides fragilis 3_1_12]. 
313134643_N6-MTase->313134644_?->313134645_MultiTM->313134646_?->313134647_?->313134648_TraM->313134649_?->313134650_N6-MTase*->313134651_TraK->313134652_?->313134653_?->313134654_?->313134655_VirD4-like-><-313134656_?<-313134657_? 516293791 <-N6-MTase* N6-MTase N6_N4_Mtase - 261 bacteria>firmicutes Bacillus subtilis DNA-cytosine methyltransferase [Bacillus subtilis]. <-751587985_?<-516293791_N6-MTase*<-751587988_?<-751587990_?<-751587993_?<-751587996_?<-751587999_?<-516293781_?<-751588002_? 740127826 N6-MTase*-> N6-MTase N6_N4_Mtase - 261 bacteria>synergistetes Synergistes jonesii hypothetical protein [Synergistes jonesii]. 740127826_N6-MTase*->740127829_?->740127831_?->740127834_?->740127836_?-> 488335929 <-N6-MTase* N6-MTase N6_N4_Mtase - 260 bacteria>firmicutes Enterococcus faecalis DNA-cytosine methyltransferase [Enterococcus faecalis]. <-488335922_?<-488335923_?<-488335924_?<-488335925_?<-488335926_?<-488335927_?<-488335928_?<-488335929_N6-MTase*<-488326316_?<-488335930_?<-488335931_? 523636963 N6-MTase*-> N6-MTase N6_N4_Mtase - 260 bacteria>spirochaetes Leptospira alstonii DNA methylase family protein [Leptospira alstonii]. 523636950_?->738083161_?->523636910_?->523636962_?->523637035_?->738083163_?->523636883_?->523636963_N6-MTase*-><-516420500_?||523636965_?->516420495_?->738083107_?->523637017_?->523636918_?-><-523636982_? 446276825 <-N6-MTase* N6-MTase N6_N4_Mtase - 259 bacteria>spirochaetes Leptospira interrogans hypothetical protein [Leptospira interrogans]. <-446276825_N6-MTase*<-488029201_? 446276826 N6-MTase*-> N6-MTase N6_N4_Mtase - 259 bacteria>spirochaetes Leptospira interrogans hypothetical protein [Leptospira interrogans]. 446010275_?->446857835_?->446042321_?->447190987_?->446733555_?->446710508_?->446742328_?->446276826_N6-MTase*-><-642966814_? 488105867 N6-MTase*->?-><-?<-?<-RADICAL-SAM<-RADICAL-SAM N6-MTase N6_N4_Mtase - 259 bacteria>spirochaetes Leptospira interrogans DNA methylase family protein [Leptospira interrogans]. 
488105905_?->488105867_N6-MTase*->488105884_?-><-488105878_?<-488105912_?<-488105908_RADICAL-SAM<-488105906_RADICAL-SAM<-488105911_?<-488105916_? 490560754 N6-MTase*->?-><-?<-?<-RADICAL-SAM<-RADICAL-SAM N6-MTase N6_N4_Mtase - 259 bacteria>spirochaetes Leptospira noguchii DNA methylase [Leptospira noguchii]. 490560818_?->490560729_?->490560817_?->490560745_?->490560733_?->490560819_?->490560738_?->490560754_N6-MTase*->748682039_?-><-748682041_?<-488105912_?<-490560723_RADICAL-SAM<-488105906_RADICAL-SAM<-490560812_?<-488105916_? 490575676 N6-MTase*-> N6-MTase N6_N4_Mtase - 259 bacteria>spirochaetes Leptospira noguchii DNA methylase family protein [Leptospira noguchii]. 738074846_?->490560729_?->490560817_?->490575678_?->490575727_?->490575700_?->490575698_?->490575676_N6-MTase*-><-738074849_?<-490575711_?<-490575720_?<-738074852_?<-490575822_?||490575802_?->738074875_?-> 490627908 N6-MTase*-> N6-MTase N6_N4_Mtase - 259 bacteria>spirochaetes Leptospira santarosai DNA methylase family protein [Leptospira santarosai]. 648272267_?->490613267_?->490625571_?->648272268_?->490627898_?->490613256_?->490625561_?->490627908_N6-MTase*-><-515125978_?||490615892_?-><-490606321_?<-490615889_? 516471781 N6-MTase*-> N6-MTase N6_N4_Mtase - 259 bacteria>spirochaetes Leptospira interrogans DNA-cytosine methyltransferase [Leptospira interrogans]. 488060392_?->446857835_?->757477947_?->516471783_?->446733555_?->446710508_?->757477948_?->516471781_N6-MTase*-><-757477949_?<-642966825_?||757477950_?-> 529283433 N6-MTase*->?->?->?-><-?<-?<-RADICAL-SAM<-RADICAL-SAM N6-MTase N6_N4_Mtase LEP1GSC059_0080 259 Leptospira phage vB_LnoZ_CZ214-LE1 DNA methylase family protein [Leptospira phage vB_LnoZ_CZ214-LE1]. 
529283421_?->529283359_?->529283377_?->529283378_?->529283410_?->529283402_?->529283348_?->529283433_N6-MTase*->529283414_?->529283408_?->529283440_?-><-529283441_?<-529283405_?<-529283374_RADICAL-SAM<-529283380_RADICAL-SAM 763469483 N6-MTase*-> N6-MTase N6_N4_Mtase - 259 bacteria>proteobacteria>gammaproteobacteria Pseudomonas amygdali hypothetical protein [Pseudomonas amygdali]. 763469483_N6-MTase*-> 490454463 <-VirD4-like<-?<-?<-?<-TraK<-N6-MTase*<-TraM<-?<-?<-?<-MultiTM N6-MTase N6_N4_Mtase - 258 bacteria>bacteroidetes Bacteroides ovatus hypothetical protein [Bacteroides ovatus]. <-696272134_?<-696272136_?<-490454458_VirD4-like<-490454459_?<-490454460_?<-490454461_?<-696272353_TraK<-490454463_N6-MTase*<-696272355_TraM<-490454465_?<-490454466_?<-490454467_?<-490454468_MultiTM<-490454469_?<-490454470_? 738085263 N6-MTase*-> N6-MTase N6_N4_Mtase - 258 bacteria>spirochaetes Leptospira alstonii DNA methylase [Leptospira alstonii]. 523635656_?-><-523635655_?||738085263_N6-MTase*->523639106_?->523639078_?->544626688_?-> 757125221 N6-MTase*->?->ASCH-> N6-MTase N6_N4_Mtase - 256 bacteria>spirochaetes Leptospira weilii cytosine methyltransferase [Leptospira weilii]. 515130822_?->757125221_N6-MTase*->515130824_?->648273328_ASCH->515130826_?->515130827_?-><-490634456_?||490634520_?-> 490416978 <-N6-MTase<-?<-TraM<-?<-?<-MultiTM<-?<-N6-MTase*<-?<-?<-?<-VirB4-FtsK<-?<-?<-Int-maturase+HNH N6-MTase Methyltransf_26+N6_N4_Mtase - 254 bacteria>bacteroidetes Bacteroides MULTISPECIES: DNA methyltransferase [Bacteroides]. <-695344547_N6-MTase<-490416985_?<-490416984_TraM<-490416983_?<-490422209_?<-757748895_MultiTM<-492201468_?<-490416978_N6-MTase*<-490416977_?<-757748908_?<-490416974_?<-757748898_VirB4-FtsK<-490416972_?<-490416970_?<-757748901_Int-maturase+HNH 492201466 Int-maturase+HNH->?->?->VirB4-FtsK->?->?->?->N6-MTase*->?->MultiTM-> N6-MTase Methyltransf_26+N6_N4_Mtase - 254 bacteria>bacteroidetes Bacteroides fragilis DNA methyltransferase [Bacteroides fragilis]. 
490416968_Int-maturase+HNH->490416970_?->695344546_?->490416973_VirB4-FtsK->490416974_?->695551343_?->490416977_?->492201466_N6-MTase*->492201468_?->695551347_MultiTM-><-695551206_?<-695551210_?<-695551214_?<-695551218_?<-695551229_? 494836062 VirB4-FtsK->?->?->?->N6-MTase*->?->MultiTM->?->?->TraM->?->N6-MTase-> N6-MTase Methyltransf_26+N6_N4_Mtase - 254 bacteria>bacteroidetes Bacteroides plebeius DNA methyltransferase [Bacteroides plebeius]. 490416966_?->494836051_?->494836054_?->494836055_VirB4-FtsK->490416974_?->696379523_?->494836059_?->494836062_N6-MTase*->494836064_?->494836066_MultiTM->494836068_?->490422207_?->494836070_TraM->494836072_?->494836074_N6-MTase-> 696379522 <-MultiTM<-?<-N6-MTase*<-?<-?<-?<-VirB4-FtsK N6-MTase Methyltransf_26+N6_N4_Mtase - 254 bacteria>bacteroidetes Bacteroides vulgatus DNA methyltransferase [Bacteroides vulgatus]. <-490439537_?<-490439536_?<-490439534_?<-647589678_?<-490439531_?<-696379543_MultiTM<-494836064_?<-696379522_N6-MTase*<-494836059_?<-696379523_?<-490416974_?<-696379524_VirB4-FtsK<-494836054_?<-494836051_?<-490416966_? 738211128 N6-MTase*->?->?->?->Terminase_LS->?->?->MuF-> N6-MTase N6_N4_Mtase - 254 bacteria>proteobacteria>gammaproteobacteria Lysobacter dokdonensis hypothetical protein [Lysobacter dokdonensis]. 738211113_?->738211116_?->738211118_?->738211121_?->738211123_?->738211126_?->738211371_?->738211128_N6-MTase*->738211131_?->738211133_?->738211136_?->738211139_Terminase_LS->738211142_?->738211144_?->738211147_MuF-> 489305329 <-N6-MTase<-?<-URI<-N6-MTase* N6-MTase N6_N4_Mtase - 253 bacteria>firmicutes Bacillus pumilus DNA-cytosine methyltransferase [Bacillus pumilus]. <-736611803_?<-489305170_?<-736611729_?<-489305717_?<-489305499_N6-MTase<-736611804_?<-489305926_URI<-489305329_N6-MTase*<-736611730_?<-489306211_?<-736611731_?<-489305838_?<-489305408_?<-489305272_?<-736611732_? 
549781736 DHH->?->?->?->?->N6-MTase*->N6-MTase-> N6-MTase N6_N4_Mtase - 253 bacteria>firmicutes Bacillus amyloliquefaciens Modification methylase RsrI [Bacillus amyloliquefaciens]. 549781723_?->549781725_?->549781726_DHH->549781728_?->549781730_?->549781732_?->549781735_?->549781736_N6-MTase*->549781738_N6-MTase->504230838_?->752856931_?->549781740_?->752856932_?->545132120_?->545132119_?-> 500205672 METHYLASE-><-NucA<-?||?->?->N6-MTase*-> N6-MTase N6_N4_Mtase - 252 bacteria>proteobacteria>betaproteobacteria Burkholderia vietnamiensis DNA-cytosine methyltransferase [Burkholderia vietnamiensis]. 500205679_?-><-500205677_?||500205676_METHYLASE-><-759573915_NucA<-759573856_?||759573857_?->500205673_?->500205672_N6-MTase*->759573859_?->759573916_?-><-759573860_?<-500205667_?<-759573861_?<-759573862_?<-500205665_? 691080530 DCM->?->N6-MTase*->?->?->?->?->Terminase_SS->GT-> N6-MTase N6_N4_Mtase - 251 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii cytosine methyltransferase [Acinetobacter baumannii]. <-447065173_?<-487955263_?||446150822_?->446882801_?->446105617_?->446969423_DCM->691080528_?->691080530_N6-MTase*->691080532_?->446915980_?->691080536_?->447067279_?->691080538_Terminase_SS->691080540_GT->691080542_?-> 751875430 <-N6-MTase<-N6-MTase* N6-MTase N6_N4_Mtase B4069_2083 249 bacteria>firmicutes Bacillus subtilis Adenine-specific methyltransferase [Bacillus subtilis]. <-751875423_?<-751875424_?<-751875425_?<-751875426_?<-751875427_?<-751875428_?<-751875429_N6-MTase<-751875430_N6-MTase*<-751875431_?<-751875432_?<-751875433_?<-751875434_?<-751875435_?<-751875436_?<-751875437_? 505338314 - N6-MTase N6_N4_Mtase - 248 bacteria>firmicutes Ruminococcus sp. SR1/5 DNA modification methylase [Ruminococcus sp. SR1/5]. 756411756 <-HNH<-?<-?<-?<-?<-N6-MTase* N6-MTase N6_N4_Mtase - 248 bacteria>firmicutes Bacillus cereus hypothetical protein [Bacillus cereus]. 
<-447151656_?<-446510145_?<-446579460_HNH<-446891195_?<-446977878_?<-446667118_?<-446680756_?<-756411756_N6-MTase*<-756411780_?<-446637401_?<-756411757_?<-446569170_?<-756411667_?<-446700868_?<-446897432_? 655162666 N6-MTase*-> N6-MTase N6_N4_Mtase - 246 bacteria>firmicutes Paenibacillus harenae DNA methylase [Paenibacillus harenae]. 655162659_?->655162660_?->655162661_?->655162662_?->655162663_?->655162664_?->655162665_?->655162666_N6-MTase*-><-655162667_?||738828102_?->655162669_?->655162670_?->655162671_?->655162672_?->655162673_?-> 738793084 <-N6-MTase* N6-MTase N6_N4_Mtase - 246 bacteria>firmicutes Paenibacillus sp. FSL H8-237 hypothetical protein [Paenibacillus sp. FSL H8-237]. <-738793073_?<-738793332_?<-738793334_?<-738793337_?<-738793076_?<-738793079_?<-738793082_?<-738793084_N6-MTase*<-738793087_?<-738793090_?<-738793093_?<-738793097_?<-738793099_?<-738793101_?<-738793338_? 496003735 N6-MTase*->N6-MTase->METHYLASE-> N6-MTase N6_N4_Mtase - 245 bacteria>firmicutes Erysipelotrichaceae bacterium 2_2_44A DNA-cytosine methyltransferase [Erysipelotrichaceae bacterium 2_2_44A]. 496003721_?->496003723_?->748747028_?->496003727_?->496003731_?->748747029_?->748747030_?->496003735_N6-MTase*->496003737_N6-MTase->496003740_METHYLASE->496003742_?->496003744_?->496003745_?->496003746_?->496003748_?-> 695862061 N6-MTase*-> N6-MTase N6_N4_Mtase - 244 bacteria>firmicutes Lactobacillus paracasei DNA methyltransferase [Lactobacillus paracasei]. 511748281_?-><-695862042_?<-695862043_?||511748282_?-><-695862045_?||511748283_?->511748284_?->695862061_N6-MTase*->511748286_?->511748287_?->511748288_?->511748289_?->511748290_?->511748291_?->511748292_?-> 736516671 <-MuF<-Phage_portal<-Terminase_SS<-?<-Terminase_SS<-?<-N6-MTase* N6-MTase N6_N4_Mtase - 244 bacteria>firmicutes Lactobacillus kunkeei hypothetical protein [Lactobacillus kunkeei]. 
<-736516664_?<-736516665_MuF<-736516666_Phage_portal<-736516667_Terminase_SS<-736516668_?<-736516669_Terminase_SS<-736516670_?<-736516671_N6-MTase*<-736516672_?<-736516673_?<-736516709_?<-736516675_?<-736516676_?<-736516677_?<-736516678_? 149882909 <-N6-MTase*<-?<-?<-?<-?<-?||?-><-MuF N6-MTase N6_N4_Mtase BcepNY3gene09 243 dsdna viruses, no rna stage>caudovirales Burkholderia phage BcepNY3 DNA methylase [Burkholderia phage BcepNY3]. <-149882902_?<-149882903_?<-149882904_?<-149882905_?<-149882906_?<-149882907_?<-149882908_?<-149882909_N6-MTase*<-149882910_?<-149882911_?<-149882912_?<-149882913_?<-149882914_?||149882915_?-><-149882916_MuF 526178238 N6-MTase*-> N6-MTase N6_N4_Mtase N355_gp092 242 dsdna viruses, no rna stage>caudovirales Cellulophaga phage phi13:2 DNA methylase [Cellulophaga phage phi13:2]. 526178231_?->526178232_?->526178233_?->526178234_?->526178235_?->526178236_?->526178237_?->526178238_N6-MTase*->526178239_?->526178240_?->526178241_?->526178242_?->526178243_?-><-526178244_?<-526178245_? 702087568 N6-MTase*->?->?->?->Terminase_LS->?->?->MuF-> N6-MTase N6_N4_Mtase LF41_2421 242 bacteria>proteobacteria>gammaproteobacteria Lysobacter dokdonensis DS-58 Adenine-specific methyltransferase [Lysobacter dokdonensis DS-58]. 702087561_?->702087562_?->702087563_?->702087564_?->702087565_?->702087566_?->702087567_?->702087568_N6-MTase*->702087569_?->702087570_?->702087571_?->702087572_Terminase_LS->702087573_?->702087574_?->702087575_MuF-> 752537274 <-N6-MTase* N6-MTase N6_N4_Mtase - 242 bacteria>proteobacteria>gammaproteobacteria Aeromonas jandaei hypothetical protein [Aeromonas jandaei]. <-752537266_?<-752537267_?<-752537268_?<-752537269_?<-752537270_?<-752537272_?<-752537273_?<-752537274_N6-MTase*<-752537487_?||752537275_?-><-752537488_?<-752537276_?<-752537277_?||752537489_?-><-752537490_? 752704809 N6-MTase*->N6-MTase-> N6-MTase SP+N6_N4_Mtase - 241 bacteria>firmicutes Bacillus subtilis cytosine methyltransferase [Bacillus subtilis]. 
752704719_?->752704717_?->516293771_?->695807941_?->498016884_?->752704715_?->752704713_?->752704809_N6-MTase*->752704807_N6-MTase->752704711_?->516293779_?->751588002_?->516293781_?->752704709_?->695807779_?-> 763125951 <-METHYLASE<-?<-?<-?<-?<-?<-?<-N6-MTase* N6-MTase N6_N4_Mtase - 238 bacteria>firmicutes Lactobacillus salivarius hypothetical protein [Lactobacillus salivarius]. <-763125944_METHYLASE<-763125945_?<-763125946_?<-763125947_?<-763125948_?<-763125949_?<-763125950_?<-763125951_N6-MTase*<-763125952_?<-763125953_?<-763125954_?<-763125955_?<-763125956_?<-763125957_?<-763125958_? 690349817 <-N6-MTase* N6-MTase N6_N4_Mtase LSJ_3100c 236 bacteria>firmicutes Lactobacillus salivarius DNA modification methylase [Lactobacillus salivarius]. <-690349810_?<-690349811_?<-690349812_?<-690349813_?<-690349814_?<-690349815_?<-690349816_?<-690349817_N6-MTase*<-690349818_?<-690349819_?<-690349820_?<-690349821_?<-690349822_?<-690349823_?<-690349824_? 672551258 N6-MTase*-> N6-MTase N6_N4_Mtase JO84_gp345 234 dsdna viruses, no rna stage Aureococcus anophagefferens virus putative cytosine-5 methyltransferase [Aureococcus anophagefferens virus]. 672551140_?->672551027_?->672551135_?->672551187_?-><-672551213_?<-672551058_?<-672551059_?||672551258_N6-MTase*-><-672551060_?<-672551198_?||672551141_?->672551214_?-><-672551007_?<-672551171_?||672551028_?-> 38638617 <-N6-MTase*<-?<-?<-?<-?<-?||?-><-MuF N6-MTase N6_N4_Mtase BCPBV1_gp10 233 dsdna viruses, no rna stage>caudovirales Burkholderia phage Bcep1 gp10 [Burkholderia phage Bcep1]. <-38638610_?<-38638611_?<-38638612_?<-38638613_?<-38638614_?<-38638615_?<-38638616_?<-38638617_N6-MTase*<-38638618_?<-38638619_?<-38638620_?<-38638621_?<-38638622_?||38638623_?-><-38638624_MuF # 14; R-M 496651971 HNH-><-?<-?||?-><-METHYLASE<-?<-?<-N6-MTase*<-REase<-METHYLASE<-?<-?<-?<-?<-DOC N6-MTase N6_N4_Mtase - 369 bacteria>proteobacteria>epsilonproteobacteria Campylobacter sp. 10_1_50 modification methylase [Campylobacter sp. 10_1_50]. 
736902613_HNH-><-496651959_?<-496651961_?||496651963_?-><-496651965_METHYLASE<-496651967_?<-496651969_?<-496651971_N6-MTase*<-496651973_REase<-496651975_METHYLASE<-736902809_?<-496651979_?<-496651981_?<-496651983_?<-489029481_DOC 550983501 <-METHYLASE<-?<-?<-?||adenine-glycosylase-><-?<-N6-MTase*||?->ABC-> N6-MTase N6_N4_Mtase - 366 bacteria>proteobacteria>alphaproteobacteria Thalassospira lucentensis modification methylase [Thalassospira lucentensis]. <-655387197_?<-703179655_METHYLASE<-550983496_?<-550983497_?<-703179657_?||655387198_adenine-glycosylase-><-550983500_?<-550983501_N6-MTase*||550983502_?->703179660_ABC->550983504_?->550983505_?->550983506_?->550983507_?->550983508_?-> 696534926 <-ABC<-?||N6-MTase*->?-><-adenine-glycosylase||?->?->?->METHYLASE-> N6-MTase N6_N4_Mtase - 366 bacteria>proteobacteria>alphaproteobacteria Thalassospira australica modification methylase [Thalassospira australica]. 696535023_?->696534922_?-><-696534923_?<-696534924_?<-696534925_?<-696535024_ABC<-696535025_?||696534926_N6-MTase*->696534927_?-><-696534928_adenine-glycosylase||696534929_?->696534930_?->696534931_?->696534932_METHYLASE->696534933_?-> 488970128 REase->N6-MTase*-> N6-MTase N6_N4_Mtase - 362 bacteria>tenericutes Mycoplasma alkalescens Type II restriction modification system N4-cytosine or N6-adenine DNA methyltransferase [Mycoplasma alkalescens]. <-488970114_?<-488970115_?<-488970117_?<-488970121_?<-488970123_?<-750250321_?||750250339_REase->488970128_N6-MTase*-><-488970129_?<-488970130_?<-488970131_?<-488970133_?<-488970134_?<-488970136_?<-488970137_? 740676991 - N6-MTase N6_N4_Mtase - 361 bacteria>tenericutes Candidatus Hepatoplasma crinochetorum hypothetical protein [Candidatus Hepatoplasma crinochetorum]. 446268888 <-REase<-N6-MTase*<-?<-QRPTase_N<-?<-PS_Dcarbxylase N6-MTase N6_N4_Mtase - 359 bacteria>proteobacteria>epsilonproteobacteria Helicobacter pylori DNA methyltransferase [Helicobacter pylori]. 
<-447055814_?||487802840_?-><-446116267_?||446003551_?->446761496_?-><-658502684_?<-727092483_REase<-446268888_N6-MTase*<-446833673_?<-446375435_QRPTase_N<-447064608_?<-446148357_PS_Dcarbxylase<-446875834_?<-727086548_?<-446836634_? 726979196 PS_Dcarbxylase->?->QRPTase_N->N6-MTase->?->N6-MTase*->REase-> N6-MTase N6_N4_Mtase - 359 bacteria>proteobacteria>epsilonproteobacteria Helicobacter pylori DNA methyltransferase [Helicobacter pylori]. 726979213_?->446880902_?->726979188_PS_Dcarbxylase->726979190_?->726979192_QRPTase_N->726979215_N6-MTase->726979194_?->726979196_N6-MTase*->726979198_REase->726979200_?->726979202_?->726979204_?->726979206_?->447045345_?->446802786_?-> 727305172 <-REase<-N6-MTase*<-?<-QRPTase_N<-?<-PS_Dcarbxylase N6-MTase N6_N4_Mtase - 359 bacteria>proteobacteria>epsilonproteobacteria Helicobacter pylori DNA methyltransferase [Helicobacter pylori]. <-727305182_?||727305180_?-><-727305178_?||727305177_?->727305176_?-><-727305339_?<-727305174_REase<-727305172_N6-MTase*<-727305171_?<-727305170_QRPTase_N<-727305168_?<-727305167_PS_Dcarbxylase<-446875834_?<-727305165_?<-727305163_? 727328309 <-REase<-N6-MTase*<-N6-MTase<-QRPTase_N<-?<-PS_Dcarbxylase N6-MTase N6_N4_Mtase - 359 bacteria>proteobacteria>epsilonproteobacteria Helicobacter pylori DNA methyltransferase [Helicobacter pylori]. 545063097_?-><-727340056_?<-727328235_?<-727060838_?<-727328232_?<-727340057_?<-727340058_REase<-727328309_N6-MTase*<-727340059_N6-MTase<-727327806_QRPTase_N<-727327808_?<-727327810_PS_Dcarbxylase<-727105271_?<-727327829_?<-727327812_? 738491218 <-ABC<-ABC<-?||N6-MTase*-><-REase<-SAM-synthetase N6-MTase N6_N4_Mtase - 355 bacteria>tenericutes Mycoplasma hyosynoviae hypothetical protein [Mycoplasma hyosynoviae]. 
<-738509207_?<-738509210_?<-738489806_?<-738493702_?<-738509238_ABC<-738509213_ABC<-738509216_?||738491218_N6-MTase*-><-738491226_REase<-738509218_SAM-synthetase<-738491210_?<-738493659_?||738491204_?->738491202_?->738509221_?-> 738495747 SAM-synthetase->REase-><-N6-MTase*||?->ABC->ABC-> N6-MTase N6_N4_Mtase - 355 bacteria>tenericutes Mycoplasma hyosynoviae hypothetical protein [Mycoplasma hyosynoviae]. <-738508624_?<-738491202_?<-738491204_?||738493659_?->738493662_?->738495750_SAM-synthetase->738495763_REase-><-738495747_N6-MTase*||738495744_?->738495741_ABC->738495760_ABC->738508625_?-> 635202114 SAM-synthetase->REase-><-N6-MTase*||?->ABC->ABC-> N6-MTase N6_N4_Mtase NPL7_01825 350 bacteria>tenericutes Mycoplasma hyosynoviae DNA methyltransferase [Mycoplasma hyosynoviae]. <-635202118_?<-635202108_?<-635202109_?||635202110_?->635202111_?->635202112_SAM-synthetase->635202113_REase-><-635202114_N6-MTase*||635202115_?->635202116_ABC->635202117_ABC->635202119_?-> 635203155 SAM-synthetase->REase-><-N6-MTase*||?->ABC-> N6-MTase N6_N4_Mtase NPL1_02345 350 bacteria>tenericutes Mycoplasma hyosynoviae DNA methyltransferase [Mycoplasma hyosynoviae]. <-635203148_?<-635203149_?<-635203150_?||635203151_?->635203152_?->635203153_SAM-synthetase->635203154_REase-><-635203155_N6-MTase*||635203156_?->635203157_ABC->635203158_?-> 737788178 ABC-><-?||?->?->?->?-><-METHYLASE||N6-MTase*-> N6-MTase N6_N4_Mtase - 281 bacteria>bacteroidetes Flexibacter roseolus hypothetical protein, partial [Flexibacter roseolus]. 652629928_ABC-><-652629930_?||652629932_?->652629934_?->737788172_?->652629936_?-><-737788175_METHYLASE||737788178_N6-MTase*-><-652629939_?||652629940_?-><-652629941_?||652629943_?->652629944_?->737788181_?->652629945_?-> # 7; 446323973 N6-MTase*-> N6-MTase N6_N4_Mtase - 231 bacteria>firmicutes Streptococcus pneumoniae DNA-cytosine methyltransferase [Streptococcus pneumoniae]. 
446197668_?->446393604_?->446106149_?->447079773_?->446276775_?->446520999_?->446532213_?->446323973_N6-MTase*->446377415_?->446079215_?->446701795_?->446719036_?->487776690_?->446963895_?->446742073_?-> 698840876 IstB_IS21->?->?->N6-MTase*->?->DCM-> N6-MTase N6_N4_Mtase - 231 bacteria>firmicutes Streptococcus pneumoniae putative prophage protein [Streptococcus pneumoniae]. <-698840869_?<-698840870_?||698840871_?->698840872_?->698840873_IstB_IS21->698840874_?->698840875_?->698840876_N6-MTase*->698840877_?->698840878_DCM->698840879_?->698840880_?->698840881_?->698840882_?->698840883_?-> 660643078 <-N6-MTase* N6-MTase N6_N4_Mtase EL26_10775 226 bacteria>firmicutes Tumebacillus flagellatus hypothetical protein EL26_10775 [Tumebacillus flagellatus]. <-660643071_?||660643072_?-><-660643073_?||660643074_?-><-660643075_?||660643076_?-><-660643077_?<-660643078_N6-MTase*<-660643079_? 589891490 <-HNH<-?<-?<-N6-MTase*<-?<-HNH<-?<-?<-?<-?<-DHH N6-MTase N6_N4_Mtase SEP9_059 225 dsdna viruses, no rna stage>caudovirales Staphylococcus phage vB_SepS_SEP9 cytosine specific DNA methyltransferase [Staphylococcus phage vB_SepS_SEP9]. <-589891483_?<-589891484_?<-589891485_?<-589891486_?<-589891487_HNH<-589891488_?<-589891489_?<-589891490_N6-MTase*<-589891491_?<-589891492_HNH<-589891493_?<-589891494_?<-589891495_?<-589891496_?<-589891497_DHH 529047177 URI->Toprim->N6-MTase*-> N6-MTase N6_N4_Mtase N007_05790 223 bacteria>firmicutes Alicyclobacillus acidoterrestris ATCC 49025 hypothetical protein N007_05790 [Alicyclobacillus acidoterrestris ATCC 49025]. 529047170_?->529047171_?->529047172_?->529047173_?->529047174_?->529047175_URI->529047176_Toprim->529047177_N6-MTase*->529047178_?->529047179_?->529047180_?-> 665851256 URI->Toprim->N6-MTase*-> N6-MTase N6_N4_Mtase - 222 bacteria>firmicutes Alicyclobacillus acidoterrestris hypothetical protein [Alicyclobacillus acidoterrestris]. 
750137128_?->750137130_?->544884002_?->544884003_?->544884004_?->665851255_URI->544884007_Toprim->665851256_N6-MTase*->544884009_?->665851257_?->544884011_?-> 740246844 <-N6-MTase* N6-MTase N6_N4_Mtase - 215 bacteria>firmicutes Tumebacillus flagellatus hypothetical protein [Tumebacillus flagellatus]. <-740246740_?||740246840_?-><-740246743_?||740246745_?-><-740246746_?||740246748_?-><-740246750_?<-740246844_N6-MTase*<-740246847_? # 1; 290971699 N6-MTase*-> N6-MTase SP+N6_N4_Mtase NAEGRDRAFT_76461 994 eukaryota>heterolobosea Naegleria gruberi strain NEG-M predicted protein [Naegleria gruberi]. <-290971703_?||290971697_?-><-290971705_?||290971699_N6-MTase*->290971701_?-><-290971707_? 737515587 <-Terminase_SS<-Terminase_SS<-N6-MTase* N6-MTase N6_N4_Mtase - 219 bacteria>proteobacteria>gammaproteobacteria Haemophilus parasuis adenine methyltransferase [Haemophilus parasuis]. <-737515584_Terminase_SS<-538043063_Terminase_SS<-737515587_N6-MTase*<-737515588_?
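The neighborhood strings in the listing above can be read programmatically. The following is a minimal sketch, assuming a notation that is inferred from the entries themselves rather than defined in this section: each gene is written as `GI_label`, a leading `<-` marks a gene on the reverse strand, a trailing `->` marks a gene on the forward strand, `||` separates adjacent genes with no arrow between them, and `*` flags the query gene. The names `Gene` and `parse_neighborhood` are hypothetical and introduced only for illustration.

```python
import re
from typing import List, NamedTuple


class Gene(NamedTuple):
    gi: str         # numeric identifier preceding the underscore
    label: str      # domain/family label, "?" when unannotated
    strand: str     # "+" forward, "-" reverse, "?" if no arrow was found
    is_query: bool  # True when the label carried a trailing "*"


# One gene token: optional "<-" prefix, digits, "_", a label that stops
# before "->", "<-", "||" or whitespace, then an optional "->" suffix.
# (Assumed grammar, inferred from the listing.)
_TOKEN = re.compile(r"(<-)?(\d+)_((?:(?!->|<-|\|\|)\S)+)(->)?")


def parse_neighborhood(text: str) -> List[Gene]:
    """Split one neighborhood string into ordered Gene records."""
    genes = []
    for match in _TOKEN.finditer(text):
        reverse, gi, label, forward = match.groups()
        strand = "-" if reverse else "+" if forward else "?"
        genes.append(Gene(gi, label.rstrip("*"), strand, label.endswith("*")))
    return genes


if __name__ == "__main__":
    example = ("<-290971703_?||290971697_?-><-290971705_?||"
               "290971699_N6-MTase*->290971701_?-><-290971707_?")
    for gene in parse_neighborhood(example):
        print(gene)
```

Labels such as `Int-maturase+HNH` or `VirD4-like` contain hyphens, so the pattern stops a label only at a full arrow (`->` or `<-`) rather than at any hyphen.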
Two alignments are shown. The first covers only the fungal N6-MTases and illustrates the various domains in these proteins. The second shows the core N6-MTase domain.
Boundaries <----Treble-clef-DNMT3-like-----> <------ Chromo ----------------------------------------------------------------------------------------> <----------------------Chromo-----------------------------------------------------------> <-AT-hook-----> <-----------------------------------chromo-----------------------------------------------------------> <----At-hook-----------> <--------------------------chromo--------------------------------------------------------------------> <---------*--*----------*--*----ZZ-finger---------*--*--------*-*---? <----------*--*-------------------PHD finger------------------------*--*-----------------------><-----*--*-PHD----*----* fin*er*---------------*--*--> <-------*--*----------ZZ finger-------*--*-----------*--*-------------------> <--- GATA finger------------> <--- DAM methylase-------------- <---------Synapomorphic strand-helix--------> Str-1 Str-2 Str-3 Str-4 **** Str-5-> Str-6 Str-7 <--C-terminal N6-MTase Str-1 Str-2 Str-3 <---- KRI domain-------------> EEEEEE EEEEE EEEEEEEE <--false fusion of SPX, CYTH and tMs----
FINAL 
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------E----------------------------------------------------------------E-----EE---------------------------EE------------------------------------------HHHHHHH----HH--------------------------------------HHHHH----------------------E--------------------------------------------EEEE-E-----EEEEEEEEE----EEEEEE-----------------EEEE----E--------------------------------------------------------------------------------------------E-----------------------------------------------------------------------------------HHHHHHHHHH----EEEE------HHHHHHHHH----EE--------HHHHH-----------------HHHH------------------------------------------------------------------------------------------------------------------------EE--------------------EE--HHHHHHHHHHHHH--------------------HHHHHHHHHHHHHHHHHHH--HHHHH--------EEEEEEEEE--------EEE-------------------------------HHHHH--------HHHHEE-EE-----------EEE---------EEEE-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------H-HHH-----------------------------------------------E-----EEE----------EEEEEE----EEEEE---------EE--------------------------------------------------------------E--------------EE------------EE-------------------------------------------------------------------------E------EEEE-----E-EEEE----------------------------------------------------EEE---------------------------------------------------------HHHH----EEEEEEEEEEEE------------------HHHHHHHHHH---------------EE
EEE------------------------------------E-EEEEEEEEE-------------------------------------------------------------------------------HHHHHHHHHHH----E------EEEE-----------------------------------------------E--EEEEE--------E-EE----HHHHHHHHHHH---HHHEE------EEEEEE----EEEE--------------EEEEE-------------------EEEEEEEEEE-----------------------EEEE------------------------------------HHHHH----------------------------HHHHH-----------------------------------E-----EEEEEEE----HHHHH-----EEE-----EEE---------------------EE-------------------EEE--------EEEEE-----------------------HHHH-------EEE---E-EE---------------------E------EEE---------------EEEEE--------------------HHHHHHHHHH------EEEEE------HHHHHHHHH---EEEEE--HHHHHHHHHH-------------------EEEEE--HHHHH---------------EEEEEE------EEE--------------HHHHHHHHHHHHHHHHHHH----EEEEEEE------EEEE-HHHHHHHHHH---EEEEHHHHHHH-------HHHHHHHH--EEEHHHHHEEEE------------------------------------------EEEEEEEEE-----EEEEEEEEEEEEEEE------HHHHHHHHHHHH-------EEEEEEEE-----------------------------------------------------------------------------HHHHHHHHHHHHHHHHHHHHH----EE----------HHHHHHH------------------------------EEEEEE-------------H-HHHHHHHHHHHHHHHHHH-----EEEEEE------------------------------EEHHHHHHHHHH------EEEEEEEEE----------------------------------------------------HHHHHH--------------HHHHHHH------- ALIGN 
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------EEEE------------------------------------------------------------------------------------------------EEEHHHHH---------------------------EEEE-EE-----EEHHHEHH-----EEEE-------HHHHHHHH--------------------------------------------------------------------------------------------------HHHHHHHH-H------------EEEE---------E-----HHHHHHHHHHH----------------------EEE-EE-E--------HHHHHHHHHH-----E--------------E-----HEE---HHHH-HHHHH-------H----HHHHHH-------------------------------------------------------------------------------------------------------------------------------------------------E----HHHHHHHHHHHH---------------------EEEEEE---------------EEEEEE-------EEEEEEE-------HHHHHHHH--HHHH--------------------------EE-------HHHHHH-HH---HE------------------------HHHHH------------------------------------------------------------EEEE-----E-----------------------------------------------------------------H-----------------------HHHHHHHHH--------------------EE-EE----------HHHHHHHHHHH------HEE-E------------------------------H----HHH------HHHHEHH-------HHHHHHHHHHHHHHHHH---EEEEE---------EEE--------------------HEEEH---------------------------------EEEE--------------E----------------------------------------------------------------------------------------------------------------------HHHHHHHHHH-----------------------------------------------------------------------------------------------EE-----EEE-EEEEEEEE------------------HHHHHHHHH---------------EEE
EEE------------------------------------E-EEEEE--HHH---------H------------------------------------------------------------------------HHHHHHH---EE------EEEE--------------------EE-----------EEEEE------------EEHH---------E-EE------HEHHHHHHH----EEEEE-----EEE-EE----------------------EEEEE-HHHHH-------H-----EEEEEE-----------------------------EEEE------------HHHH-------------------HHHHH---------------HHH---HHHHHHHHHHHHHHHHHHHHHHH--HHH------------------------EEEEEE----HHHHH------EEE---HHHHH---------------------H--------------------EEE-------------------------------EEEEEE-------EEEE------------------------------------------------EEE-----E-----------------------HHHHHHHHHH------EEEE--------HHHHHHHHHH---------HHHHH-----------------------HHHHHH-HHHH-----------------EEE---------EEE-------------HHHHHHHHHHHHHHHHHHHHH---EEEE-------EEE----HHHHHHHH----HHHHHHHHHHHHHHH-----EEEEEH-HHEHHHHHHHHHH------------------------------------------EEEEEE------------EEEEEEEEEE-----HHHHHHHHHHHHHH-------HHHHHHHH------------------------------------------------HHHHH------------------------HHHHHHHHHHHHH--HHHHHH----HH----------HHHHHHH------------------------------EEEEEE-----------HHH-HHHHHHHHHHHHHHHHHHH-----EEE--------------------------------HHHHHHHHHHHH--------EEEEEEE--------------------------------HHH------------------HHHHH--------------HHHHHHH-H----- HMM 
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------E--H-HHHHHH---EEEEE-----HH------------------------EEEE--EEE----------------------------------------------------EEE--------E-----E----E-EEEE-HHHH-HHHHHHE-----EEEE--------------------HHHHHHHHH---EEE-EEE-HHHHHHH----HHHH-----------------------EEEEE----EEE-HHHH---------E----EEEE---EEE------EEEEEE----HHH------H----HHHHHHHHHHHHHHHHHH-HHH---EEEEEEEEE----EEEEEHHHHHHHHHHHH-HEH-HEEEE----EEEE---HH-HHHHHHHH-------EEEHHHHHHEEEEE---------EEEEE----EEE-----------------------------------------------------------------------------------------------------------------------------------------------E--H-HHHHHH---EEEEE------HH-----------------EEEE--EEEEEE----------EE----EEE-----EE-HHHHHHHHHHE-----EEEE------HHHHH----HHHH---EEEEEE---HHHHHHHHHHH---------------E---EEEEE--------------------EE-HHHH---E----EEEEE---EE----EEEEEE----HHHH----HHHHHHHHHHHHHHHHHHHHH---EEEEEEEEE----EEEEEHHHHHHHHHHHH-HE-HHEEEEEEEE---HHHHHHHHHH--EEEHHHHHHEE-EEE--------EEEEEEE----E--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------HHHHHHHHHHH--EEEEEEEE----------------HHHHHHHHHHHH-------------EEE
EEE-----------------------------------EE-EEEEEEEEE-----------------------------------------------EEEEE---------------EE-------------HHHHHHHH---EE------EEEE-------------E-------E-----------EEE---------EEE--EEEE--------EE-EE----HHHHHHHHHHH---EEEHHHH---EEEEEE-----EEEEEE-----HHH----EEEEE--HHHH-------HH-HHHHEEEEEE---------------------------EEHHHHH-HHHH--HHHHHHHHH---------------HHHHHHH---------------HHH---HHHHHHHHHHHHHHHHHHHHHHH----------H--------EE-EE-----EEEEEEEE---HHHHH---E-EEEEHH---EE---------------------EE-------------------EEEE--------EEEE-----------------EEEEEEHHHH---EEEE-------------------------------E------E-------EEEEEEE----EEEE-----EEEE-----EEEEE-HHHHHHHHHHHH----EEEEEE-----HHHHHHHHHH---EEEEE--EHHH----EEEEE--------------HEEEEHHHHHHHHHHHH-----------EEEEEE---HHHHHHHHHHHH-HH------HHHHHHHHHHHHH-EEEEE---EEEEEEE-----EEEEE-HHHHHHHHHHH--EEEEEEEHHHHHHHH----EEEEHHHHHEEEHHHHHHEEEE-----------------------------------------EEEEEEEEEEE--------EEEEEEEEEE------HHHHHHHHHEEEE-----EEEEEEEEE---------------------------------------HH----HHHHHHHHHH-----------------------HHHHHHHHHHHHHHHHHHHE---E----------HHHHHHHHHHH-----EEE--------------------EEEEEE--HHH--HHHHHH---HHHHHHHHHHHHHHHHH-----EEEEEEEEEE---------------------EEEEEEEEEHHHHHHHHH-----EEEEEEEEEE--------------------E---------EEE-----E----H---EE---HHHHH--------------HHHHHHH-H----- FREQ 
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------EE-------------------------------------------------------------------HHHHHHHHHHH----H--------------------------------------------------------------------------------------------------------HHHHH--EEEE-E-----EEEEEEEE----EEEEEEE--------HHH---H--EHHE-----------------------------------------------------------------------------------------------------------------------------------------------------------------E------------------HHHHHHHHHH-----EEEE------HHHHHHHHH---EEE--------EE--------------------HHHHHHH------------------------------------------------------------------------------------------------------------------------------------------------HHHHHHHHHHHH--------------------HHHHHHHHHHHHHHHHHH---EEEEE--H------EEEEEEEE---------EE---------------------------------EEE---------HEEEE-E------------EE----------EEEE-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------H-HHH------------------------------------------------E----HHHHH-HH-----EEEEEE----EEEEEE--------EE--------------------------------------------------------------E--------------EEE---------EEEE-------------------------------------------------------------------------E-----EEEEE---EEE-EEEEE---E------------------------------------------------EE---------------------------------------------------------EEE----EEEEEEEEEEEE--------E----------HHHHHHHHH----------------EE
EEE-----------------------------------EE-EEEEEEEE--------------------------------------------------------------------------------HHHHHHHHHH-------------EE---------------------------------------------------HEEHHE----------HH---HHHHHHHHH-----EEEEEE------------------------E-EEEEE--------------------------EEE--EEEE------------------------EEEE-----------------------------------EEEEE------------------------------------------------------------------E-EE-----EEEEE-----------------E--HH--HEE---------------------E--------------------EEEE-----EEEEEEE----HHHHHH-----------------------HHHH---H-HE----------------------------------------------EEEE------------------------HHHHHHH------EEEE-------HHHHHHHHH----EEE---HHHHHHHHHH------------------EEEEE-----------------------EEEEE------EEEEE-------------HHHHHHHHHHHHHHHHHHHH----EEEEE--------E----HHHHHHHHHH-HHHHHHHHHHH---------EHHHHHHHHHHHHHHHHHHE---------HHHH------------------------------EEEEEE-------EEEE-EEEEEEEEEE-----HEH-HEHH-EEEEEE------HHHHHHHHH---------------------------------------------------------------------------HHHHHHHHHHHHHHHHHHHHHH----EEE---------HHHHHHH-------------------------------EEEEE-----------H-H-HHHHHHHHHHHHHHHHHH------EEEE-------------------------------HHHHHHHHHHH-------EEEEEEEE-----------------------------------------------------HHHHHH--------------HHHHHH-------- PSSM 
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------HHHH--------------------------------------------------------------------------------------------------------------------E-----EEE---------------------------------------------------------EEEE-------------------------------------------------------------HHHHHH------H------------HH---------------------------------E----------EEEE-EE----EEEEEEEE-----EEEEE------------------EE------------------------------------EEE----------HHH-----------------------------------------------E-----------------------------------------------------------------------------------HHHHHHHHHHH-----E------------------E---------H-HHHHHH-----------HHHHHHHHH-------------------------------------------------------------------------------------------------------------------------------------------------------HHHHHHH----------------------EEE------------------------------EEEEEEEE----------E-----------------------------------------------HHHH--------------------------EEEE---------------------------------------------------------------------------------------------------------------------------------------------------H------------------------------------------------------------------HHHHH-------------HHH----------------------------HHHHHHH------HHHEEE-----EEE-------------EEE----EEEEE----------------------------------E------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------E-----------------EE---------------E--------------------------------------------HH--HHHH---EEEEEE---------------------HHHHHHHH-----------------
---------------------------------------E-EEE-----------------------------------------------------------------------------------------HHHHHH----EE------EEE----------------------------------H---H------------EEEEE-----------------HHHHHHHHHHHHHH--H--------EEE-------EE-----HH--------EEEEE--------------------EEEEEEEEE----------------------------------------------------------------------------------------------------------------------------------------------EEEEEE---------------EEE-----E-E-------------------------------------------------------------------H--------HHHHHHHHH-------EEE---E-EE---------------------E------EEEE----------------E----------------------HHHHHHHHHH------EEE---------HHHHHHH----EEEEE--HHHHHHHHHHH-------------------EEEE----H-----------------EEEEE------------------------HHHHHHHHHHHHHHHHHHH----EEEEEEEE------EEE--HHHHHHHHH---EE---EEEH---------EEEHHH-----EEEE-EEEEEE--------------------------------------------EEEEEE-----------EE--EEEEE--------HHHHHHHHHHHH-------EEEEEEE------------------------------------------------------HH----HHH--------------HHHHHHHHHHHH---HHEEEE--EE------------HHHHHHH------------------------------EEEEEE----------------HHHHHHHHHHHHHHHHH----EEEEE---------------------------------EEEHHHHHHHH---------EEEEE-----------------------------------------------------HHHHHH--------------HHHHHHH------- CONF 
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------87-----8755535221356678883321288888446--3-323224--------56778654322000----------------------------------------------------013887886402-----530065-566686443-345677765542002301--------------2457655566652454222-12314655565----3036777---7777888--------8865544----34511110134------602677653---101013--33532335543025------78765422456410001562677-6178616668885726870888712345665202304430-00134----314554312-4434445754-----5346522223445510-04330044555----54121112121233346677667778877766542010-2566777-8888766666775565554777863235666654578777888-------7651456211-25-5555676604678888874378615662776312111354414410008453211-0244220366435----442121310011223333765--67887665555445-----5576444456777788888444588877530011----35676774122125--20244455556888-88877--76673---32024--------------------40410344677778875254---328887344555870145442366765532343003221010025566234354314898831110421356654535865-542224344587722210121124654663255313-304542455775034125----6726775456543322345------------------------------68-888888878765-------567666676444667888888------------------------------88888888788876543----------421005123346777777888876655566666567888--88887776556633656888888887765543343345467765410-233012334543443443356777776642155433455------675410068720210243343452146664188267777178877787022777554---------4566422345----------4667-----------657766545210--------------0103333334542012-------------------------------------------------------------------------16543212332678401-21100---1-55566677766677766665676673-00-----------------12035------5788843--0-------------------------------1589--3543402001132002567503266888637888---885482566665347687788888877313
441888------------------------888-888712-3306765413---------588----------------------------88888866666-----------88776688--88887-1557234441268303------3732578888643---2412--0020-----------245420035678421--78873133458622-3227865839689998467300000058883254330689825301211002000017704552120001-------16966532445776516888----88----------8-804623445-333587565444456-8---88-78887772113201---------------356---66644212210245676300357--77788--8688-88---31-13-----57886602784112208983-58501220046---------------------6078988---887--54445604507655630345522344010100002244142024001210565440224---6-622---------10011--57530------321337886300124665277715788853247898836764207766256530699874865118887-51677664468006785235888988875032256787766655---16888726312022237-------999525887078953303103455476666752878999999999999875317972999872213683366401678999987185011123460722565321000366620023215664047336778888201088--88777-------------------76040024652245503204301510356326766003537285676520278996155544431--------6775445667777-----------------66----63333455655----424157--888888782736899997875835207652--3021166766--4021112310668888556------------88---8-617887236524--3155110-447889999888888875687453897410112----------------777-8512021004145455636788-613624776337777788-----------886888-----7412388886765616--41-1335578--------------8999865-589888 MVEG_09762_Mortierella_verticillata_NRRL_6337_672819038 
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MD-----ASTAPPKKTVLPQFHSVYPSALSSSSSNNPV--L-EPKKRG--------RKPKARPVEQAPVS----------------------------------------------------VTSSPGESGESL-----VDIDSI-STSSSSLSS-ALSLSSIATSPSFSLSSL--------------SPLSSPTSTSSIDIVFDTH-QSPRFESIDPS----VTLLTPS---STASAGR--------KRKQSQP----PQHKHNPVAQD------HHDSVDNNI---MLDNFH--TTGAFLTRTASKLL------GGQSVTGLDSNIHGYSLGRPCKV-FGTDKVWYFGTMVGLKNGKIRIRFDDWGSDWDEWLASDSR-RLVKS----LTPEETEQR-NARLATLNEG-----STSTTTISATTVVKTS-ALGVQASAPTI----GPNAKGKSVKEAKMPMEPKYKKIKIAPATHTSDTVVRT-LGISKMD-QQNTASTSALTSTLDPTLIQKQKTTKTTKPPKTPKTPKTPKAP-------KKKEQELLPS-EI-PSPTGNETAITSIAVSEATSTVKKKPKTPKNTSKKAGTVAPSSTDLVAAPDAMSA-TSVATDTHLNMQL----ETTHAVAKVKKELKRPKKASK--TAAQDTLDGDQVSS-----LVDATFTPGAPGDTVKPKKRPKAKAAEVPISLAA----LFASPVPTEQTLAL--ASEATIEITTQPDT-LSSTE--STLST---PPPRS--------------------MSATPTLSEENQTTDESKDS---ESSPDPPVTRTLGNPIDLGLLPLEITEFEYEDESKELMGMIHRVLDTGKIQNRVVEYVPEDDYYDTAGGYRKKKKRPNGDG-ENGDEEDDEGSKRTQTKTKDKFVAILESRHQQQILK-QHISGMPLPSSFPFVPPP----PPKKKLTKEQREALAQKCE------------------------------HA-MRNMCHEPMIQL-------VRQVIYTNSDRVEAINNYNNN------------------------------YSKSQPKIVDRKERFAQ----------ALAGRKLTKLNRGLPLARKIIGPTSFAAAQGLTFNEAGY--IETPVTKIIAAAKTIATSTPVKKKRGRKPLSLKAAVASTAGGGTGLD-ESQAGFGKDGKDGKINPFQSLRSLTQTKKRKIGVAEDHI------ANLKRLYTPGTRIQARDKQMEWLLARVRDLRNSRVLVQYEGFPAFYNEWIDINSERL---------KYDTTLEQTP----------WDPN-----------ATSTRNPLTTTT--------------ITASHPVSSNDSGTTA-------------------------------------------------------------------------PPPTDPEADSVPQDTPE-DKALT---G-KKAIREKKGGVTESTPTDQEKAEESI-SA-----------------EEGPV------DDGLDAV--E-------------------------------EENA--AVVNCIQCQVKISQFRIYCMYCEVESKAVVQSDP---PCEPFNLCLWCFSNAFPEHHDHPRSSFATK
VIVGPK------------------------GVR-PVKGGI-ITRFEKDVLD---------LEY----------------------------KEPEKPAAPTL-----------SPEDQLNA--MMRLD-NDQSYVYLDQWRERKV------CAFCNDEGLANKD---PFIG--PYPF-----------LLASTNRYGDAKKKN--FWAHDACARHSPEV-IQGKDGTWYNVSMAMRRGRTVKCTLCKEKGATIGCFEPKCYRSFHVPCTGKPMSHFEDGVIFWCPQHEKA-------YLQRDAYDETFSCDRCSKIL----GV----------N-PWSTCIKCS-DDFFHTFDLCRECFS-K---DD-INHEHGKDDFKITS---------------LEL---LRSEQLEKEAAMVIPMDEALANK--KKPIS--YKPK-MR---GL-SR-----LVCSYCWSATSTKWRKGYNG-VLMCEDCFSAG---------------------PVNDTPM---QPP--TTLEESLSDQALSNPNLPPLVVGGSGVGFVDTENPKGVGRYATSAEDYSHTPYLTRTSV---S-AVR---------FDHSS--SQAVY------LDSYGPSENQLYSLPIDTTYYDIPGRAPRWATHSGTDYHGTWLPQTVRRAVTKYTNPNDKILSNFLGRG-TDAIECFLLGRCCTAVDINPAAITLSIRNCSFAIPPNGTVKAEH---RPTILQGDSRKLTGPLF-------ESESFDHVLSHPPYKDCVAYSTHIDGDLSRFGNSIEFQREMTHVVQETYRLLKMGRRCTLGIGDNREHCFYIPVSFQLIRQYINQGFELEELIVKRQRYCAMFGLGTYLCVQFDFLCFTHEFIATLRKVPKQGHDTMILEP--DYSLL-------------------DLVDVTGTVRAIPSCPIERKSVVMGSVWTFKPTEEFDFPTLCASRMVERFGKNDSNWEEFQIEFK--------QTIDPNVVSASDD-----------------DD----TLNDIEDEKEW----TEDAVA--ILPPEEENLVSYERDRLQQIQENNRMLLAL--GLITELSETS--DDIGHQIKLKNSNDNTCFP------------PP---A-ETVLWLVAHIPC--TQMKTHQ-VPAYRTAIMNLARKALVQLPLTGVFVVGAQDIR----------------TEK-GKLLPLGMLILEDIVRVVGDDC-LRLKELIEAVPDGYQKDR-----------RKITSW-----EEYQEEACSPNDQIPK--KH-LPIVHAC--------------YLVFTKV-KEPVKP-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
----------------------------------------------------------------- Crev1000002507_Coemansia_reversa_Crev1000002507 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSGPDRCHGADGPLSPLA---------------------------AAFDYDS-----------------SGSLSSLDSFLDALSDCADDPLRPDSSLLPE-------RQFNRSASATDAKSSQTVPLK---QQEHKKEKQNL------------P-SRNKRER--------TADVSSP----KPRQTVNLQRQ---------SMQTLV---SSNNMI--TTGAFLTRTTLRKLGVADIVADPE-----ASNAGLAVGSRIKV-LNLDKHWYTAVVLAIDSGKALAHYPGWEHCYNEWVLIESR-RL-LY----RGKADLGVS----------------GSTKAMEQLE-------ALYSGVEEPVL-I-------------------------GFDLAQAINDAFGIS---------TGCNLKNGGADE-AAANVFVNSVDNIDSGTLARKQS--------------GVKESKEHRS----RGRPAGTRNRRRAGHIKAKSSRKQR---P--TIQNDK------AKKTLVGDQEES-AAAECPSTPGVP-----------AEL----RVPSVRLV--RAAENPYARCHRSE--------DIFCGDSDENEATARESCLTAVRDVHNGGNEE---PSGKKARIADD-----NDTATASSGNAIWH-LTRGD-----------------------------------YVTTGAFSTRRTIKALAHSGSTGGIMQD-HHGYYPGQRVEI--MNANQ-----SWYQGRVIAY-----ANKKFL---IHYGGWDHANNEWIVAGSRRMRPASDI----NDMAVTET-EEMARKACVVLVDEYNTYIDGVER-KNAEKADAKRKARELRKTPVNVRVAKLAQSMSADSMGDDEA-EEMTHACLPENE-----EEDDEDVDPI-----SVEAGYTPVPQL-----LRVKDYVQLFRKGMQIAARDRNKLWWRATIAEIKTFRLRIHYTGFGSSWDEWVEMNTQRIMFEESAESSRCDAEMAGDPLHVSGENSSALSKEPNCMGQVISSSSHGKDAGQTDYGTVE--ESQETKGIPVPR---RLGRPPGPETKSTPLSLRLALKALM-SDREMF-EQCHPE-------ELDVFHLPKEHMSMR--------DYS------TFLKVGDRVRIRDRDK----QWYDCTIIDLRHGRIRICFNGHSDEFNQWIPVNSDRI-RILR-ETIDGDKRLEKME----KESQIAQRRKQEKLRAQRRKRSQASIASLVRLAES-----------------------------------------------------------------------------------------------------LEYIVDCEDTFVSGHTQ-VGPQD---G-ITELDAATEVQSRLEGSTEDDASGGD-DK-----------------PLLQLM---M-
ESDIDNV--DGMPLLTRILLAEHFKRQRFGALIRHGSMVAMQDSA-TWFVYCNQCNIVISTFRYYCLSCERPSD-----GY---DYESYDLCLMCFSRQFPSDHPHSQASFARAAVGDAESIVKFTADALSRCRDHERLAAASAHML-DLFSGL-IAVYEPDAFDTSYKPRTP-GTSLWSKLAVGLHGTTTSTLDTSAVVGKIIGNTRRSRITSIINP---DSEML-SSCNGHDA--DASSD-KEDKDETDRQLCKADVDDLPPRCAFCSEDDQSQRDLLGTFAAEQP--F-----------VLSMVRDDGTVRRRR--FWAHTACAKYSPEV-LVTEAGQWFNVAAALRRARTIKCAECKRRGATIGCFHDRCQKSFHVACAGMSKSFFESGRIFWCPKHARMAAGVVEGNAGPEPVSLEARCANCNHEL----SG----------DLMWMECLECL-AEPERQFSLCLTCYD-SKDALA--DHPHKKRCFREH----------------------------LSHTGGVSSNGQYLADI--AAQDS-------RR---RV-GKGT---TCCHYCRSRQSRRWRKGYAG-VVMCEACFNTAHSLRGGAQAKQVQAGTVCDQDLFAEADN---DS----PGELEVVALNPFGRS---LITGSDAQPLPPPQQQ----QQGALIEDYTQGIYFTREACIAPN-RVG---------LPSVSQ-QPLGE------LSSYGPTDSMLFTLPVNTSYFDIPGRAPRWASHSGTDYHGTWLPQTVRRALLRYTQRGEHVLSNFLGRG-TDAIECFLLNRKCVGVDINPSAVSLSQRNCSFTITPGCGMSIEF---RPTIMQGDARDLRSDLWPGASYFAESESFDHILSHPPYKDCVLYSTNIDGDLSRFPGPDEFQREMEKVVTESWRLLKMGRHLTLGIGDNRAECFYIPVSYQLIRTYISSGFELEELVVKRQRYCQAFGLGTYLCVQFDFLMFTHEFIATLRKVPKDQIDSMHLAD--RHYAEDSEFGLQTVTVDKDPLDFRLVAISHRCLREVPASPIERKGVVMGSVWTFEHHPVHSFTHMCMSRMVERFGRDGSNWEQIDLALRP---LE--QGTTENAADGTNAASDIAAASDTQCTGDVIDD----KCASLNNARN-----QAESDP--ELLDSDTEEGGYERARQRQIQQNREQLLQL--GLVSELGEDS--TDIAHYQKMIAMTP----L------------PPTSSA-PLALIVVPHILN--TEFARCH-VEPYRRTLVQITHDASHRLCPSGLLVLGVQDVR----------------DEH-GKLWPLGMLVLEDVQRAVGSIR-LRLKEFIVVVENGYARKR-----------DDVMSR-----ETFVDEQCVVEVNTPD--IH-VPIVHAY--------------YLVFMKL---K----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- RMCBS344292_09167_Rhizopus_microsporus_729708575 -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSSRPFAWQLFQNCGATEELKLETRADPKNAPDKKKFDT--------FMGKLLLKSNNNNNKPTN--------KQSTLTEEPFANLFRV---------RKNKLQDS--NDDA-----KFTFKYLPND-----------GKKENRL----------------SNG--KKTRW---D--VG-----QEEEQKKDEVTQKEEPSLEE--KEENPIDLF----DDDS-----------------------------ESLTSLDTLQSDDDVLGPWL----------------NLA--------SDKKEQE---------CESCVQLNDKDRDPMDA-AHCCSTCADQWKTLLSDLLEKI---------QSSVVNTSAGKKKKVKLD---KSIREQPSTKK--------------EKQPQQR--------TRGKKSA-I--PPTQSRHSQ--------------STN--KKMETFV--STGAFMTRKTAETL------ADPE-NGFYPNPYGYVHLQVVEV-LNINGIWYRGTLEKMDKGKVKVKYSDWDDQ-E-WIIMGSR-RL-RI----VPPEVIAKE---------------------------------RDE-KKEKD--------------------------------SSKDALTVIPRA----K--G-RDDDYVSSTLD--KDPHQLFNDNEVFMTRRMARAMVDEYGFRPNSF----GYRRNRAVAV-VF------NVKDKECIGFLREMRDNQVRVWYP--DLYQSEWVRVGSRRLRLLSSEEEE--KYKQHVDLDVQ------------------EVPAVTEQ--KKVEDAPKEAAPKE--------STPEPKTKKLRQ-KKAKSKTPTPEPETHQEKQ--------------------------QQKELQK-PVESS-----------------------------------FLTTGAFATRRAMRQLQDEN---GFVPN-PYNYTYNQPVEI--LNTRSGKTH-FWECGRLVAM-----RPGQVK---VHYDGWDEAYDEWVMVGSRRIRILSKE--------------------------------EEENK-KKHNELLVAEANPEVQDEVKRKRKHQVIRPEDYAKLGLLES-EQT-----------------------------VIKQKKKPSKEI---------------------------T------------------------------PVESDSSSSEEDEEEYE----------EPSIRRRSRKASKNKKKKA------VKQKQQLQQQKAAE--HPTTTTGAEEEKQIIS---------------LRVAQAKAS-EKYEFV-ANVYGY------------------------------DYM------QHVTVLNLDK----------KMYEARLVSMHKNKVKVHYCGWPDIFDEYITVGSRRI------QPIENDHQVECIE--------------------------------------------------------------------------------------------------------------
------------------------------PDYQKRYEKIMQDGPTE-CQHQH---QHQPAPKKLNRKRLTLDDVQEEEGQG---EY-----------------HKEPNE-E---EEDIE----V-------------------------------VEMD-SWKVYCNQCNVVIKQFRYYCTYCENPSI-----GY---DYRSFELCLRCFDQNFPFWHEHPRSSFAIQAVIDAE-----------------------AGPR-PIKGEL-VTVWEEDVLEEEHQQEEQ------------------------------------------------------DASSIFTG--ETPIE-GDQGYKFLKRWKRRKV------CAFCNDDDDTSEDL-GSFIG--P--F-----------IIATYNKNGVEKKRS--FWAHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYVCKEKGATIGCFESKCSKSFHLPCSQKPVSYFKSGVIFWCSVHEAY-------YNKKDTYVNVFNCDGCGKKM----EE----------E-SWFTCVPCSTSSYFSSFDLCNECFE-K---FP-EDHPHNEDDFEETS---------------LAI---IKEMEAQKARETARKKEEAREAN--AKKKL--LFPRKRR---KL-PDGSAP-VSCCYCGTTEAESWRKAYDGGVMMCNPCFELA---------------------LMVDNDE---RP----TSDMPLVIHN--AEQ-----------------------QYMTSIEDYSHKPYFTREAA---I-KID---------SETA---VVGQR------LTSYEPQPNQLFSLTFESTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKHTAKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT-SAEY---RPIIAQADSRQLTGALF-------TDESFHHILSHPPYKDCVTYSTHLDGDLSRFTTIEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFICTFRKIPKENVDRMLINE--EDQ---------------------HRIPTKRTLRGVPSSAIMRKSVVMGTVWVFKPTDSFRFDQLCISRMVERFGKDDGNWEHIELDLEE-------PHQKQQSVKPAEL-----------------EK-----------------------GK--ETNEEQEEISDYERQRLKRIEENNKTLLKL--GLISELSEKS--DDIIHYENMMSKPP----Y------------VE---S-DLVLMIVSHQQ-----ILPQY-INSYRQTLVGIAKEAIERLAPKGMLIIGAQDIR----------------DPVSGKLWPMSMLILEDIERAVGRDD-IRLKELVVTVPDGYSKDR-----------QQKPRS-----EEEEEEMIDIE--TID--DF-VPIVHAV--------------YLIFQKL-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ Pbla1000013272_Phycomyces_blakesleeanus_Pbla1000013272 ------------------------------------------------------------------------------------------------MPSDPFSW-----------QLVRPTAEA------EGVHPKPIKPPLDRWKGAVHGNCENSTQKHSFGTA---------------DTHKSQASDASLSSSTPIPVLEKPKQTNPIILTSKFGQ--------FMGKMLLQSSNNRTKPVPKINE----HSQQSNEPSYTGYFNIERKRRSSDGLKVVLRAV--GQEERSESSTQKVVELKDSIP--EDTIM--DDTMDDTMDDMSIEEESIEDLPTSNSLLKQNLT------VN-----TTIGEASTNNTPIEEVVLKNTPSPISPIGLN----QPKDITLSDPRYHHKKGKRTRWDVGPILIPEDMECTQSFASLQLDDTSKA------------------------------LDYEPENLCDTLATGDCDACNIMNSQERGPLDQ-VALCESCKQTWLPNAKSLLDRL------SKHSKDFVKPTQKQTSKQTLK---QTSKLLSKQQK----QPHQQDN---QSQSQPQ--------SKQKQTP----KQTSKKTTKRLPPAKSVHLHSKKSSI--QEAKNYY--TTGAFLTRNTAKQL------VD-E-AGFHPNPHGFTNMQKVKV-LNINGHWYRGILTMMYGSKVKVHYLDWDDQ-EEWIVMGSR-RL-RG----LTKDEEEEDEDANEEEDGEDAAIENENKDEENEEKDEDKDKEEEEEEEEDDILSIPAESKTTQPVKKGEHLSVSP-----KSHSRKSQSNNVSAL---------NPKKHYTTPID--TDPTQIFNDNEIFMTRRMAHQLTDEHGFKPNSF----GYRYNRAVAV-SL-RAEKGKRNRMEYNGLLREMRGNQVRVWYP--SLRQSDWLIIGSRRLRVLTDQEAS-ELDNLGTELVRTMDTRAKDSEISTKLPTETKTPSPSTT--ETLPEPQSESVPKH-----VEETVEDTVEDTVEGEVEGIAEEPVEAPVPEPAPGPI-KRGRGRPRKVALPVDPSNPHIVTIPKTPTK-TALKKRNDSIKNGIKETKKAAAAAAMVVEMGTKDTEDALDYLTTGAFATRRAMRQLKDEH---GFVPN-PYGYVYDQPIEI--LNTRSSKNK-FWERGRLIGM-----CPGKVL---VRYDGWGEVYDEWVMVGSRRIRPAAAQIESSGDQKSSTMASTENTAPTSTLNGGSASTKKRAKQ-AARNDLLVTEANPEVEDEARKKRQHRVLGPEDYERLGLLAGSEKVEKIERRGRKKMVRDVETPKETETMKDKAVLIEEEPKPIEPS-------VEAPIVQNEDQEMAEPESDLT------------------------------KPNGAMTVPEGTQNDLPVKRKKQKAKTQQRKRKPAKATSPSPSVSSSTSLQQTQAQTELSTELSTE--TETETPTPISTSIVSVAATEADHDTSTLTSSYRRHIPSDE-SNHGFV-ANVYGY------------------------------DYL------QHVQVLHLDK----------KWYEGRLVSMERNRVRVHYCGWLDKFDENIAVGSRRI------QV
IENDHEVVCIE--------------------------------------------------------------------------------------------------------------------------------------------PTYSERLEKMQEEKEKK-AVEPE---D-AQVVKPSKRREVAPTVVPAPEEPVHG-TH-----------------DMVEYH-----MEAVDGM--E-------------------------------VEENDTWKVYCNQCNIIIKQFRYYCTYCETPSE-----GH---DYQSFELCLRCFDQNFPFWHEHPRSSFAVQAVIDAD-----------------------MGPM-PIKGEL-VTVWEEDILEEIPDDTQ--DDL----------------------------NDPDDMFSGTM-----------EASEVFSG--VAPLD-EDQGYKFLKRWQRRKV------CAFCNDDDDTSTEL-GKFIG--P--F-----------VITSFNKNGTEKKRS--FWAHDACARYSPEV-FCTSEGKWYNVTIALRRGRGMKCYGCKEKGATIGCFESKCSKSFHLPCAQKPVSYFQSGVIFWCNTHEAY-------YKKKDTYVNIFNCDGCSKRL----ED----------E-TWFTCIPCA-SSYFSSFDLCAECFH-N---FP-QDHAHDEDQFEETS---------------FAI---LKEVEAQKATEAAKAKEELRAAN--PKKKP--LFPKRKR---RL-ADGSVP-LTCSYCGTEEAESWRKGYDGGVLMCTPCFELA---------------------LFIDNDG---N----TASNESLVIDSE-ETH-----------------------RYVMSIEDYTHKPYLTRDAV---S-ATK---------FSDH---RTGPR------LASYGPQPNQLFSLVFDSTYYDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKYTNKDERVLSNFLGRG-TDAIECFLLQRRCCGVDINPAAVALSQRNCCFEVPPGLT-SAEY---RPIIAQADSRQLTGALF-------ADESFHHVLSHPPYKDCVAYSTHLEGDLSRFTSVEDFRAEYGRVVRESWRLLKMGRRLTLGIGDNREHCFYIPVGFHLLREYINHGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFVATFRKIPLECTDKMLPID--NSD----CR---------------DHVRVEHVTKAVPQSAISRKSVVMGTVWIFKPTDTHTFEQLCISRMLERFGKDDGNWEQVLLDFMSPESMMIQNNVQQQYQSSTSS-----------------QNVHKDKPEEQEHDRDLDLDQDQEQEN--NKEDSQNQLSDYEKLRLKRIEENNQTLLKL--GLISEMSEDS--DDVIHYESMMSKKP----L------------EN---A-PLVLVMVGHQP-----IEPRQ-IGLYRETIVQIALEAVKKLAPLGMLIIGTKDIR----------------QKDNGKLWPMSMLVLEDIERAIDRSV-LKLKEMVVTVPEGHSKDR-----------QQKNLN-----TEVEEE---LE--IVD--EH-LTIVHAI--------------YLVFQRMNYSHNYN-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Bcir1000010688_Backusella_circina_Bcir1000010688 ---------------------------------------------------------------------------------------------------------------------------------MEDIPTHPKYEWSSTQEKRVFNMLDEKTKEYRKQKE---------------DPFSWQLPCSVKFQQDNPNTGETTTNKQ---KTSKFGN--------FMGKLLLKSSTNASKIKKSVPKPLLVEKKDSTEMPLTGYFRF---------MKRRTHHS--NTSEDNGFPKINFKFSTARVLGNENTQIVGLNTQTNI----KPKYRLING--------KRTRWDVVSPVAN-----NSKDEDSSLDLKNKEVVLPQDQSQEPTSSTE----KSDMRE-KCIDF--DFQYKEDFNDDVV------DDGSSLGSLQSDDDVLGPWIELGMFDSNVSETKKSDHFSSFYSVDNQQNLKSLDLPEIQ----CLDCKLRTAYDKKPLDI-SNLCLSCQNKWSDTLSNVFRKFEFAVL-TQTCKEVTEKPKSKKVKVTND---NKKASVEKNDV----LHKEKTG---GEKLLPQ--------KLKKSPPAIGIPPTKSNYTRKT-NKKMATGSSKKNDI--PKKTNAT--AKPIVVATPETNDD------DDPP-FGFCSNPRGLVYKQVVEV-LNINGHWYRGTLELMDKRKVKVKYIDWDDQ-EEWVIIGSK-RL-RT----IQLEDKESD----------------QQTDQQMKSKGEST---ADEVSKNNPIY-------------------------------FRKIAAAVKSK---------EPDDYVSSTLD--KDPTQIFNDNEVFMTRRLAQELVDEHGFMPNSF----GYRRNRAVAV-TFYTSSKQRKQKEESVGYLREMHKNQVRVWYP--DLHQSEWLLVGSRRLRILTEEEEESILFDSSIDLDRQ------------------EVPKIQEI--AQIENKIDEI------------PIINPPPKRSRGRPKKTLPTEVVEIATEEDTN---NVYEPEQVPQSTI--------ILEKKVTEE-HGQGD-EAKVSN----------------------------FLTTGAFATRRAMRQLTDQS---GFVPN-PYGYTNNQAVEV--LNTRSGKKK-FWEFGRLVEM-----KPGKVR---VHYEGWSDLYDEWIMVGSRRIRVAQEQ-------------IPQKEDNDEVIAAPVP----------KTNDLLMTELNPEIRDEVKRNKKHKILSAKDYQELGLLVNIEELAAKELRKKK--------------------LHEKKTEEMGTT-----VKVKAVSKTKSKKSEIGGDKYED------------------------------EHDDEDIDEGDLDNDYQ----------DTVVKKRLKSASKFKRKVK-------KSKTKIAKQTPCE--HHSPSPPPANDTQVIS---------------LRLAQARAS-NS
QSFV-ANVYGY------------------------------DYM------QHVTVLHLDK----------KFYEGRLVSMRKNKIKVHYCGWLDAFDEYITCGSRRL------QVIENDHEVVCIE--------------------------------------------------------------------------------------------------------------------------------------------PNFKERYESM---KSTG-EPSLP---E-ITPVNRIVRKRITLDDVCEEDSEGQR-EY-----------------HKEPSG-----EGEDEEE--L-------------------------------VEMD-AWKVYCNQCNIVIKQFRYYCTYCETPSA-----GC---DYHSFELCLRCFDQNFPFWHDHPRSSFAIQAVIDKE-----------------------VGPM-PIKGEL-VTVWEEDVLEESVNITNE-DEE----------------------------KNGEENIEPMFES---KIDSV-DASKVFSG--DASIT-TDQGYKYLKRWKRRKV------CAFCNDDDDTSNEL-GQFIG--P--F-----------IIATFNKNGVEKKRS--FWAHDSCARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYGCKEKGATIGCFESKCSKSFHLPCSQKPASYFKNGVIFWCQTHEAY-------YNKKDTYVNIFNCDGCSKKL----EE----------E-TWFTCVQCA-TSYFSTFDLCVDCYE-K---FP-ADHRHGEEDFEETS---------------LAI---LKEMEAQKATEAAREKEELRAANARKKKKS--LFPRRRR---KL-PDGSTP-VSCCYCGTYEAETWRKGYDGGVIMCNTCFELA---------------------LLIDNDG---DT---NVTDMPLVVDNDGLQQ-----------------------RYVSSIEDYSHKPYFTREAL---S-STK---------FSDA---STGRR------LESYEPQPNQYFSLTFDSSYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKYTVKDERVLSNFLGRG-TDAIECFLLQRRCCGIDINPAAVSLSQRNCCFEIPPGLT-SAEY---RPIVAQADARQLTGSLF-------GDESFHHVLSHPPYKDCVAYSTHIDGDLSRYTHIDDFKVEYNKVVKESWRLLKMSRRLTLGIGDNREHCFYIPVGFHLIRLYIDQGFELEELVIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIGTFKKIPLENIDRMLIKN--EEEEDSAERA--------------SHVRLTSMQRGVPSSAILRKSVVMGTVWVFRPTESFRFSQLCTSRMVERFGKDDNNWEHIELDFSF------QDQPRCEQIESCHA-----------------ET---------------------EKDQ--SIDEEESPLSEYEQQRLRRIEENNKTLLKL--GLISELSEES--NDVIHYENMMDKAP----L------------ED---G-KLVLMITAHQT-----LAPCQ-INLYRKTIVQLAKDATKKLAHHGMLIIGTQDIR----------------NNTSGKLWPMTMLVLEDIERAVDQST-LKLKEMVVTVPDGYSKNR-----------KQNMDEQPD--TEHNEEEIDIE--TVD--DY-VPIVHAV--------------YLVFQRL-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- LCOR_11540.1_Lichtheimia_corymbifera_JMRC:FSU:9682_661176173 -----MSYSDESSASNAFNWQLMSNGQGASPQQTSEQQAFDIHYN-QYQAYYYNHP--HVYNNYAADQNAAYYYHIYYGYHPGVNQHAYHHHHHYPMVENGYSYVENSTSYHHSSSTVEPSAPA--PP------PPPSKSSQSQTQNAPSSSQPRKDEKVSISEG---------------SSSRSHEKTVCNIEVDGAVPTTTTTNRKPVFKQSKFGS--------FMGKLLLVKSKGGNGKDDKGHE----KTNISKKSGL---------HRSAAEKTTSSSRH--HHQQQQNLMATNYFQIRKEVPGADNVVMATNKTEDDN---------------------KEKKW------VS-----DSITAAT----TTANVVIDDAPPSPSSKSTQ----EEDNVT--------QPHDKGNVDVDSLF-----DDVSSLSSLQSDDSVLGPWM-----------------------LDEQDDYNDDEDSQSW----CIDCDAL-PEQQNPLEP-TLMCDGCKTKWMFRTVSLINKIAAAAIHQRRRRPAVRPPKEKRSKVSGQ---QKEKMGNKTTRKRATTPTSGAK---RKAKAQQ--------HREQKLP----PPTKSRASKAI---------QKKTNGGKKKKDEFV--TTGAFMTRNTAKQL------ADPE-HGFHPNPHGFTRMQQVEV-LNFNGHWYRGVLTMMNANRVKVEYIDWQDQ-EEWIIMSSR-RL-RT----IKPDKIMTE---------------------------------SVHEEDRDPLL---------------------P-----QTTSTTENMTMTIKA---------KDDDYVTATLD--TDPHQISNDNEVYLTRRMAKELKDEHGFRINTF----GYRYNRAVAV-TC-KRGVMGKKNIEYLGYLREMRDTQIRVWYP--TLRQSDWLVVGSRRLRLLTSEEEK-ALEEEGKKMESLL-----------------QQPPPQSS--APTKEKNESDAAIS-----MNPSSVSPSTKRRKA-SRSSAKKDTPKPTTCETPSQQPKVSKGKEVETNVSIAPATTAPKSTKKISSQ-ECTTQ-----------------------------------FTTTGAFATRRAMRQLQDEH---GFVPN-PYGYYNNQPIEV--LNTRTAKGKFFWERGHLVGM-----KPGYVK---VRYDGWSDIYDEWFMVGSRKIRPASTE-------------SNEPSASTATTATAAGAGNEGGIA-ATGGDLLCLEDNPELRHE---KRPHRLIGPEDYLQLGYLVP---------------------------------IVDPPPPPPPPPATISTSPLSRSVPTKSKLTRIPDNEN-D--------------
----------------DKDDDAIIDNDDDEDYTCKKRIGR---RRRKRQASATNNRGKRRRRNNTTATTKAQSRKRDREKEEEEDDGEWEEQVFPSKIPI------------STLIRRGRPVDDDDNHGFI-ANVYGY------------------------------DYM------QHVQVLHLDK----------KWYEARLVRMERNMVRVHFCGWIDKFDEYIRVGSRRI------QVIENDHEVECIE--------------------------------------------------------------------------------------------------------------------------------------------PFYKERYESAAYQQCQH-DHQMD---Q-ERAKAAAATAAELAQRMAEMRRSRRR-TL-----------------ENMPTE-----EEGSGDI--D-------------------------------VDGN---KVFCRQCGVIIKQFRYYCTYCESSAE-----DG---TTHSFDLCLLCFDQQFPFWHEHPRSSFAVQAVIDSE-----------------------AGPM-PIKGEL-VTVWEEDVIEDTSAATAADKDT----------------------------EGGETATTTAAAS---KQDHVEEASQVFTG--SSAIDTAEQGYKYLKRWQRRKV------CAFCNDDDDTSEDL-GKFIG--P--F-----------VIATFNKNGVERKRQ--FWVHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYGCKEKGATIGCFESKCNKSFHLPCADKPVNYFRNGLIFFCPTHEAY-------YNKKDTYVNVFKCDGCQKEM----QD----------E-SWYTCLPCA-SSYFRSYDLCGECFD-T---LPDRNHPHDEDDFEETS---------------FAI---LKEVEAEKAREEARAKE--LAAA--KRKKS--LFPK-KR---RL-RAGESPDITCCYCGTTESEEWRKGYDGGIVMCRPCFEMA---------------------LLVDNND---GGRPLISEPNTLINDPVVAAD-----------------------SYVTQIEDYTHKPYLTRDAL---S-STK---------FSNDGKVAPVPR------LSTYEPQPHQLFSLVFDSTYYDIPGRAPRWATHSGTDYHGTWLPQTVRRSILKHTDKDERVLSNFLGRG-TDAIECFLLQRRCVGVDINPAAVALSQRNCCFEIPPGMT-SAEY---RPIIAQADSRHLEGSLF-------GDESFHHILSHPPYKDCVAYSTHLEGDLSRFTNIEEFKMEYVKVVQESWRLLKMGRQLTLGIGDNREHCFYIPVGFRLLRQYIDNGFELEELVIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIATFRKIPLSNMDRMMPQE--TDAQQQPSQ---------------DQIKLSYTQHGVPSSAILRKSVVMGTVWVFKPTEQYRFEQLCMSRMVERFGRDDTNWEQVHLEFQT-------NEAEQQLLLSRGN-----------------ED-------------------NSAMKC--KQKKDESTLSEYEQQRLKRIEENTRMLVQL--GLISELSEES--TDVMHYETMMTKPS----L------------PE---A-PLRLIISAHQPQ----LLAHQ-INAYRQTLMQLARDAVNKLAPQGMLIIGTQDIR----------------SAD-GKLWPMGMLVLEDIERTVDATM-LKLKEMVVAVPDGYSKDR-----------KQETTASLPSSTQLDKEEDIVD--IVD--EH-LPIVHAV--------------YLVFQKL----------------------------------------------------------------------------------
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- LRAMOSA00608_Absidia_idahoensis_var_thermophila_671688888 -----MSSSNESSASNAFNWQLMAHGQGASPQQTSEQQAFDIHYNSQYQSYYYNHP--HAY-DYAVDQHAAYYYQQYYGYYPVVNQHSYH-----PM-SNAYSYMEHSSSY-QSSSTIAPSVPSQLPPTTQVAAPSSSKSSLSQHQNLSSSSQPGKDEKVSISEG---------------SSSKNTEKTLCNIETDGNGTVPMTTNQKSALKQSKFGS--------FMGKLLLVKGKGGS---DKGHD----KTSGSKKSDL---------YRSAAEKTTSLKHH--HQQKQQSLMTANYFQIRKEAPVADNVVV--KKTEENT---------------------KKEKC------VS-----DSTTATTMTTTTKADVVIEDDATQSPSNKRT----SQHNIK--------QQNDKDD-DLDSLF-----DDVSSLSSLQSDDSVLGPWM-----------------------SDD-DNNDDGDDSQPW----CINCDAL-PEQQNALET-RHMCDGCKDKWVFQTVSLINRIAGAAM-QRGKRPT-RPPKEKQNKVFIQ---QKDKMSNSTVR--TSTPTSGTR---RKGKIQQ--------HREQNLP----PPTKSRASKVI---------QKKSSG--KKKDAFV--TTGAFMTRNTAKQL------ADPE-HGFHPNPHGFTRMQQVEV-LNFNGHWYRGVLTEMNANRVKVEYIDWQDQ-EEWIIMSSR-RL-RT----IKPDKIMTD---------------------------------HVQEVDRDPLL---------------------P-----QVTTTAAENMATTKA---------KDDDYVTATLD--TDPHQISNDNEVYLTRRMAQELKDEHGFRLNTF----GYRYNRAVAV-TC-KRGVMGKKNIEYLGYLREMRDTQIRVWYP--TLKQSDWLVVGSRRLRLLTPEEEK-ILEEEGKNMELLL-----------------QQPPPQSS--GQEKEKIQPDAA----------SPVTPSSKRRKGNSRSSARKETSQPATRQVSQQ--KVNKGKSVDINASKA-SLTVTKSINKVPSQ-ECTSQ-----------------------------------FTTTGAFATRRAMRQLQDEH---GFVPN-PYGYYNNQPIEV--LNTRTAKGKFFWERGHLVGM-----KPGYVK---VRYDGWSDIYDEWFMVGSRKIRPASTE-------------TNEASSSSATTTTATG---NEAVT-ATGG
DLLCLEDNPELRHE---KRPHRLIGPEDYLQLGYLVP---------------------------------IVDPPPPPPPPT--------SHPISATSKSTHVFNNENDD------------------------------DDDDDAVIDNDDDEDYTCKKRIGR---RQRKRQTSATSSRVKRRRR------TKAQPKKQVNEKDE---DGEWEEQVFPSKLPI------------STLIRRGRPVDD-DNHGFI-ANVYGY------------------------------DYM------QHVQVLHLDK----------KWYEARLVKMERNMVRVHFCGWIDKFDEYIRVGSRRI------QVIENDHEVECIE--------------------------------------------------------------------------------------------------------------------------------------------PFYKERYESAAYQQCQH-DNQVD---Q-ERAKAAAATAAELAQKMAEMRRSRRR-TL-----------------ENMPTE-----EEGSGDI--D-------------------------------VDGS---KVFCRQCGVIIKQFRYYCTYCESPTE-----DG---IMHSFDLCLLCFDQQFPFWHEHPRSSFAVQAVIDAE-----------------------AGPM-PIKGEL-VTVWEEDVIEDLNSAAAV-KNT----------------------------DGEDTTTTTTTATTSKQADHVEEASQVFTG--SSAIDTAEQGYKYLKRWQRRKV------CAFCNDDDDTSEDL-GKFIG--P--F-----------VIATFNKNGVERKRQ--FWVHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYACKEKGATIGCFESKCNKSFHLPCADKPVNYFRNGLIYFCPTHEAY-------YNKKDTYVNVFKCDGCQKLM----QD----------E-SWYTCLPCA-SSYFRTYDLCAECFD-T---LPDRNHPHDEDDFEETS---------------FAI---LKEVEAEKAREEARAKE--LAAA--KRKKS--LFPK-KR---RL-RAGESPDITCCYCGTTESEEWRKGYDGGIVMCRPCFEMA---------------------LLVDNND---GGRPLISEPNTLINDPVVAAD-----------------------SYVTQIEDYTHKPYLTRDAL---S-STK---------FSNDGKIAHVPR------LSTYEPQPHQLFSLVFDSTYYDIPGRAPRWATHSGTDYHGTWLPQTVRRSILKHTEKDERVLSNFLGREWYKESMC----RRGYQSGKYKAAVALSQRNCCFEIPPGMT-SAEY---RPIIAQADSRHLEGSLF-------GDESFHHILSHPPYKDCVAYSTHLEGDLSRFTNIEDFKMEYIKVVQESWRLLKMGRQLTLGIGDNREHCFYIPVGFRLLRQYIDNGFELEELVIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIATFRKIPLNNIDRMMPQE--TDARQQTTQ---------------DQIKLSYTQHGVPSSAILRKSVVMGTVWVFKPTEQYRFEQLCMSRMVERFGRDDTNWEQVHFEFQT-------NEAEQQLLSSRSG-----------------ED-------------------SSAMKT--KQKDDQDILSEYEQQRLKRIEENTRMLVQL--GLISELSEES--TDVMHYETMMTKPS----L------------PE---A-PLRLIISAHQPQ----LLAHQ-VNAYRQTLMQLAQDAVDKLAPQGMLIIGTQDIR----------------SAD-GKLWPMGMLVLEDIERTVDATM-LKLKEMVVAVPDGYSKDR-----------KQESAASF---TSLEKEE
DIVD--IVD--EH-LPIVHAV--------------YLVFQKL------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- RO3G_02774_Rhizopus_delemar_RA_99-880_384485890 -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MGKLMLKSKQ---QTPA--------KQPTLTEEPFAGLFRM---------RKNRMQDGVLNEETTVTANKFTFRFLPEN-----------DKKNYRI---------------SSNG--RKTRW---D--VSVHKDKQEEQSNSEEIIPKENPKSEKTPSQNQEEDIF-----DDA-----------------------------ESLTSLDSLQSDDDVLGPWI----------------ELGMIH-----PSKKQED---------CKDCIQLNHKDRNPLDS-VRPCSTCTDQWRELFQELVEKI---------QQHTPQTTKKKKVKLSSK---EEETKIPSIKETTKTSSKNNAKFNNKESSEKR--------SRGKKSITI--PPTQSKHSQ--------------STN--RKMMDFV--STGAFMTRKTAETL------ADPE-NGFYPNPYGYIHHQIVEV-LNINGIWYRGTLEMMDKGKVLVKYSDWDDQ-EEWVIMGSR-RL-RI----VPLEIIAKE---------------------------------KDEETKGQQEV---------------------------NSLAIKDAPTLIPRV----KISG-KEDDYVSCTLD--KDPQQLFNDNEVFMTRRMARALVDEHGFRPNSF----GYRRNRPVAV-VF------NIKDKECIGYLREMRKDQVRVWYP--DLHQSEWIVMGSRRLRLLKPEEEE--KYKKEVDLDAQ------------------EVPVPIQKPTEHKEKQPEASKPKKGRSLKKIKAKSVPEPEMASE-EETVASTPPSSVVVKEDQT--------------------------SEKDSIKLSNSSS-----------------------------------FLTTGAFATRRAMRQLQDEN---GFVPN-PYN
YTYNQSVEI--LNTRSGKTH-FWECGKLVAM-----RPGQVK---VHYDGWDDAYDEWIMVGSRRIRVLSKK--------------------------------EEEDKQKRYNDLLVAESNPEVQDEVKRKRKHQVIRPEDYQKLGLLEN-EQV-----------------------------I---KKKKIKEP---------------------------T------------------------------YIESDSSS----EEEFN----------P---KRRSKKASNSHQKK---------KKAATKQPIVEE--EPPV---LEEEKQIIS---------------LRVAQAKAS-EKYEFV-ANVYGY------------------------------DYM------QHITILHLDK----------KLYEGRLVSMHKNKVKVHYCGWPDAFDEYITVGSRRI------QPIENDHQVECNE--------------------------------------------------------------------------------------------------------------------------------------------PDYRERYEKMMQDGPVETCQHKH---Q-PPVSKKLNRKRLTLEDVQDEEGEAAQVEY-----------------YKGPTN-E---DDEIEDTIVV-------------------------------VEMD-SWRVYCNQCNVIIKQFRYYCTYCENPSI-----GH---DYRSFELCLRCFDQNFPFWHEHPRSSFAIQAVIDAE-----------------------IGPR-PIKGEL-VTVWEEDVLEEEPLQEDT------------------------------------------------------NASNIFTG--EIPID-SDQGYKYLKRWKRRKV------CAFCNDDDDTSEEL-GQFIG--P--F-----------VIATYNKNGVEKKRS--FWAHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYTCKEKGATIGCFESKCSKSFHLPCSKKPVSYFKSGVIFWCRIHEAY-------YNKKDTYVNVFNCDGCGKKM----ED----------E-SWFTCVPCSSSSYFSSFDLCSECYE-K---FP-SDHPHNEDEFEETS---------------LAI---LKEMEAQKAREAAKKKEEAREAN--AKKKKKSLFPRKRR---KL-PDGSTP-ISCCYCGTFEAESWRKGYDGGVIMCNPCFELA---------------------LMVDNDE---RP----SSDMPLVIHN--TEQ-----------------------QYMTSIEDYSHKPYFTRDTA---T-KVN---------NDSA---V-GQR------LGSYEPQPNQLFSLTFDSTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKYTSKDERILSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCCFEIPPGLT-SAEY---RPIIAQADSRQLTGALF-------TDESFHHILSHPPYKDCVTYSTHLEGDLSRFTSIEEFNREYTKVVEESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRIYIDEGFELEELVIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIATFRKIPKENIDRMLIKD--DQD---------------------ECIATKRTLHGVPSSAIMRKSVVMGTVWVFKPTEAFRFDQLCISRMVERFGKDDGNWEHIELDLLV-------NLSVDE------------------------ET-----------------------PK--DEEEKEDLISDYEKQRLKRIEENNKTLLKL--GLISELSEKS--DDVIHYENMINKIP----Y------------SD---S-DLVLMVIGHQK-----VEPQS-INAYRKTLV
SIAREATQRLAPKGMLIIGTQDIR----------------DPVNGKLWPMSMLVLEDIERELGRDE-IRLKELVVTVPDGYSKDR-----------QQKFPL-----EQEEEEVIDIE--TID--DF-VPIVHAV--------------YLIFQRL------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- RMCBS344292_14260_Rhizopus_microsporus_729703045 -----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MGKLLLKSNNNNNKSTN--------KQSTLTEEPFANLFRV---------RKNKLQDS--NDDA-----KFTFKYLPDD-----------DKKENRL----------------SDG--KKTRW---D--VG-----QEEEQKKDEVTQKEEPSLEE--QEEDPIDLF----DDDS-----------------------------ESLTSLDTLQSDDDVLGPWM----------------NLA--------SDKKEQE---------CESCIQLNDKDRDPMDA-AHCCSTCTDQWKTLLGDLLEKI---------QSLAVNASAGKKKRVKLD---KSIQEQPSTKK--------------EKQPQQR--------TRGKKSV-I--PPTQSRHSQ--------------STN--KKMETFV--STGAFMTRKTAETL------ADPE-NGFYPNPYGYAHLQVVEV-LNINGIWYRGTLEKMDKGKVKVKYSDWDDQ-E-WIIMGSR-RL-RI----VPPDVIAKE---------------------------------RDEEKKEKD--------------------------------SSKDTLTVIPRA----K--G-RDDDYVSSTLD--KDPQQLFNDNEVFITRRMARAMVDEYGFRPNSF----GYRRNRAVAV-VF------NVKNKECIGFLREMRDNQVRVWYP--DLYQSEWIRVGSRRLRLLSSEEEE--KYKQQVDLDVQ------------------EVPAVIEQ--KKAEDEPKEATPKE--------STSEPKTKKLRQ-KKAKSKAPTP
EPETHQEKQ--------------------------QQKEPQR-PVESS-----------------------------------FLTTGAFATRRAMRQLQDEN---GFVPN-PYNYTYNQPVEI--LNTRSGKTH-FWECGRLVAM-----RPGQVK---VHYDGWDEAYDEWVMVGSRRIRILSKE--------------------------------EEENK-KKHNELLVAEANPEVQDEVKRKRKHQVIRPEDYAKLGLLES-EQT-----------------------------VIKQKKKVSKEV---------------------------T------------------------------PVESDSSSSEEDEEEYE----------EPSVRRRSRKAGKNKKKKA------VKQKQQLQQQKIAE--YSTTTTGAEEEKQIIS---------------LRVAQAKAS-EKYEFV-ANVYGY------------------------------DYM------QHVTVLNLDK----------KMYEARLVSMHKNKVKVHYCGWPDIFDEYITVGSRRI------QPIENDHQVECIE--------------------------------------------------------------------------------------------------------------------------------------------PDYQERYEKVMQDGPTE-CQHQH---Q--PAPKKLNRKRLTLDDVQEEEGQG---EY-----------------HKEPNE-E---EEDIE----V-------------------------------VEMD-SWKVYCNQCNVVIKQFRYYCTYCENPSI-----GY---DYRSFELCLRCFDQNFPFWHEHPRSSFAIQAVIDAE-----------------------AGPR-PIKGEL-VTVWEEDVLEEEHQQEEQ------------------------------------------------------DASSIFTG--ETAIE-GDQGYRFLKRWKRRKV------CAFCNDDDDTSEDL-GSFIG--P--F-----------IIATYNKNGVEKKRS--FWAHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYVCKEKGATIGCFESKCSKSFHLPCSQKPVSYFKSGVIFWCSVHEAY-------YNKKDTYVNVFNCDGCGKKM----EE----------E-SWFTCVPCSSSSYFSSFDLCNECFE-K---FP-KDHPHNEDDFEETS---------------LAI---IKEMEAQKAREAARKKEEAREAN--AKKKL--LFPRKRR---KL-PDGSAP-VSCCYCGTAEAESWRKAYDGGVMMCNPCFELA---------------------LMVDNDE---RP----TSDMPLVIHN--AEQ-----------------------QYMTSIEDYSHKPYFTREAA---I-KID---------SETA---VIGQR------LTSYEPQPNQLFSLAFESTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT-SAEY---RPIIAQADSRQLTGALF-------TDESFHHILSHPPYKDCVTYSTHLDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFICTFRKIPKENIDRMLINE--EDQ---------------------HRIPIKRTLRGVPSSAIMRKSVVMGTVWVFKPTDSFRFDQLCISRMVERFGKDDGNWEHIELDLEE-------PHQKQQSVKPAEL-----------------EK-----------------------GN
--ETNEEQEEISDYERQRLKRIEENNKTLLKL--GLISELSEKS--DDIIHYENMMSKSP----Y------------VE---S-DLVLMIVGHQQ-----ILPRY-INSYRQTLVDIAKEAIQRLAPKGMLIIGAQDIR----------------DPVSGKLWPMSMLILEDIERAVGRDD-IRLKELVVTVPDGYSKDR-----------QQKPRS-----EEEEDEMIDIE--TID--DF-VPIVHAV--------------YLIFQKL------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PARPA_01280.1 scaffold 1359_Parasitella_parasitica_758369443 
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSQNDPDIVYQ------------P-YISDFPDVDKFSATTTTTATTT----EPIDALMALSD------------YAV--NQSNQIGIVPQDNMIHANSVDNL--P---DN-N-FQWITCSPAVNGDNISFV-QNWSISSSSNNEGLHSNNIASLDLTVMQDN---NNLTAHISTI-QTDIIDLTRTNGSSN-------------N--HECKVC-----------IINSMDRGPFD---------------------------KINTCNKCQAQWTNF---------IASTTFKKTND--ASVSQILHDRQVRLTRNMAQELMDDYGFGPNIH----GYRRDRPVFINTC----HQDQKKKDYIGVLQEQKQGKVKVWVP--DFESFEWLPVGSKRLKTMTPQEEQ----EAYMRIGT-------------------TNIPIQDE--APLLPTQVEKTHHK--------------SNKKQPRIKRPPVKS--NSRQKRTRA-------------------SVTAKASKAAVTAS-AAIHG-----------------I-----------------HLTTGNFSKEATKSQSKDND---GFIPN-SYGYAKNRSVQI--LDIENNKIE-AWSHGTLVAM-----RPGFVK---VHYEKGSKRYYEWIDTGSQRIKLLAEE-------------NTAT------------------------ACLMMLDEDSNAENEPKKSGR------QSEKSIGDISQ-----------------------------------QNNARPITSL--------------------------------------------------------------------------------------------------------------------------------------------------------------RIAQIKAS-ETEHFA-PNAYGY------------------------------HYM------QHITVLDSNK----------KYYEARITSLQKNKVKIHYCGWIDKFDELIPLGSKRI------RVLEDDKEADCLE--------------------------------------------------------------------------------------------------------------------------------------------PNYCERYEQSLHDNNPP-CRPNQE--K-ISAVEDLKQYQVAKNKDND----------------------------------------------------------------------------------T-VTKICCSQCNSEIKRFRYYCSYCEAPSP-STP-CSIDQNSQSFQLCPACFDYSFPSWHQHPRSGFAFQ
AMTNDF-----------------------HQEE-EHDTTM---LWEHDILPEQGHNVG---------------------------------------------------MPL-EASKVFTGTEEISAQ-DGNGYLFLQKWKDRKI------CGFCNDDDDNSQEL-GPFIG--P--F-----------T-STSMKLGQEKKRT--VWAHYACARYSPEVSYSAEEKKWYNVTKALKRGRSM-------------------------------------------------------------------------------------------------------------ELTKCLHVFR-Q-ST---------------------------------------------EKISSNA-----------TKKTQ--LFPQRPR---KL-PDGSTS-ISCCYCGTSEAKTWRKGYDGGIMMCEPCFDVV---------------------YAKHSSL---QD-------GLFAVD-----------------------------SYAASIEDYSHKPYFTRDTL---S-LTK---------P------VVGPR------LTSYEPQPNQTFSLTFDSTYFDIPGRAPRWASHSGTDYHGTWLPQTVRRALLKYTKKDERVLSNFLGRG-TDAIECFLLQRKCCGIDINPAAIALSQRNCCFQVPEGLT-FAEY---RPIIALTDARQLNGSLF-------NDESYHHVLSHPPYKDCIAYSTHLDGDLSRFTRIEDFKQEYTRVIRESYRVLKMSRRVTLGIGDNREHCFYVPVGFHLLRLYIDQGFELEELIVKRQRYCSAFGLGTYLCVQFDFLIFTHEFIATFKKIPHHRVDKMTLAL-KTGTSLI------------------QPPVATTVLCMASHLVLFPEKA---------------------DRMIERFGQDDCNWLHVELVTDM--LSR--EHNQQEESGSSEG-----------------LQ-------------------MKI-----SRSTEVETISEYEQERLKKIQENNEILLKL--GLVSDLSQESAVNDSIYCDTLLSRKP----Y------------VH---A-DLAVMATGHIAK----LAPEQ-ITVYRQSVVKLAQDAMVKLPVKGLLIVGTMDIR----------------DEKTGKLWPISMLVLEDIERT--TSG-LKLKELVTTVPEGYSKNR-----------DRTEV----------QQEP----------EH-LPIMSFLAEFNKSIYEPWRIEYVAFEAILNGLRAICESGHWTRQDEEDFESAIRLEAGKVDLFINCKQREIESRVLYCQRTLVQQKSMSEKTRNSTDDTLTDILADINDLTKFTRLNFKALERLIQEHDRLTNTNRQPLLVEVCRTRPLDSQRFDGILVQVSSLLDKCRGRLALDSNTDNNSSNTTKSRRQGESSSARYWVHQDNATEVKAVLLFNLPIFGDDSYKQSERAMSYVYLDNASFSEYTAQLQSDNGAELITCRWDGDIHSASQVFVERHVFVKGGFSTQDGIALNANRLHDFVVTKSYSAEEYAQDLTSVGFDQNYVDSSYTIAKSIQGTIIDKQLKPKLRVQFNRLHFEAPHDKSLSVSLDNDVSLSATLDKPSSIDWLNGFLDNRQRFPYAILETRVQDQEPPPWLSRLLESNLVYEVPRFSICLHGVALLWGPQLPLLPWWLSQIDVDIRTAKKQDKLLVEGASEYSGLTRSNSLRPLIDGQYRMGYLEAQLQKRLPQRRQRSLARHGSQYSSNSSRHQSIVITDETLHAAVDSKEKAPEYVVQLEDANASRLTLQSTPTANDEQEGLRSRSSLKQLQTFSDFYRPQEGGSQAYMLQDPHTIKDNDQMRKAMLTELAQEKEEKKKKKKKQKPPQHTMEPKLFFANERTFINWLQFSALIMTAALTLLNFGDHVSTIAGATFFGISMVIALY
AFFRYRYRAYQMSTRPDIRYDDLFGPVGLCCLLVGAMALNFALRWQHPSASDTYLGVNNKTDEQS HMPREF1544_03082_Mucor_circinelloides_f_circinelloides_1006PhL_511008850 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSQNMEEIVWE------------P-QASHSHK--------TNTAEYI----APTDQVHLHQA------------TDE--LYQRELG--PQGSINQAIAAINQ--P---INIE-HQWISCNSEPIGKNLHSVLLESDNDSSQNSILNQSSSNSEINQTSTRGN---NHVLSSS----------FQPDETSKS-------------I--NDCENC-----------IINNRDRHPFD---------------------------KISICNKCQCQWAHF---------LPNSIDVDSDN--NSVTQMLHDRHIRLTRKKAQELIDDYGFGPNIH----GYRRHRPVKV-AF----PLDKTKSECIGVLCRVHQGKVKVWVP--ELRMVEWLPMGTRRIKLMNPQEEK----DAETMLRG-------------------SNPTIDDV--ELLPDKQDQQAFKV--------------SHSKRASLDPARLKKPSKVQSKQRKL-------------------TELNTVDSQMNTTL-ATSNR-----------------A-----------------YLTTGAFATRRAVHQLKDDN---GFIPN-PFGYAKNQAVQI--LDTKHSKSK-SWYDGTLVEM-----KPGYIK---VHYNQWPETYDEWLMIGSRRIRIADGG-------------SVATEDIC-----------------KTDEHLMVLAEDPDARHEAKRKRRSVGMQQKRQKSTNSRPE-----------------------------------QSSTRPITSK--------------------------------------------------------------------------------------------------------------------------------------------------------------RVAHLKAA-EAEQFV-PNVYGY------------------------------YYM------QHVTVLYCDK----------RHYEARIVGIQKNKVKIHYCGWADEFDELIPNDSKRL------QAIDTTNQVECIE--------------------------------------------------------------------------------------------------------------------------------------------PDYSERDKKAPLSTNSN-ATDNEVLES-IDVFQEKEAKEIINLNASEHGTEEEE------
--------------DIIIVD-----DVTVDNA----------------------------------VVAS-TSKVYCSHCKKDLEQFRYYCTYCEASSS-ASD-QT---NLNSFQLCLACFDQCFPDWHQHPRSGFAIQAITDSP-----------------------KQHQ-NKDTSSSLSIWEEDIMQKQDDCIDK------------------------------------------------TALSL-EASKIFTGVDNTATQ-DEYGYLLLQKWKDRKI------CAFCNDDDDNWQEL-GPFVG--P--F-----------V-SITTKLGQEKKRT--FWAHEACARYSPEV-------------------------------------------SFHLACTNKPINNFRNGVIFWCHVHEAA-------HNKKDTYINIFHCDGCSKRF----SN----------DETWLTCEQCSLVNYFSSFDICNECYN-D-DAVI-GEHQHDKSAYQETS---------------YSL---IERIEARKQIKKEEGKFH------SVKKAQ--LFPRRTR---KLPSRSTTT-TSCCYCGTLEADAWRRGYDGGILMCNTCFGMV---------------------YNKDRPA---ED----ACEGPLAIE-----------------------------SYAASIEDYSHKPYLTRDTV---SSSNK---------P------FIGPR------LTSYEPQSNQLFSLTFDSTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRALLKYTKQNERILSNFLGRG-TDAIECFLLQRKCCGIDINPAAVALSQRNCCFEVPAGLT-FAEY---RPIIALADARQLKGSLF-------GDESYDHILSHPPYKDCVAYSTHLEGDLSRFTQLNDFKAEYTRVIRESYRVLKMDRRLTLGIGDNREHCFYVPVGFHLIRLYIDQGFELEELIVKRQRYCSAFGLGTYLCVQFDFLVFTHEFIATFRKIPQQQVNRMPLTQDEEDSDTA------------------NKPIYATTLYGIPQSAITRKSRVMGTVWTFNLSHQYSFQLLCISRMIERFGQDDCNWLLVELAIDK--TTQ--EHHQQQQRSGSRA-----------------NR-------------------LSIIAI--PSAPEVETISEYEQERQRKIQQNKETLLKL--GLISDLSQDSVVNDSIYCDTLLNKKP----Y------------AH---A-DLVVMATGHIEN----LLPNQ-IDLYRKSIIQLAQDATSQLAVKGMLIIGTKDVR----------------DQTNGKLWPLSMLVLEDVERT--GNG-LKLKEMVITVPEGYSKNK-----------DTFTTE-----SSIEEEPP----------QHLLPIVHAI--------------YLIFQKQ-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ MAM1_0127c06017_Mucor_ambiguus_758351301 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MSGNHREPQCE------------P-QAEYFHE--------RSRAKHA----GPTDQVHLYHA------------SDE--LDQRELG--PLCNINQTTTIKDH--P---KSID-YQWVPYN--PAESSLDLV----QHNFLQNTCSNHSLPNGEINQASTQG----NIALANL----------MQPTKAIES-------------I--IDCKNC-----------ITSSRDRKPFD---------------------------KIALCNQCQCKWINF---------LPSPFNTHSDN--GNVAHMLQHGHKHL----PQELADSSSFGPNVH----GYHRHRLIKI-AF----PKDKTNKECIGVLCRLHQGKVKVWVP--ALQMVEWLPAGTRRIKLMDSEEEK----DAETVLRG-------------------SIPSIHDV--ELLFDEQNQQARGM--------------SQRKRATSESPAQKRSFSAPHKNKEL-------------------AQANTADSQPCIAL-PRTHR-----------------A-----------------YLTTGSFATRKTIHQLKDDS---GFIPN-PFGYAKNQPVQI--LDTKHGRSK-SWYNGTLVEM-----RPGYVK---VHYNQWPETYDEWLMAGSRRIRIADGV-------------SAAVNDMS-----------------KSDEQLMAIAEDPDTQHNAKRKKQSLEKQQKGLKRTSSGYE-----------------------------------QSGARTIASR--------------------------------------------------------------------------------------------------------------------------------------------------------------RLAHLRAA-EVEEFV-PNLYGY------------------------------SYM------QHVNVLYHDK----------RYYEARIVGVQKNKVKVHYCGWTDDFDELIPNGSHRL------QAIDTK---ECLE---------------------------------------------------------------------------------------------
-----------------------------------------------PDNLERDKHMPLAENTI-SNDEAV--S-VNVPQKIKALEATKLGINDQAVEKEE-EGRQYVDLRQLFSFLPLFVDIIMVD-----DVLVEDK----------------------------------VEPD-TAGVKCSHCKAAIEDFRYYCTYCEATST-TCN-VN---NLESFQLCLVCFGHCFPDWHPHPRSGFAIQAITDGP-----------------------RQSQGSRPLSLSSSMWEEDVMETQDECMGE------------------------------------------------AAISL-EASKIFTGVDNITAQ-DKHGYLFLEKWSNRKI------CGFCNDDDDNSQEL-GSFVG--P--F-----------V-STMTKLGQEKKRT--FWVHDACARYSPEVRFSVVDGKWYNVTRALKRGRSMRCFACKEKGATIGCFDSKCSKSFHLSCTNKPVNNFRNGVIFWCHIHEAA-------LEKKDAYINVFHCDGCSKRF----SN----------DETWLTCEQCSLDNYFSSFDLCKECYK-K-NSVL-REHQHERSVYKETS---------------YLQ---LEQAEVLEQIKKGDGKY--------TKKAQ--LFPWRSR---KL-SNGSTP-TSCCYCGTLQADNWRKGYDGGILMCDTCFGMI---------------------YDRQQPT---QD----ASEGSAAIE-----------------------------NYIASIEDYSHKPYFTRETL---S-MNK---------S------LIGSR------LTSYGPQSNQLFSLTFDSTYFDIPGRAPRWATHSGTDYQGTWLPQTVRRALLRHTKKDERILSNFLGRG-TDAIECFLLQRKCCGIDINPVAVALSQRNCCFEVPAGLT-FAKH---RPIIALTDARQLNGSLF-------GDESYHHILSHPPYKDCIAYSTHLEGDLSRFTQLEEFKKEYMRVVQESYRVLKMDRRLTLGIGDNREHCFYVPIGFHLIRLYIDQGFELEELIVKRQRYCSAYGLGTYLCVQFDFLIFTHEFIATFKKVPQQQVNKMPLMQ--EGLPTT------------------NAPEYTTTLYGIPHSAIARNSRVMGTVWTFKPSHQYSFQILCISRMVNRFGQDDCNWLHVELAIDM--TTQ--EHCQQQQDTESRA-----------------DR-------------------LQITCN--PSTMKVHPISEYEHKRQRKIQENRETLLKL--GLISDLSQDSVVIDSIFCDTMLNKKP----Y------------PH---A-DLVIMATGHIEN----LLPNQ-IDMYRKSIIQLAQDATRQLAVKGRLVIGTKDVR----------------DQISGKLWPISMLVLEDIERT--SHGLLKLKEMVITVPEGYAKDK-----------NAFTAK-----PLIEEGNP----------AH-LPIVHAI--------------YLVFQK---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- RirG_262840_Rhizophagus_irregularis_DAOM_197198w_595436939 MYLPYENFEQNVEATSTSNWQL---NRTETPPGVNPHHLLPP----PPQNYFPQRPIVSTF-NNAHQINPPLYFRPPNQSHDKFKKNKRGFIRHNEVTNNQYAYTSDQNLK-NNSYSASPRFNS--HS-GYQTWHAPNNNNNSNWSIANSTECQYAPKQFDRQNRQMFPLGTQPYPIYHFSQQYSHHPVPNYSYPQQPNNCQYFPSQRPVQHFYQHGTQPNDFHQAIIPQFPTYPTDNNNENVENVEN-IT-QAKKSKRNKKKKVV-----EHSKNNINTVNDQI--IPKPSSEFIKQNDNESKKRIRPSDNERSSTVPLSNDS---LTKSFHIYE---------NDGRK------LF-----TKKKNVNKIEINLKERVSKDKINRLNNNSSKSFLADDENYE--VFGI--KKKIKLDVEIDSIL-SV--HDSTSSENIQILQQVNTDML----------SSKGISNISEV--VNDDSYSNIIDSSESI----IFDDSLSQNENSSRRDS-GVACDNME----------LDDM------VIKEKDINDPPITISNTVLENSYYNSNPIINNSET----INDIVVS---NFASLDA--------NETINL-----ESKLTKYTKLI------ADSSLNNNL--LNSNNEF--QENSLVSSNSIPEI------SA------TPILSPIIRNESTNI-ISNSENEIQREKDEKISNQSKLSY-----E-SDTPVVSET-LE-DS----FTSTNVTND-DFAIPIDKKAG----NDLKESLPINHLKNNS-EDIQSPKESFI-IVDDSSEFEDSDIKLTGNENP-----DVDSTAESTNSFSKVDFSYAKLRLKPEETIQVTCDDKENPNHSSNNSTKIEKGKFSGTIFTRRSIRDLAFAKFEGYSVNQRVKVL--------NVDSIWYPGTIIAMDKTKVRVRFDGWDAKYDEWVSKDSRRLRVMSDEEIL-EIQENSSQNNYEM----IKQHLIDKI----EKRESISI--TTVTEDPQTDFSLK-----SKRTKKRPTKKQTSSGDNTKKVRGSPKLQLSQTKK---SIDHKQKKNN------SHSRPKSPTNVVKK-KCNSC-ENIFDK----------------------------LEKVGALELCTKCVPLFAPQ-LTRIKAF-AHDFTLDQKVQV--LNIDK-----IWYPARIVNV-----EKSRVK---VHFDGWGKRFDEWISVESRRLKALSED-E-----------VNEVQKENHLSDDADNRQNKSNQD-QNQTKASTSESSPKFDSV-DDKPKKDSVCSRDIDKERQLNA--------YKLKE--------------------ILGSDSESLSSV----PDSDSDSLSSCTESISSSSLSLSA------------------------------SESHYNSLSSSSDDEFEVKM-------PKIKRTVRHNNNKTPIRKK------ILQSSKEKICESCK--IAHLNVQRIGSLDLCT---------------YCRSLFGDD-ATFRFKRGGQYGF------------------------------ELH------KRVKVLSRDG----------EWYPATMVDVEDSRIRVHFDGWSD
YFDEWIPAGSQRMRDMTLEEIIEAQKALEQLDKETLKQREIIYKPQKRRRSNLKRTIQSSTVTPLNRVTKSMDSTETANNKVDVDSSLPDQQSDHSTDSTISFDWNEYYIGRDTRRSLRNDKLLSNSDVLAKLKYRFKPGNQIEVRDRLKEWVPATIIETKGCRVLIHYDDVPAFYDEWIDISSERLRE-KCAKN---E-IKEVAKNVKSVTTSLETKKDLKKKKD-KV-----------------ENDDYRLEFVVNGALNGR--L-------------------------------ITANDKWCIYCDQCNVVIKQFRYFCTYCESRSE-----GN---DYKSFELCVWCFAHQFPNYHEHPRCSFAVQSVIDDE-----------------------AIKM-SSKGEV-VKTFERDVFDTTYKEPEF---------------------------------------------------------DITSD--KMPLD-TDMGYLYLQAWNMRKI------CGFCNDDD--TNQL-GGFIA--PYPF-----------VSNTYTRYG-EKQKT--FWSHYACAKYSPEV-FFTKSNEWYNVTLAWKRGRSMKCGKCKERGATIGCFEPKCAKSYHLSCTDKPLSHFEMGVIFYCPSHEAR-------YNQKELYNEVYRCDVCSCEL----QE----------D-KWMTCRPCE-SNFFSSFDLCLQCFEVK---FP--EHEHKKDEFEETS---------------VKK---IKDAQITKQATLAVANQKARNAG--MRKKS-------KN---SL-QNKGGR-IQCSYCWAEESSRWRKGYNG-VLMCEDCFELV---------------------LVNNNTG---EPQ---------------EKD-----------------------KLLVTSEDYSYQPYLTRNFC---S-DKK---------FDDFE--SQAMY------LDSYEPVENQLFSLSFDSSYFDIPGRAPRWATHSGTDYHGTWLPQTVRRALLRFTKKNGKVLSNFLGRG-TDAIESFLLGRRLVGVDINPAAVALSQRNCSFAIPPNRDITAEH---RPIILQADSRNLTGPMF-------EVESYDHILSHPPYKNCVEYSTHIDGDLSRFANSREFAIEMSKVIDESWRLLKPGRRVTLGIGDNREHCFYVPVSFNLFRQYIDQGFELEELVAKRQRYCQAFGLGTYLCVQYDFLMFTHEFIATFKKVDKAHNNRMLVTP--DESTLS------------------GTVIFSRNLREIPVLPIARKSVVMGTTWTFKPTRTHSFVQLCTSRMVERFGRDFANWEEIQIKFNN---------MEPNNIANDNS-----------------SD----K--------------LTKFED--QIDNDEEDMPEYERIRQKRIKENQKMLLSL--GLKCDLGETS--DDISHLEKILHSMP----L------------PP---PVPTALIVVPHIPN--NLLTSQI-IPIYRTAIKHLAKEAYERLPPSGFFVIGGQDVR----------------TSD-NKLWPLSMLFMEDVNNSVGEDK-MPLKELVVTVPEGYAKDK-----------KKITKK-----DDYIEEQCILD--EEDKIEH-LSIVHAC--------------YLIFMKL---R-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Spun1000004719_Spizellomyces_punctatus_Spun1000004719 ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MPTKSRPSICEMARR-------------YVCRRNG--QSPLQAFER----FCYPLALSVCRHAGLRFRTL--------HVKLPV-----TGVTGLCNVLCMPANSNTCSARIC------SILGNDAPSTRSARMSVFQTLPRS----------------K--------------ENPNNHSR--RMLRGMGQKRTAGC--CC-VVQEDRDPFDTQIPCTSCCQFFASCLPSFFSQD-----LVKKKKKV--------SRKSIKGRKSSSAN-ERDDD-----------------------------------FLEPVILPPIINSRKL-------ALPAN-CPAFHIDAIVEV-----RDGAK--TWWPGRVVTV-----QSGKVC---VHYDGWGDQYDEWIDCESQRIRLATQM-------------PADQCSERISQENEHG---------QTGEAGIGPDAVILVGRSVREKRKKSAAYTNQKNAKRKKAN----------------S------------------RTNVTVVNPI----------------------------------------------------------TSKSTESATPARPQGFN------------AQQPRSSSTSPVDNFGI------FRAAANREAREAYI-------------------
--------------SRRALANDA-TADDM--KRLYGK---G-----------------------------------ARVEVCCAGG----------ERYMATVIKTRSWQVLVQYDGWDEAWNEWIDMNSSKM------KLVEAA---------------------------------------------------------------------------------------------------------------------------------------------------------------------SGENN----------------------SSEDGNSSA--------------------GECSSE-----EDDEQ-----------------------------------------KWKIFCNRCEKRIRQYRYFCTYCEVPSE-----GF---EYESFDLCLACFQQDFPLDHPHPIQSFAVEPLLDTD-------DP--------------TRRK-FKDGEL-VSTFVLDEFDTSYIAMGT------------------------------------------------NQSQI-DAETVVTA--IPMVP------------------R----CAFCHSER--TDIV-GPFIG--PHPFRNTRISGRRMPLPSSEKKNSGKNRRVPIFWAHDACARFSPEVYFMKDSGKWHNVLKALARGRGVKCAACKERGATIGCFDVRCTRSYHVGCTRKPLSQFEEGVIFWCPRHESL-------VNKADNYKDVYNCDVCSNSLGLSDND----------E-QWHTCDECA-QNHFNTFDLCKECFE-G-R-FP-ETHDHGKDRFITTC---------------MSQRKEIREMEQQLARELVAAN---------ASRKS--LGQRRKK---KL-ERASG--IRCAYCWIDSSSRWRKGYNG-IPMCEDCFQMA--SSAFASSKAPELCATPEA-IPSPSPS---ES----LSQSPVNAPTPVPVR----IEADSPAPVLDPTSMER--VYRTEIEAYSHEYYLTRGVV-G-K-ASG---------ADEIGA-AEVNKSSEFGILQSYAPTDDQLFTMGFDTSFYDIPGRAPRWATHSGGDYHGTWLPQIVRMSLLRYTSEGERVLSNFSGRG-TDAIECFLLKRRCCSVDINPASVALSQRNVSFSVPPELGLTAAY---RPVIVLADSRELIGSLF-------EDESYDHILSHPPYKDCVSYSAHIEGDLSHFPDMEDFQKEMEKIVAETWRLLKPNRRCTLGIGDNRRECFYQPVSFQTIRTYINDGFELEELIVKRQRYCQMAPLGTYLCTQYNFLMFTHEFIAILRKVDDRQHSGLFSYLKVDDDHD-------------------FHVNPTRILRVIPAAPIDRTSIVMGTVWTFRVTQKHSLARLAMSKLIERFGTDSAYWEEVSISEFR-----------NKVIRAAVY-----------------DD--------LCAERD-----PPEEED--EEEENGTEVTEYERRRREQLSKNTRELLSM--GLISELSPEGE-DDAKHLETLLAMPP----VQTIQHEGCPTMHPP---S-SPVIIFVPHINAPSTAILPHAWINEYRKFVIDCARDAAARLTDGGYFIIGVKDARIFLPPKTDPSSENEQPCGIQTKYVPLGLLVSEDLSRYFEGSE-MRLKDFVVAVPEGYSRDKGIEFEEMKARIDEDEQE-WK--SEQEKEANGQS--NVR--RL-LPIVQAY--------------YFIYAKQ--------------AGNRKLSEPSS--------------------------------------------------------------------------------------------------------------------------------------------------------------
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- RMATCC62417_10446_Rhizopus_microsporus_727142291 --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MQDGPTE-CQHQH-----QPAPKKLNRKRLTLDDVQEEEGQG---EY-----------------HKEPNE-E---EEDIE----V-------------------------------VEMD-SWKVYCNQCNVVIKQFRYYCTYCENPSI-----GY---DYRSFELCLRCFDQNFPFWHEHPRSSFAIQAVIDAE-----------------------AGPR-PIKGEL-VTVWEEDVLEEEHQQEEQ------------------------------------------------------DASSIFTG--ETAIE-GDQGYRFLKRWKRRKV------CAFCNDDDDTSEDL-GSFIG--P--F-----------IIATYNKNGVEKKRS--FWAHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYVCKEKGATIGCFESKCSKSFHLPCSQKPVSYFKSGVIFWCSVHEAY-------YNKKDTYVNVFNCDGCGKKM----EE----------E-SWFTCVPCSSSSYFSSFDLCNECFE-K---FP-EDHPHNEDDFEETS---------------LAI---IKEMEAQKAREAARKKEEAREAN--AKKKL--LFPRKRR---KL-PDGSAP-VSCCYCGTAEAESWRKAYDGGVMMCNPCFELA---------------------LMVDNDE---RP----TSDMPLVIHN--AEQ-----------------------QYMTSIEDYSHKPYFTREAA---I-KID---------SETA---VVGQR------LTSYEPQPNQLFSLAFESTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT-SAEY---RPIIAQADSRQLTGALF-------TDESFHHILSHPPYKDCVTYSTHLDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFICTFRKIPKENIDRMLINE--EDQ---------------------HRIPIKRTLRGVPSSAIMRKSVVMGTVWVFKPTDSFRFDQLCISRMVERFGKDDGNWEHIELDLEE-------PHQKQQSVKPA------------------------------------------ELEKGNETNEEQEEISDYERQRLKRIEENNKTLLKL--GLISELSEKS--DDIIHYENMMSKSP----Y------------VE---S-DLVLMIVGHQQ-----ILPRY-INSYRQTLVDIAKEAIQRLAPKGMLIIGAQDIR----------------DPVSGKLWPMSMLILEDIERAVGRDD-IRLKELVVTVPDGYSKDR-----------QQKPRS-----EEEEDEMIDIE--TID--DF-VPIVHAV--------------YLIFQKL--------------------------------------------------------------------
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- RMATCC62417_10446_Rhizopus_microsporus_727142293 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MQDGPTE-CQHQH-----QPAPKKLNRKRLTLDDVQEEEGQG---EY-----------------HKEPNE-E---EEDIE----V-------------------------------VEMD-SWKVYCNQCNVVIKQFRYYCTYCENPSI-----GY---DYRSFELCLRCFDQNFPFWHEHPRSSFAIQAVIDAE-----------------------AGPR-PIKGEL-VTVWEEDVLEEEHQQEEQ------------------------------------------------------DASSIFTG--ETAIE-GDQGYRFLKRWKRRKV------CAFCNDDDDTSEDL-GSFIG--P--F-----------IIATYNKNGVEKKRS--FWAHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYVCKEKGATIGCFESKCSKSFHLPCSQKPVSYFKSGVIFWCSVHEAY-------YNKKDTYVNVFNCDGCGKKM----EE----------E-SWFTCVPCSSSSYFSSFDLCNECFE-K---FP-EDHPHNEDDFEETS---------------LAI---IKEMEAQKAREAARKKEEAREAN--AKKKL--LFPRKRR---KL-PDGSAP-VSCCYCGTAEAESWRKAYDGGVMMCNPCFELA---------------------LMVDNDE---RP----TSDMPLVIHN--AEQ-----------------------QYMTSIEDYSHKPYFTREAA---I-KID---------SETA---VVGQR------LTSYEPQPNQLFSLAFESTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT-SAEY---RPIIAQADSRQLTGALF-------TDESFHHILSHPPYKDCVTYSTHLDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFICTFRKIPKENIDRMLINE--EDQ---------------------HRIPIKRTLRGVPSSAIMRKSVVMGTVWVFKPTDSFRFDQLCISRMVERFGKDDGNWEHIELDLEV-------KIYKMSFT----Y---VF---------------------------------IYIYHF--RSLIKSNSL---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ RMATCC62417_10446_Rhizopus_microsporus_727142292 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MQDGPTE-CQHQH-----QPAPKKLNRKRLTLDDVQEEEGQG---EY-----------------HKEPNE-E---EEDIE----V-------------------------------VEMD-SWKVYCNQCNVVIKQFRYYCTYCENPSI-----GY---DYRSFELCLRCFDQNFPFWHEHPRSSFAIQAVIDAE-----------------------AGPR-PIKGEL-VTVWEEDVLEEEHQQEEQ------------------------------------------------------DASSIFTG--ETAIE-GDQGYRFLKRWKRRKV------CAFCNDDDDTSEDL-GSFIG--P--F-----------IIATYNKNGVEKKRS--FWAHDACARYSPEV-FCTPEGKWYNVTLALRRGRGMRCYVCKEKGATIGCFESKCSKSFHLPCSQKPVSYFKSGVIFWCSVHEAY-------YNKKDTYVNVFNCDGCGKKM----EE----------E-SWFTCVPCSSSSYFSSFDLCNECFE-K---FP-EDHPHNEDDFEETS---------------LAI---IKEMEAQKAREAARKKEEAREAN--AKKKL--LFPRKRR---KL-PDGSAP-VSCCYCGTAEAESWRKAYDGGVMMCNPCFELA---------------------LMVDNDE---RP----TSDMPLVIHN--AEQ-----------------------QYMTSIEDYSHKPYFTREAA---I-KID---------SETA---VVGQR------LTSYEPQPNQLFSLAFESTYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT-SAEY---RPIIAQADSRQLTGALF-------TDESFHHILSHPPYKDCVTYSTHLDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFICTFRKIPKENIDRMLINE--EDQ---------------------HRIPIKRTLRGVPSSAIMRKSVVMGTVWVFKPTDSFRFDQLCISRMVERFGKDDGNWEHIELDLEE-------PHQKQQSVKPGLY---FV---------------------------------LLIFHS-------------------------------------------------------------------------------------------------------------
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Ccor1000008322_Conidiobolus_coronatus_Ccor1000008322 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MKCTKCRMMGATVGCFNAKCPRIYHLTCCDKNPKLFLQGYIFYCPKHEAI-------ENKKQTYEEYYHCDHCKNSL-PR-ANTLGPFPDYSPD-EWFTCQACVEENFFSGFDLCTECFT-H-K-FKSIKHNHKANRFIRTTKEKLEVLLDVLANNKLTL-R-NDKLKGRKSLKNDKLNSIENIKD-IAKPED--SAPKKRKIIKKI-VQYQPN-IHCSYCWSTSSTIWRRGYMG-VLLCSKCFMNT--------STDKNLASIATQ-STSEVNE---DD----QDSEEIIIDV--VND----------LPSPPAQQDKF--GYHGNYEDYIHQPYHTRNLP-----QLNCLKYDPSQIAESS---VMANK--AIH-LETYGPTIYQAFSLDYKSTYYDIPGSAPRWASHSGSDYHGTWLPQTVRRAITRYTKEGDMVLSNFLGRG-TDAIECFLLKRKCIGIDINPVAVSLSQKNISFALPPSLLANSEFKYHRPTIIQGDARNLFNIL--------TNESISHVLSHPPYKDCVEYSNNIDGDLSKFSTNMEFCKEMQNVVNETWRVLKMGGRCTLGIGDNRDQCFYQPVSFDLLLLYMETGFQVEEIIVKRQRQCRAFGLGTFLCVKYDFLMFTHEFIITLRKVPITSRGSMS-----RNMKK-------------------SKFFQQ------------------------------------------------------------------------------------------------------------------
------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ GLOINDRAFT_316719_Rhizophagus_irregularis_DAOM_181602_552908586 
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MRKKSKN---SL-QNKGGR-IQCSYCWAEESSRWRKGYNG-VLMCEDCFELV---------------------LVNNNTG---EP----------------QEK-D---------------------KLLVTSEDYSYQPYLTRNFC---S-DKK---------FDDFE--SQAMY------LDSYEPVENQLFSLSFDSSYFDIPGRAPRWATHSGTDYHGTWLPQTVRRALLRFTKKNGKVLSNFLGRG-TDAIESFLLGRRLVGVDINPAAVALSQRNCSFAIPPNRDITAEH---RPIILQADSRNLTGPMF-------EVESYDHILSHPPYKNCVEYSTHIDGDLSRFANSREFAIEMSKVIDESWRLLKPGRRVTLGIGDNREHCFYVPVSFNLFRQYIDQGFELEELVAKRQRYCQAFGLGTYLCVQYDFLMFTHEFIATFKKVDKAHNNRMLVTP--DESTLS------------------GTVIFSRNLREIPVLPIARKSVVMGTTWTFKPTRTHSFVQLCTSRMVERFGRDFANWEEIQIKFNNM------EPNNIANDNSSDK--------------------------------------LTKFED--QIDNDEEDMPEYERIRQKRIKENQKMLLSL--GLKCDLGETS--DDISHLEKILHSMP----L--------P---PP-V---PTALIVVPHIPNN--LLTSQI-IPIYRTAIKHLAKEAYERLPPSGFFVIGGQDVR----------------TSD-NKLWPLSMLFMEDVNNSVGEDK-MPLKELVVTVPEGYAKDK-----------KKITKK-----DDYIEEQCILD--EEDKIEH-LSIVHAC--------------YLIFMKL---------R----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
----------------------------------------------------------------- Uram1000000474_Umbelopsis_ramanniana_Uram1000000474 ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MCEACFELA-SLD-----------------LIQDELPLVLDE----------------EAD-H---------------------TYAASIEDYSHKPYLTREAL---S-STK---------FDDMK--SNAVR------LASYAPVEHQLFSLSFDSTYFDIPGRAPRWASHSGTDYHGTWLPQTVRRAILRHTRKDDRILSNFLGRG-TDAIECFLLQRRCCGVDINPAAVSLSQRSCSFETPPGLT-TAEH---RPIIVQADSRKLTGALF-------ADESYDHVLSHPPYKDCVAYSLHIEGDLSRYTNPLDFQEQYDKCVRESWRLLKMDRRLTLGIGDNREHCFYIPVGFQLIRLYINNGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIATFRKVPRNSNDKM-DSF--DASNTK------------------SQMRITYTCREIPRSPIARKSVVMGTVWLFKPSSRHSFAQLCTSRMVERFGKDESNWEHVELEIISD-------NDDSSLKPSLAN--------------------------------------IATDSM--VVEDEGLSISSYEIERQKRIDENRLALLQLVSSLSTPFIHIN--DDLTYL-FCMAGVD----I--------R---PQ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Alignment 2 <-----Synapomorphic strand-helix unit--------> Str-1 Str-2* Str-3 Str-4 **** Str-5 Str-6 Str-7 | FINAL -HHHHHHHHHHHHHHHH-------EEEEE----------------------------------------------------------------------------------------E---EEEEE---------------------HHHHHHHHHH------EEEEE------HHHHHHHHH---EEEEE--HHHHHHHHHHHHH------------------------------------------------EEEEEE--HHHHHHHH------------EEEEEE------EEEE----------------HHHHHHHHHHHHHHHHHHHHH---EEEEEEE------EEE--HHHHHHHHHHH--HHHHHHHHH----H------HHHH---HH-----------HHH---H----EEEEHHHHEEEEE-|--------------------------------------------- ALIGN ------E-----------------EEEEH------H--HH--------H-----------------------------------------------------------------E-EE---EEEE---------------------------EEEEEE-----EEEEE-------HHHHHHHH---EEEEE--HHHHHHHHH----------------------------------------------------EEEEE-----------------------EEEEE------EEEEE--------------H-HHHHHHHHHHHHHHHHHHHHH----EEEEE------------HHHHHHHHHHH-HHHHHHHHHH----H-------------EE-----------------HH--HHHHHHHHHHHHHH-|--------------------------------------------- HMM ------E--H-HHHHHH-------EEEEE-------------------HH---------------------------------------------------------------EE-EE--EEEEEE--------EE-----EEEEE-HHHHHHHHHHE-----EEEE-------HHHHHHHHH---EEEEEE-HHHHHHHHHHH--------------------------------------------------EEEEEEE-HHHH---E-----------EEEEEEE----EEEEEE--------HHHH----HHHHHHHHHHHHHHHHHHHHH---EEEEEEEEE----EEEEEHHHHHHHHHHHH-HEHHEEEEE----EEE----HHHH---HH-----------HHH---H----EEEHHHHHHEEEEE|--------------------------------------------- FREQ 
-HHHHHHHHHHHHHHHH--------EEEE----------------------------------------------------------------------------------------EEE--EEEE------------------------HHHHHHE------EEE--------HHHHHHHHH----EEE---HHHHHHHHHHH--------------------------------------------------EEEEE---HHHHHHHH------------EEEEE-------EEEE-----------------HHHHHHHHHHHHHHHHHHHH----EEEEE--------E----HHHHHHHHHH-HHHHHHHHHH------------HH----HH-----------HHH---HH--HHHHHHHHHHEEE--|--------------------------------------------- PSSM -------HHHHHHHHH--------EEEEE----------------------------------------------------------------------------------------------EE----------------------HHHHHHHHHHH-----EEEE-------HHHHHHHH----EEEEE--HHHHHHHHHHHHHHHH-----------------------------------------------EEEE----HHHHHH--------------EEEEE------------------------HHHHHHHHHHHHHHHHHHHHH----EEEEEE------------HHHHHHHHHH----HHHHHHHH----H---------------------------------------EEE--EEEEEEE|--------------------------------------------- RMATCC62417_10446_Rhizopus_microsporus_727142292 QQYMTSIEDYSHKPYFTRE----AAIKID------S-ETA--V-----VGQ-----R--------L---TS----Y--EP------------------------QPN-QL---FS-LAFESTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFTDESFHHILSHPPYKDCVTYS-TH---LDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFICTFRK|----I------------------P---K-ENIDRM---------L------- RMATCC62417_10446_Rhizopus_microsporus_727142293 
QQYMTSIEDYSHKPYFTRE----AAIKID------S-ETA--V-----VGQ-----R--------L---TS----Y--EP------------------------QPN-QL---FS-LAFESTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFTDESFHHILSHPPYKDCVTYS-TH---LDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFICTFRK|----I------------------P---K-ENIDRM---------L------- RMATCC62417_10446_Rhizopus_microsporus_727142291 QQYMTSIEDYSHKPYFTRE----AAIKID------S-ETA--V-----VGQ-----R--------L---TS----Y--EP------------------------QPN-QL---FS-LAFESTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFTDESFHHILSHPPYKDCVTYS-TH---LDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFICTFRK|----I------------------P---K-ENIDRM---------L------- RMCBS344292_14260_Rhizopus_microsporus_729703045 QQYMTSIEDYSHKPYFTRE----AAIKID------S-ETA--V-----IGQ-----R--------L---TS----Y--EP------------------------QPN-QL---FS-LAFESTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKHTTKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFTDESFHHILSHPPYKDCVTYS-TH---LDGDLSRFTSVEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFICTFRK|----I------------------P---K-ENIDRM---------L------- RMCBS344292_09167_Rhizopus_microsporus_729708575 
QQYMTSIEDYSHKPYFTRE----AAIKID------S-ETA--V-----VGQ-----R--------L---TS----Y--EP------------------------QPN-QL---FS-LTFESTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKHTAKDERVLSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCAFEVPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFTDESFHHILSHPPYKDCVTYS-TH---LDGDLSRFTTIEEFNKEYAKVVSESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRLYIDEGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFICTFRK|----I------------------P---K-ENVDRM---------L------- RO3G_02774_Rhizopus_delemar_RA_99-880_384485890 QQYMTSIEDYSHKPYFTRD----TATKVN------N-DSA--V------GQ-----R--------L---GS----Y--EP------------------------QPN-QL---FS-LTFDSTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKYTSKDERILSNFLGRG-TDAIECFLLQRRCIGVDINPAAISLSQRNCCFEIPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFTDESFHHILSHPPYKDCVTYS-TH---LEGDLSRFTSIEEFNREYTKVVEESWRLLKMSRRLTLGIGDNREHCFYIPVGYHMFRIYIDEGFELEELVIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFIATFRK|----I------------------P---K-ENIDRM---------L------- HMPREF1544_03082_Mucor_circinelloides_f_circinelloides_1006PhL_511008850 ESYAASIEDYSHKPYLTRD----T---VS------SSNKP--F-----IGP-----R--------L---TS----Y--EP------------------------QSN-QL---FS-LTFDSTYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRALLKYTKQNERILSNFLGRG-TDAIECFLLQRKCCGIDINPAAVALSQRNCCFEVPAGLT----------------------------------FAE---YRPIIALADARQLK-----GS--LFGDESYDHILSHPPYKDCVAYS-TH---LEGDLSRFTQLNDFKAEYTRVIRESYRVLKMDRRLTLGIGDNREHCFYVPVGFHLIRLYIDQGFELEELIVKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFIATFRK|----I------------------P---Q-QQVNRM---------P------- MAM1_0127c06017_Mucor_ambiguus_758351301 
ENYIASIEDYSHKPYFTRE----T---LS------M-NKS--L-----IGS-----R--------L---TS----Y--GP------------------------QSN-QL---FS-LTFDSTYFDIPGRAPRWATHS-GTDYQGTWLPQTVRRALLRHTKKDERILSNFLGRG-TDAIECFLLQRKCCGIDINPVAVALSQRNCCFEVPAGLT----------------------------------FAK---HRPIIALTDARQLN-----GS--LFGDESYHHILSHPPYKDCIAYS-TH---LEGDLSRFTQLEEFKKEYMRVVQESYRVLKMDRRLTLGIGDNREHCFYVPIGFHLIRLYIDQGFELEELIVKR----QRYCS-AYGLG---TY-----------LCV---QF--DFLIFTHEFIATFKK|----V------------------P---Q-QQVNKM---------P------- GLOINDRAFT_316719_Rhizophagus_irregularis_DAOM_181602_552908586 DKLLVTSEDYSYQPYLTRN----FCSDKK------F-DDF--E-----SQA---M-Y--------L---DS----Y--EP------------------------VEN-QL---FS-LSFDSSYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRALLRFTKKNGKVLSNFLGRG-TDAIESFLLGRRLVGVDINPAAVALSQRNCSFAIPPNRDI---------------------------------TAE---HRPIILQADSRNLT-----GP--MFEVESYDHILSHPPYKNCVEYS-TH---IDGDLSRFANSREFAIEMSKVIDESWRLLKPGRRVTLGIGDNREHCFYVPVSFNLFRQYIDQGFELEELVAKR----QRYCQ-AFGLG---TY-----------LCV---QY--DFLMFTHEFIATFKK|----V------------------D---K-AHNNRM---------L------- LCOR_11540.1_Lichtheimia_corymbifera_JMRC:FSU:9682_661176173 DSYVTQIEDYTHKPYLTRD----ALSSTK------F-SND---------GKVAPVPR--------L---ST----Y--EP------------------------QPH-QL---FS-LVFDSTYYDIPGRAPRWATHS-GTDYHGTWLPQTVRRSILKHTDKDERVLSNFLGRG-TDAIECFLLQRRCVGVDINPAAVALSQRNCCFEIPPGMT----------------------------------SAE---YRPIIAQADSRHLE-----GS--LFGDESFHHILSHPPYKDCVAYS-TH---LEGDLSRFTNIEEFKMEYVKVVQESWRLLKMGRQLTLGIGDNREHCFYIPVGFRLLRQYIDNGFELEELVIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFIATFRK|----I------------------P---L-SNMDRM---------M------- MVEG_09762_Mortierella_verticillata_NRRL_6337_672819038 
GRYATSAEDYSHTPYLTRT----SVSAVR------F-DHS--S-----SQA---V-Y--------L---DS----Y--GP------------------------SEN-QL---YS-LPIDTTYYDIPGRAPRWATHS-GTDYHGTWLPQTVRRAVTKYTNPNDKILSNFLGRG-TDAIECFLLGRCCTAVDINPAAITLSIRNCSFAIPPNGTV---------------------------------KAE---HRPTILQGDSRKLT-----GP--LFESESFDHVLSHPPYKDCVAYS-TH---IDGDLSRFGNSIEFQREMTHVVQETYRLLKMGRRCTLGIGDNREHCFYIPVSFQLIRQYINQGFELEELIVKR----QRYCA-MFGLG---TY-----------LCV---QF--DFLCFTHEFIATLRK|----V------------------P---K-QGHDTM---------I------- LRAMOSA00608_Absidia_idahoensis_var_thermophila_671688888 DSYVTQIEDYTHKPYLTRD----ALSSTK------F-SND---------GKIAHVPR--------L---ST----Y--EP------------------------QPH-QL---FS-LVFDSTYYDIPGRAPRWATHS-GTDYHGTWLPQTVRRSILKHTEKDERVLSNFLGREWYKESMC----RRGYQSGKYKAAVALSQRNCCFEIPPGMT----------------------------------SAE---YRPIIAQADSRHLE-----GS--LFGDESFHHILSHPPYKDCVAYS-TH---LEGDLSRFTNIEDFKMEYIKVVQESWRLLKMGRQLTLGIGDNREHCFYIPVGFRLLRQYIDNGFELEELVIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFIATFRK|----I------------------P---L-NNIDRM---------M------- RirG_262840_Rhizophagus_irregularis_DAOM_197198w_595436939 DKLLVTSEDYSYQPYLTRN----FCSDKK------F-DDF--E-----SQA---M-Y--------L---DS----Y--EP------------------------VEN-QL---FS-LSFDSSYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRALLRFTKKNGKVLSNFLGRG-TDAIESFLLGRRLVGVDINPAAVALSQRNCSFAIPPNRDI---------------------------------TAE---HRPIILQADSRNLT-----GP--MFEVESYDHILSHPPYKNCVEYS-TH---IDGDLSRFANSREFAIEMSKVIDESWRLLKPGRRVTLGIGDNREHCFYVPVSFNLFRQYIDQGFELEELVAKR----QRYCQ-AFGLG---TY-----------LCV---QY--DFLMFTHEFIATFKK|----V------------------D---K-AHNNRM---------L------- PARPA_01280.1 scaffold 1359_Parasitella_parasitica_758369443 
DSYAASIEDYSHKPYFTRD----T---LS------L-TKP--V-----VGP-----R--------L---TS----Y--EP------------------------QPN-QT---FS-LTFDSTYFDIPGRAPRWASHS-GTDYHGTWLPQTVRRALLKYTKKDERVLSNFLGRG-TDAIECFLLQRKCCGIDINPAAIALSQRNCCFQVPEGLT----------------------------------FAE---YRPIIALTDARQLN-----GS--LFNDESYHHVLSHPPYKDCIAYS-TH---LDGDLSRFTRIEDFKQEYTRVIRESYRVLKMSRRVTLGIGDNREHCFYVPVGFHLLRLYIDQGFELEELIVKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLIFTHEFIATFKK|----I------------------P---H-HRVDKM---------T------- Bcir1000010688_Backusella_circina_Bcir1000010688 QRYVSSIEDYSHKPYFTRE----ALSSTK------F-SDA--S-----TGR-----R--------L---ES----Y--EP------------------------QPN-QY---FS-LTFDSSYFDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKYTVKDERVLSNFLGRG-TDAIECFLLQRRCCGIDINPAAVSLSQRNCCFEIPPGLT----------------------------------SAE---YRPIVAQADARQLT-----GS--LFGDESFHHVLSHPPYKDCVAYS-TH---IDGDLSRYTHIDDFKVEYNKVVKESWRLLKMSRRLTLGIGDNREHCFYIPVGFHLIRLYIDQGFELEELVIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFIGTFKK|----I------------------P---L-ENIDRM---------L------- Pbla1000013272_Phycomyces_blakesleeanus_Pbla1000013272 HRYVMSIEDYTHKPYLTRD----AVSATK------F-SDH--R-----TGP-----R--------L---AS----Y--GP------------------------QPN-QL---FS-LVFDSTYYDIPGRAPRWATHS-GTDYHGTWLPQTVRRAILKYTNKDERVLSNFLGRG-TDAIECFLLQRRCCGVDINPAAVALSQRNCCFEVPPGLT----------------------------------SAE---YRPIIAQADSRQLT-----GA--LFADESFHHVLSHPPYKDCVAYS-TH---LEGDLSRFTSVEDFRAEYGRVVRESWRLLKMGRRLTLGIGDNREHCFYIPVGFHLLREYINHGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFVATFRK|----I------------------P---L-ECTDKM---------L------- Uram1000000474_Umbelopsis_ramanniana_Uram1000000474 
HTYAASIEDYSHKPYLTRE----ALSSTK------F-DDM--K-----SNA---V-R--------L---AS----Y--AP------------------------VEH-QL---FS-LSFDSTYFDIPGRAPRWASHS-GTDYHGTWLPQTVRRAILRHTRKDDRILSNFLGRG-TDAIECFLLQRRCCGVDINPAAVSLSQRSCSFETPPGLT----------------------------------TAE---HRPIIVQADSRKLT-----GA--LFADESYDHVLSHPPYKDCVAYS-LH---IEGDLSRYTNPLDFQEQYDKCVRESWRLLKMDRRLTLGIGDNREHCFYIPVGFQLIRLYINNGFELEELIIKR----QRYCS-AFGLG---TY-----------LCV---QF--DFLVFTHEFIATFRK|----V------------------P---R-NSNDKM---------D------- Crev1000002507_Coemansia_reversa_Crev1000002507 QQQGALIEDYTQGIYFTRE----ACIAPNRVGLPSV-SQQ--P-----LGE--------------L---SS----Y--GP------------------------TDS-ML---FT-LPVNTSYFDIPGRAPRWASHS-GTDYHGTWLPQTVRRALLRYTQRGEHVLSNFLGRG-TDAIECFLLNRKCVGVDINPSAVSLSQRNCSFTITPGCGM---------------------------------SIE---FRPTIMQGDARDLRSDLWPGASYFAESESFDHILSHPPYKDCVLYS-TN---IDGDLSRFPGPDEFQREMEKVVTESWRLLKMGRHLTLGIGDNRAECFYIPVSYQLIRTYISSGFELEELVVKR----QRYCQ-AFGLG---TY-----------LCV---QF--DFLMFTHEFIATLRK|----V------------------P---K-DQIDSM---------H------- Spun1000004719_Spizellomyces_punctatus_Spun1000004719 RVYRTEIEAYSHEYYLTRG----VVGKAS--------GAD--E-----IGA-AEV-NKSS-EFGIL---QS----Y--AP------------------------TDD-QL---FT-MGFDTSFYDIPGRAPRWATHS-GGDYHGTWLPQIVRMSLLRYTSEGERVLSNFSGRG-TDAIECFLLKRRCCSVDINPASVALSQRNVSFSVPPELGL---------------------------------TAA---YRPVIVLADSRELIGS-------LFEDESYDHILSHPPYKDCVSYS-AH---IEGDLSHFPDMEDFQKEMEKIVAETWRLLKPNRRCTLGIGDNRRECFYQPVSFQTIRTYINDGFELEELIVKR----QRYCQ-MAPLG---TY-----------LCT---QY--NFLMFTHEFIAILRK|----V------------------D---D-RQHSGL---------F------- Ccor1000008322_Conidiobolus_coronatus_Ccor1000008322 
FGYHGNYEDYIHQPYHTRNLPQLNCLKYD-PSQI-A-ESS--V-----MAN-----K-AI-H---L---ET----Y--GP------------------------TIY-QA---FS-LDYKSTYYDIPGSAPRWASHS-GSDYHGTWLPQTVRRAITRYTKEGDMVLSNFLGRG-TDAIECFLLKRKCIGIDINPVAVSLSQKNISFALPPSLLANSE------------------------------FKY---HRPTIIQGDARNLFN--------ILTNESISHVLSHPPYKDCVEYS-NN---IDGDLSKFSTNMEFCKEMQNVVNETWRVLKMGGRCTLGIGDNRDQCFYQPVSFDLLLLYMETGFQVEEIIVKR----QRQCR-AFGLG---TF-----------LCV---KY--DFLMFTHEFIITLRK|----V------------------P---I-TSRGSM---------S------- DESFE_RS02250_Desulfurococcus_fermentans_504580152 ---MRLVNYNEYLNHVSKR----NTVIVE--------GEE----IE--LKP--------I-K---V---KR----M----------------------------TPL-PE-E-LL-DSS-STVWSFPKRG-SWATH--RGDYRGNWPPQVARLLIERYSDPGNIVLDPMIGSG-TTCIEAKLLGRNCIGVDISYEAVILTLHRLYWLEKTLENPPDDAGS---------------------IDL-ENARR---AVVEIYHGDARRLS---------RVRDGTIDLVITHPPYFNIIKYS-S--R-VDGDLSRASSLEEYLKWFNEAAGEIYRVLKPGGHLGILIGDTRIRKYYVPISHHVLEILLRRGFILREEVVKI----QHKMKTTREVW------------S---RLK---DR--DFLLIYHEKLYVMRK|-----------------------P-----RNQEEY--EKYKY---------- SPHMEL_RS03490_Desulfurococcus_amylolyticus_756979360 ---MRLVDYNEYLNYVSKR----NTVIVE--------GEE----IE--LKP--------I-K---V---KR----M----------------------------MPL-PE-E-LP-DSS-STVWSFPKRG-TWATH--RGDYRGNWPPQVARLLIERYSNPGDIVLDPMIGSG-TTCIEAKLLGRNCIGVDISYEAVILTLHRLYWLEKTLENPPNDACS---------------------IDL-ENARR---AVVEIYHGDARRLS---------RVRDETIDLVITHPPYFNIIKYS-S--R-VDGDLSRASSLEEYLKWFNEATGEIYRVLKPGGHLGILIGDTRIRKYYVPISHHVLEILLRRGFILREEVVKI----QHKMKTTREVW------------S---RLK---DR--DFLLIYHEKLYVMRK|-----------------------P-----RNQEEY--EKYKY---------- DKAM_RS02320_Desulfurococcus_kamchatkensis_501637311 
---MRLVDYNEYLNYVSKR----NTVIVE--------GEE----IE--LKP--------I-K---V---KR----M----------------------------MPL-PE-E-LP-DSS-STVWSFPKRG-TWATH--RGDYRGNWPPQVARLLIERYSNPGDIVLDPMIGSG-TTCIEAKLLGRNCIGVDISYEAVILTLHRLYWLEKTLENPPNDAGS---------------------IDL-ENARR---AVVEIYHGDARRLS---------RVRDETIDLVITHPPYFNIIKYS-S--R-VDGDLSRASSLEEYLKWFNEATGEIYRVLKPGGHLGILIGDTRIRKYYVPISHHVLEILLRRGFILREEVVKI----QHKMKTTREVW------------S---RLK---DR--DFLLIYHEKLYVMRK|-----------------------P-----RNQEEY--EKYKY---------- DICTH_1800_Dictyoglomus_thermophilum_H-6-12_206739986 YKDMKEITKEDYIRFVEEN----EFVIIE--------DVK----VK--LNK--------N-W---D-I-KS----Y----------------------------SPP----ENYT-PEK-TTVWSFPDRG-SWATH--KGNYRGNWSPYIPRNLILKYTAKGDWVLDQMMGSG-TTLVEAKLLERNAIGVDINLDAVMVALDRLNFSYNPLFP---------------------------------KYSE---PIIKTYWGDARNLN---------KIEDNSIDLIATHPPYAGIISYT-KN-KKQSDDLSQL-PLEEYLKEMEKVAEESFRVLKPGKVCAILIGDTRKHKYYVPIAYRVMQVFLEVGFILKEDIIKL----QWNMKATRERW----------------RAK---EY--EFYLIGHEHIFVFRK|-----------------------P-----EDEKEY--KKYKF---------- Smar_0588_Staphylothermus_marinus_500164270 VS-MRKVSYEDYLEFLKNN----KVIEIE--------GSK----IS--LEP--------I-H---V---KH----L----------------------------YPL-PE-E-LT-DIS-TTVWSFPKRG-SWATH--RGNYRGNWPPQMARALIQKYTMPGDTVLDPMIGSG-TTCIEAKLLGRNCIGVDINYNALMLTLHRLYWLEK-YLEKKA------------SKPQEIIEGENSPISI---EDILN-AKVEIYHGDARNLD---------KISNNSIDLVATHPPYFNIIRYS-RGEK-IPGDLSGARKLEEYLSMIQQVISEAYRVLKPGHYMGILVGDTRIHKHYVPITHYVLQTLLKTGFILKEEVVKI----QHKMKTTREVW------------S---KLK---NK--DFLLIYHEKLFILRK|-----------------------P-----INKKEY--RKYKY---------- Shell_0210_Staphylothermus_hellenicus_502907573 
AG-MRRVSYDDYLEFLKNN----RAVEIE--------GNR----IS--LEP--------I-R---V---KR----L----------------------------YPL-PQ-E-LT-DIS-TTVWSFPKRG-SWATH--RGDYRGNWPPQMARALILAYTMPGETVLDPMIGSG-TTCIEAKLLGRNCIGVDINYNAVILTLHRLYWLEK-YLEKQA------------S-TQEIFGGEYSPVSI---EDILK-ARVEIYHGDARNLD---------KISSNSIDLVATHPPYYNIIRYS-RTKK-IPGDLSGARRLEEYLAMIQQVGKEAFRVLKPGRILGILIGDTRIHKHYVPITHHVLETLLKTGFILKEEVVKI----QHKMKTTREIW------------S---KLK---NK--DFLLIYHEKLFILRK|-----------------------P-----IDKKEY--RKYKY---------- D891_RS0103440_Hippea_sp_KM1_643385301 ---MREIKQEDYLEFIKNH----SEVVIG--------NSA----VK--LEG--------N-F---VII-SN----C----------------------------SPS----ENYI-PER-TSVWSFPDRG-KWATH--RGNYRGNWSPYIPRNLILKYTEKNDWVLDQMMGSG-TTLVEAKLLQRNAIGIDINLEAVMVSRDRLNFSCDSSVHNDY--R----------------------------E-----PIIKTYWGDARNLD---------KIDNNSIDLIATHPPYANIISYS-RK-KKIESDLSSM-PLKKYISGMKEVARESYRVLKPGKICSILIGDARKHKHYIPISNMIMEIFLNSGFILKEDIIKI----QWNMKATRENW----------------RAK---QY--DFFLIAHEHIFVFRK|-----------------------P-----ENEKDL--KKHRF---------- DICTH_RS08720_Dictyoglomus_thermophilum_754082338 ---MKEITKEDYIRFVEEN----EFVIIE--------DVK----VK--LNK--------N-W---D-I-KS----Y----------------------------SPP----ENYT-PEK-TTVWSFPDRG-SWATH--KGNYRGNWSPYIPRNLILKYTAKGDWVLDQMMGSG-TTLVEAKLLERNAIGVDINLDAVMVALDRLNFSYN-PLFPKY--S----------------------------E-----PIIKTYWGDARNLN---------KIEDNSIDLIATHPPYAGIISYT-KN-KKQSDDLSQL-PLEEYLKEMEKVAEESFRVLKPGKVCAILIGDTRKHKYYVPIAYRVMQVFLEVGFILKEDIIKL----QWNMKATRERW----------------RAK---EY--EFYLIGHEHIFVFRK|-----------------------P-----EDEKEY--KKYKF---------- _Thermofilum_sp_1910b_530785168 
---MRLVTREEYLEYIKTH----RTVRIE--------DEE----IP--IGK--------P-H---R-I-EK----Y----------------------------APD----D-SA-LET-TTVWSFPDRG-DWATH--RGDYRGNWAPQIPRNLILRYSRPGETVLDQMCGSG-TTLVESKLLGRNAIGVDINYEAVMLTMDRLNFSYR-PLDPDY--R----------------------------E-----PEIRVYHGDARNLN---------LIEDESIDLIATHPPYANIISYS-KA-KRIEGDLSQVYSLEEYLQGIREVAKESFRVLKPGRYCAILIGDTRRHRHYVPIAFRVMQQFLEVGFILREDIIKI----QWNTKTTEKKWARLAKTSEENWID-KPENK---KHWTDFYLIAHEHLFVFRK|-----------------------P-----AEGEDI--EKYRD---------- CSUB_C0599_Candidatus_Caldiarchaeum_subterraneum_526884977 ---MRPVTWEDYRRYVAEK----GYVEVE--------DVR----IE--IGK--------P-H---K-I-NS----Y----------------------------SPP----ASYN-LEA-TTVWSFPDRG-DWATH--SGDYRGNWSPYVPRNLILKYTKPGELVLDQMVGSG-TTLVEATLLGRNAIGVDINYEACILTLDRLNFEFH-PL-DEQ--Q----------------------------E-----PVIKVYHGDAAKLN---------IIEDESVDLIATHPPYWNIIPYS-RK-RPEGDLSAYR-KLEDYLGKMMQIARESYRVLKPGRYCAILIGDTRKHKHYVPISTYVMLKFLQAGFVLAEDIIKL----QHKMKTTREKW----------------SGKNFEQY--GFHKIAHEHLYIFRK|-----------------------P-----ENEEER--GRLSL---------- I774_RS01625_Aigarchaeota_archaeon_JGI_0000106-J15_756971970 ---MREITDEDYREFVKTH----REIMVE--------NVK----IR--IGQ--------E-R---R-I-YE----Y----------------------------QPR----D-FV-LET-TNVWSFPERG-AWATH--QGNFRGNWPPQLVRNIILRYSKPGETVLDQMCGSG-TTLIECKLLGRNAIGVDINLDCVMLTRDRLNFEYT-LLDADY--P----------------------------R-----VTIKTYVGDARNLN---------LVEDDSIHLIATHPPYVNIIPYS-RR-KEIEGDLSAVHSIDEYIEGMRKIAEECYRVLKPGRFCTILVGDIRRHRHHVPVAFRTMQVFLESGFILREDIIKH----QWKTKTTREKWEGLTKVAEECWVDIDRKVR---KYYMDFYLLYYEHLFIFRK|-----------------------P-----DKNENL--DQYKD----S----- Mpt1_c10100_Candidatus_Methanoplasma_termitum_731481703 
LYISKDIVYQPMIEILDSK----AEDVVV--------ENKKIVPWE--IYK--------K-G---K-I-SG----I----------------------------QPS----D-FK-LER-TTVWSFPDRG-DWASH--TPQYRGNWSPRVVRNIIELYSKPGDLVLDPMVGGG-TTPVECMLTGRNSISIDINQGAISITRNRLELPESMK----------------------------------KQIPK---TVHRTFIGDVRNLD---------KIADESIDLIATHPPYANIIKYA-PS---VDGDLSQINDYDVFFSEFKKAIKEFHRVLKPGAYCSILMGDTHNRSHFVPITARLMFDFLKEGFVLKEDIIKK----EWNCE-SDRNL------------G---KYS---NS--SFLLTMHEHLFVFRK|-----------------------L-N---KGEGAL--KNSSR----S-F-FE _Thermogladius_cellulolyticus_504550199 MVGMREVPISEYLEFVSRN----REVVVE--------DQV----IR--LDP--------I-E---V---KR----L----------------------------EPL-PE-E-LT-DVS-TTVWSFPVRG-GWATH--RGDYRGNWAPQIPRALILMYTRPGDVVLDPMVGSG-TTLIEAKLLGRNSIGVDINYNAVMLALHRLYYLEK-AAL------EYLKRLREGA--GAVGGGAGPAFGDAMPEDVER-AWYKVYHGDARSLS---------LLGSESVDLVATHPPYFNIIDYG-GGER-PEGDLSAARDLEEYLRWVREVAGELYRVLKPGKYCAVLVGDTRVHKHYVPISHYVLQAFLDAGFLLKEEVIKV----QHKMKTTREVW------------S---RVK---NR--DFLLIYHEKLFILRK|-----------------------P-----GQSEESRPSRLKYSGR------- DESMU_RS04730_Desulfurococcus_mucosus_503327799 ---MREVTVGEYLDFVSRN----RRIIVG--------GQE----VD--LSP--------I-E---V---RR----L----------------------------EPS-AD-E-LT-DVS-TTIWSFPKRG-SWATH--RGDYRGNWPPQMARALILGYTEPGEIVLDPMAGSG-TTCIEAVLLGRKCIAVDINYNAVMLTHHRLYYLVN-ARL------------KQGALPGLDAGGEGTGV-----------QGYRVFHGDARRLD---------EIRDNTVDLVATHPPYFNIIGYG-GN---VDGDLSNARTLEEYLEWLREVAGEIYRVLKPGRYCGILIGDTRVHGHYVPITHYALEVFLDAGFILKEEVIKI----QHKMKTTREVW------------N---RLR---KR--NFLLIYHEKLFVFRK|-----------------------P-----GVGEDT--GKLRYSMKLP----- Tagg_1290_Thermosphaera_aggregans_502895170 
MY-VREVTVEEYLDFVSRN----TSITID--------GQS----IP--LKP--------I-K---V---NR----L----------------------------DPS-PQ-E-LP-DVS-TTVWSFPKRG-SWATH--KGDYRGNWPPQIPRALILKYTSEGDVVLDPMVGSG-TTCIEAVLLGRNCIGVDLNYHAVMLTHHRLYYLVK-AEL---------------------------SRGREPGR-----AWYKIYHGDARRLD---------KIRDDTVDLVLTHPPYLNIVRYG-EE-R-SEGDLSAVRGLEEFLVLFKEIAREVYRVLKPGKTLAVLVGDTRIKKHYVPLTHYVLLTLLDTGFVMMEEVVKI----QHKMKTTREVW------------S---RLR---NR--DFLLIYHEKLFILRK|-----------------------P-----VDRE----PKVKYSG-------- METIN_RS02115_Methanocaldococcus_infernus_502864863 ---MKEVTYDDYFEFIKEH----SYVTIE--------DTK----LE--IGK--------D-W---K-I-KK----F----------------------------QPD----N-FE-LEP-TNVWSFPKRG-DWATHYLNSKYRGNWAPQVARNLILRYSKEGETVLDPFVGSG-TTLIEAKLLFRNAIGVDINRDAVMLTLDRLRFNYNPL-------D-------------------------INEKPK---TWIKVFVGDARNLD---------KIEDESIDLIATHPPYVNIVKYT-KK-SEVDGDLSKVRSVEDFVNEMRKVAREFFRVLKPGRYCAILIGDTRRNKHHVPVSFRVMQAFLEEGFILKEDIIKI----QHNMR-VTPLW---KK-----------RSQ---EL--NFLLLKYEHLFVFRK|-----------------------P-----ESDEKL--SKFKE----S----- MBMB1_RS02390_Methanobacterium_sp_MB1_746331486 ---MKEKTHEDYNSFLKNN----RFIVIE--------DGKKTLELK--IGK--------K-H---D-P-IE----F----------------------------APE----D-FK-LEI-VNVWSFPKRG-KWATH--GGEYRGNWAPEIPRNILLRYSEAGDVVLDQFLGSG-TTLIECKLLGRKGIGIDVNLNAIMLTRDRLNFNYNPF-------E----------------------------IPI---YEQKTFMGDARDLD---------LIKNESIDLIATHPPYANIIRYS-KD-K-IPEDISNVKNIDEYIKEMEKVASESYRVLKNGKHAAILVGDTRRNKHHIPVAFRVMQAFLEAGFILREDIIKV----QHQMK-GTTFW---AK-----------RSQ---EL--NFLLLKHEHLFVFRK|-----------------------P-----EKDEKT--GKYKF----S----- MBMB1_0500_Methanobacterium_sp_MB1_557946003 
HYYMKEKTHEDYNSFLKNN----RFIVIE--------DGKKTLELK--IGK--------K-H---D-P-IE----F----------------------------APE----D-FK-LEI-VNVWSFPKRG-KWATH--GGEYRGNWAPEIPRNILLRYSEAGDVVLDQFLGSG-TTLIECKLLGRKGIGIDVNLNAIMLTRDRLNFNYNPF-------E----------------------------IPI---YEQKTFMGDARDLD---------LIKNESIDLIATHPPYANIIRYS-KD-K-IPEDISNVKNIDEYIKEMEKVASESYRVLKNGKHAAILVGDTRRNKHHIPVAFRVMQAFLEAGFILREDIIKV----QHQMK-GTTFW---AK-----------RSQ---EL--NFLLLKHEHLFVFRK|-----------------------P-----EKDEKT--GKYKF----S----- FACI_RS02255_Ferroplasma_acidarmanus_518679720 ---MKRITLDDYNNYKKLN----DIVTIE--------DNK----IK--IGE--------K-N---I-I-ET----L----------------------------EPE----K-FN-LEI-DNVWSFPERG-KWCTHYLNAKYRGNYAPQLPRNIILRYSKENDLILDPFSGSG-TTLIEAKLLKRHGIGMDINLGSAMITMDRLNFNNSEN-------N----------------------------L-----IEPEIFNGDARNLN---------EIEDESIDLIMTHPPYANIIKYS-KD-NIIKDDLSSIESLEEYYKKFKKVIKEMHRTLKKGKYCAILIGDTRKKGYQIPISFTIMQLFLKEGFVLKEDIIKV----QHNTK-TRHYW---AS-----------LSI---KN--NFMLLAYEHLFVFKK|-----------------------L---------------------------- BJBARM5_0369_Candidatus_Parvarchaeum_acidophilus_ARMAN-5_290559536 ---MKEVTLDNFRDFARTH----NSVKIE--------DNT----IE--IGM--------Q-K---Q-I-TM----L----------------------------QPT----D-FS-PET-TTVWSFPKRG-DWATHYLNSKYRGNWAPQIPRNLILEYTNPEDIVLDPMNGSG-TTLIECKLLGRNGIGVDINEEAIMIALDRLNFQAHEL-------P----------------------------S-----SEIKTFVGDARNLN---------LIKDNAIDLILTHPPYVNIISYT-YN-R-VEGDLSSISSVSEFIEEINKLAVEFFRVIKPGKYCAILMGDTRRHSHYIPVTFRTMQAFLEAGFALKEDIIKL----QWNMQSTRQNW---AG-----------------KQ--NFYKIAHEHLFVFRK|-----------------------P-----THDERL--SELKE---------- I759_RS06660_Euryarchaeota_archaeon_SCGC_AAA252-I15_754482757 
--------------------------------------------------------------------------------------------------------------------------MWSFPKRG-DWATH--RGDYRGNWAPEIPRNLILRYSTEGDTVLDQMVGGG-TTLIECKLLGRNGIGVDINSDAIMITRDRLRFDSIDE-------N----------------------------FPE---TGQKTYVGNARNLD---------KISDESIDLIATHPPYLNIIPYT-QE-Q-VKGDLSSVHDLNEFAEEMKIVAQESIRVLKSGKYCGILIGDTRRHKHYVPISARILQAFLKAGFILKEDIIKQ----QWNCK-ATGFW---KK-----------KSQ---ES--NFLLIMHEHLYVFRK|-----------------------P-----EKDEET--TRLKD----S----- _EM3_bacterium_JGI_0000106-B10_658542249 -------------------------------------------------------------------M-EK----F----------------------------EPE----N-FK-LET-TTVWSFPERG-EWATH--KGNYRANWSPYIPRNLILRYTQEGDLVLDQMVGSG-TTLIECKLLNRRGIGVDINHDAIMVTRNRLDFKYK------Y-------------------------------D-----PEIKTYVGDARNLN---------LIPDETIDLIATHPPYANIVKFS-NN-R-IEGDLSNVKNIDEFINEMIKVARESYRVLKPGKHCAILIGDTRKRKHFVPIATRVLEVFLKVGFILREDVIKL----QWKMKGTREKW----------------RGS---KY--DFLLLAHEHLFIFRK|-----------------------P-----GKDEKL--TLFKD----SII--- H17AP60334_RS10630_Thermosipho_africanus_490206260 ---------------------------------------------------------------------------M----------------------------EIN----N--K-LEI-TTVWSFPERG-KWETH--NSKYRGNFAPQIPRNLILKYSKEGEVILDPMVGSG-TTLIEAKILNRKSLGYDINPKSVEITKQNLEFQGD------Y-------------------------------K-----YEPIVKVGDARNLS---------EIEDNTIDLIITHPPYLNIIKYS-EG-T-IEGDLSNISNVEKFIKEIDKIAKELFRVLKENKYCAILIGDTRKRGHYVPLSYYVLKAFLNNGFVLKEDIIKV----QHNCK-STPYW---EK-----------QVK---KY--NFHLIMHEHLFIFRK|-----------------------P-----SKNENL--SPIKF----S-TNYL TMEL_RS01630_Thermosipho_melanesiensis_501003459 
---------------------------------------------------------------------------M----------------------------DNF----D--K-LEI-TTVWSFPKRG-KWKTH--NSRYRGNFAPQIPRNVILRYSNESETILDPMVGSG-TTLIEAKILNRKSIGYDINPESIELTKRNLNFEGN------Y-------------------------------K-----YEPAVKIGDARNLY---------EIKNETIDLIITHPPYLNIIKYS-SG-K-IKQDLSNISDVNKFILEFEKIVKELYRVLKENKYCAILIGDTRRKGHYIPLSFYVMKIFLKNRFVLKEDIIKI----QHNCQ-STPFW---EK-----------QVK---KY--NFYLIMHEHLFVFRK|-----------------------P-----KKDENL--THIKY----S-TGLF _Candidatus_Calescibacterium_nevadense_551115149 ---MREITEEDYRTFLKTH----DFVIIE--------NVK----VP--LIK--------E-H---K-I-EK----F----------------------------EPE----N-FK-LET-TTVWSFPERG-EWATH--KGNYRANWSPYIPRNLILRYTQEGDLVLDQMVGSG-TTLIECKLLNRRGIGVDINPDAIMVTRNRLDFKYK------Y-------------------------------D-----PEIKTYVGDARNLN---------LIPDETIDLIATHPPYANIVKFS-NN-R-IEGDLSNVKNIDEFINEMIKVARESYRVLKPGKHCAILIGDTRKRKHFVPIATRVLEVFLKVGFILREDVIKL----QWKMKGTREKW----------------RGS---KY--DFLLLAHEHLFIFRK|-----------------------P-----GKDEKL--TLFKD----SII--- TTHWC1_RS07155_Thermoanaerobacter_489963634 ---------------------------------------------------------------------------M----------------------------QDI----D-FK-KEI-TTVWSFPERG-KWKTH--KGNYRGNFAPQIPRNVILRYSQEGDFVLDPMVGSG-TTLIETKILNRRGIGFDINPDSVELTKRNLDFDGD------Y-------------------------------K-----YEQVVRVGDVRNLK---------EISDISIDLIITHPPYLNIIKYS-NG-R-IEGDLSNISDVKKFCDELEKGVIELYRVLKEDRYCAILIGDTRKSGHYIPLSYYVMRLFLKNGFVLKEDIIKV----QHNCK-STPYW---ER-----------QVE---KY--NFYMIMHEHLFIFRK|-----------------------P-----KKDENL--NKIKY----S-TGLY THYS13_RS12835_Thermoanaerobacter_sp_YS13_757582161 
---------------------------------------------------------------------------M----------------------------QDI----D-FK-KEI-TTVWSFPERG-KWKTH--KGNYRGNFAPQIPRNVILRYSHEGDFVLDPMVGSG-TTLIETKILNRRGIGFDINPDSVELTKRNLDFDGN------Y-------------------------------K-----YEQIVRVGDVRNLK---------DIGDSSIDLIITHPPYLNIIKYS-NG-T-IEGDLSNISDVKKFCDELKKGVIELYRVLKEDRYCAILIGDTRKSGHYIPLSYYVMRLFLKNGFVLKEDIIKV----QHNCK-STPYW---ER-----------QVE---KY--NFYMIMHEHLFIFRK|-----------------------P-----KKDENL--NKIKY----S-TGLY M663_RS0111020_Thermoanaerobacter_sp_A7A_658480004 ---------------------------------------------------------------------------M----------------------------QDI----D-FK-KEI-TTVWSFPERG-KWKTH--KGNYRGNFAPQIPRNVILRYSHEGDFVLDPMVGSG-TTLIETKILNRRGIGFDINPDSVELTKRNLDFDGD------Y-------------------------------K-----YEQIVRVGDVRNLK---------EIGDSSIDLIITHPPYLNIIKYS-NG-R-IKGDLSNISDVKKFCNELEKGVIELYRVLKEDRYCAILIGDTRKSGHYIPLSYYVMRLFLKNGFVLKEDIIKV----QHNCK-STPYW---ER-----------QVE---KY--NFYMIMHEHLFIFRK|-----------------------P-----KKDENL--NKIKY----S-TGLY THIT_RS02440_Thermoanaerobacter_italicus_502759633 ---------------------------------------------------------------------------M----------------------------QDI----D-FK-KEI-TTVWSFPERG-KWKTH--KGDYRGNFAPQIPRNVILRYSQEGDFVLDPMVGSG-TTLIETKILNRRGIGFDINPDSVELTKRNLDFDGD------Y-------------------------------K-----YEQIVRVGDVRNLK---------EISDSSIDLIITHPPYLNIIKYS-NG-R-IEGDLSNISDVKKFCDELEKGVIELYRVLKEDRYCAILIGDTRKSGHYIPLSYYVMRLFLKNGFVLKEDIIKV----QHNCK-STPYW---EK-----------QVE---KY--NFYMIMHEHLFIFRK|-----------------------P-----KKDENL--NKIKY----S-TGLY TKV_c04890_Thermoanaerobacter_kivui_694165517 
---------------------------------------------------------------------------M----------------------------QNI----E-FR-KEI-TTVWSFPERG-NWKTH--NGSYRGNFAPQIPRNVILRYSNEGDIVLDPMVGSG-TTLIEAKLLNRRGIGFDINPESVELAKRNLEFDGE------Y-------------------------------K-----YEQIVRVGDVRNLK---------EISDSSIDLIITHPPYLNIIKYS-NG-R-IEGDLSNISDVKKFCDELEKGVIELYRVLKEDRYCAILIGDTRKSGHYIPLSYYVMRLFLKNGFVLKEDIIKV----QHNCK-STPYW---EK-----------QVE---KY--NFYMIMHEHLFIFRK|-----------------------P-----KKDENL--NKIKY----S-TGLY HYDTH_RS09530_Hydrogenobacter_thermophilus_502729540 ---MKEITMNDYLEFIKEN----DFVIIE--------SVK----VK--LNK--------T-W---S-I-KS----Y----------------------------GPK----E-YF-PEK-TTVWSFPNRG-SWATH--KGNYRGNWSPYVPRNLILKYTNKGDWVLDQMMGSG-TTLVEAKLLERNAIGVDINLDAVMVALDRLNFP--------Y--G----------------------------Q-----STIKTYWGDARNLD---------KIESQSIDLIATHPPYANMISYT-KN-KKLSDDLSLL-SPEEYLKEMRKVAEESYRVLKPGKVCAILIGDTRKYKHYVPIAFRVMQVFLEAGFILREDIIKL----QWKMKATREKW----------------RAK---EY--DFYLIAHEHIFVFRK|-----------------------P-----EKEEEY--RKYKL----S----- HGMM_F16H05C22_uncultured_Aquificae_bacterium_374851611 ---MKEITMNDYLEFIKEN----DFVIIE--------SVK----VK--LNK--------T-W---S-I-KS----Y----------------------------GPK----E-YF-PEK-TTVWSFPNRG-SWATH--KGNYRGNWSPYVPRNLILKYTNKGDCVLDQMMGSG-TTLVEAKLLERNAIGVDINLDAVMVALDRLNFP--------Y--G----------------------------Q-----STIKTYWGDARNLD---------KIESQSIDLIATHPPYANMISYT-KN-KKLSDDLSLL-SPEEYLKEMRKVAEESYRVLKPGKVCAILIGDTRKYKHYVPIAFRVMQVFLEAGFILREDIIKL----QWKMKATREKW----------------RAK---EY--DFYLIAHEHIFVFRK|-----------------------P-----EKEEEY--RKYKL----S----- MTC_RS07180_Methanocella_conradii_504218926 
-------------------------------------------------------------------MKNNSNISL----------------------------APT----N-FE-PEF-TTLWSFPVRG-NWATH--SPDYRGNFAPQIARNLILKYSKEGDTVLDPMAGGG-TTLIEAKLLNRKGIGFDINPKAVDITIKNLRFECN------S-------------------------------N-----YEPKVKVGDVRNLK---------EIPDSSIDLIITHPPYLNIIKYS-DG-K-IEGDLSNISSLKKFCDELELGIKEFYRVLKEDSYCAILIGDTRRAKHYVPLSYYVMERFLDNGFVLKEDIIKA----QHNCE-STPYW---KS-----------KAE---KL--NIYLIMHEHLFVFRK|-----------------------P-----SEHENL--SRLRY----S----- _Gracilibacteria_bacterium_JGI_0000069-K10_742671763 -------------------------------------------------------------------M-AKKLK-L----------------------------PPE----D-FE-QEC-STVWSFPRRG-NWATH--NSKYRGNWSPEVVRNLILRYSKEGDYLLDPMIGGG-TTAIEAKLLGRNLLCYDINPEAIKLTESFLDFEIP------S-----------------------------PTKER---ARVRLKKHNATKKNK--------DLKDESIDFVLMHPPYVDIIKYS-D--G-IKGDLSHIHDLDEFSDEIEKVAKESFRVLKKGGYCAVLMGDTRREKMYQPLAFKTMERFLKVGFALKEDIIKV----QHNCK-ATGFW---VN-----------KSK---DY--NFLLIMHEHLFIFKK|-----------------------I---------------------------- ACD_71C00187G0001_uncultured_bacterium_(gcode_4)_406901678 -------------------------------------------------------------------M-PKKIKKL----------------------------QPE----E-FD-QEC-TTVWSFPRRG-NWATH--NSKYRGNWSPDVVRNLIVRYSKEGDTLLDPMIGGG-TTAIECKLLNRNLIAFDVNPASIELSESMLDFEYD------S-------------------------------S-----AKIRIVQGDARELMK--------KVGDESVDFILHHPPYADIIKYS-EW-K-IPEDLSNIHDIDEFADEMEKIARECFRVLKKWQYCAILIGDTRREKMYQPMAFKVMERFLRVGFALKEDIVKV----QHNCK-ATGYW---KT-----------SSQ---KY--NFLLIMHEHLFIFKK|-----------------------P---------------------------- _Methanocella_conradii_504218923 
-------------------------------------------------------------------MKNNSNISL----------------------------APT----N-FE-PEF-TTLWSFPVRG-NWATH--SPDYRGNFAPQIARNLILKYSKEGDTVLDPMAGGG-TTLIEAKLLNRKGIGFDINPKAVDITIKNLRFECN------S-------------------------------N-----YEPKVKVGDVRNLK---------EIPDSSIDLIITHPPYLNIIKYS-DG-K-IEGDLSNISSLKKFCDELELGIKEFYRVLKEDSYCAILIGDTRRAKHYVPLSYYVMERFLDNGFVLKEDIIKA----QHNCE-STPYW---KS-----------KAE---KL--NIYLIMHEHHLFLGS|-----------------------R-----VNMKI------------------ ACD_81C00186G0010_uncultured_bacterium_406873648 -------------------------------------------------------------------MSIKDFK-L----------------------------HPE----E-FD-LEC-TTVWAFPRRG-NWATH--ASDWRGNWAPEVVRNLILRYSSEKDHLLDCMIGGG-TTAIEAKILNRHITCIDVNEEALERTRKSLEFEVE------N-------------------------------K-----AKQRVMKCDARDMS---------FIKDNEIDFVLTHPPYADIIKYS-DG-Q-IEEDISGIHDIDAFVDEIEKVAKELYRVLRPGKYCAILMGDTRRNKMYQPLAFKVMERFLRVGFVLKEDIIKR----QFNCK-ATGFW---VN-----------KSK---ES--NFLLIMHEHLFVFQK|-----------------------LDSIKSPDFATV--SKIKT----I----- FERPE_RS08390_Fervidobacterium_pennivorans_752594110 ---------------------------------------------------------------------------M----------------------------HDLNVEFD-FK-PEI-TTVWSFPERG-KWSTH--KGTYRGNFAPQVARNLLLRYTKEGDVILDPMMGSG-TTLIEAKLLKRRAIGIDINPTSVELTKRNLSFNCP------N-------------------------------S-----YEPEVFIGDARDLS---------FIEDETVDFVLLHPPYLNIIKYS-EG-N-INGDLSNISDVKRFCTELEKVIIELFRVLKPGKFCSVLIGDTRKNGHYVPLSYYVLTLFLKNGFVLKEEIIKI----QHNCT-STPYW---RK-----------KVS---EN--NFYLIMHEHLFVFKK|-----------------------P-----ELGENV--SKIKY----S----- Ferpe_1711_Fervidobacterium_pennivorans_DSM_9078_383110164 
---------------------------------------------------------------------------M----------------------------HDLNVEFD-FK-PEI-TTVWSFPERG-KWSTH--KGTYRGNFAPQVARNLLLRYTKEGDVILDPMMGSG-TTLIEAKLLKRRAIGIDINPTSVELTKRNLSFNCP------N-------------------------------S-----YEPEVFIGDARDLS---------FIEDETVDFVLLHPPYLNIIKYS-EG-N-INGDLSNISDVKRFCTELEKVIIELFRVLKPGKFCSVLIGDTRKNGHYVPLSYYVLTLFLKNGFVLKEEIIKI----QHNCT-STPYW---RK-----------KVS---EN--NFYLIMHEHLFVFKK|-----------------------P-----ELGENV--SKIKY----S----- ACD_28C00322G0004_uncultured_bacterium_406967845 ---------------------------------------------------------------------MSEIK-L----------------------------HPE----E-FE-LEC-TTVWAFPRRG-NWATH--KSDWRGNWSPEVARNLILRYSKEKDHLLDCMIGGG-TTAIEAKILNRHITCIDVNEEALERTKKSLEFEVD------N-------------------------------K-----AKQRVAKCDARNMS---------FIKDNEIDFVLTHPPYADIIKYS-EG-K-IEEDLSGIHDIDAFVDEIEKVAKELFRVLKKGKYCAILMGDTRRNKMYQPLAFKVMQKFLDTGFVLKEDIIKR----QFNCK-ATGFW---VT-----------KSK---ES--NFLLIMHEHLFVFQK|-----------------------V---------------------------- ACD_18C00096G0009_uncultured_bacterium_406986924 ---------------------------------------------------------------------MVKMK-L----------------------------HPD----N-FD-LEC-STVWSFPRRG-KWATH--KSDWRGNWAPEVVRNLILRYSGEKDHLLDCMIGGG-TTAIEAKILNRHITCIDVNEEALERTRKSLNFEVN------N-------------------------------K-----ARQRIIKCDARKMD---------FIKDNEIDFVLTHPPYADIIKYG-EG-K-IKEDLSNIHDIEKFAEEMELVAKELYRVLKPQKYCAILIGDTRRNKMYQPMAYKVMDKFLKQGFKLKEDIIKQ----QHNCK-ATGFW---VK-----------KSK---KL--NFLLIMHEHLFVFQK|---------------------------------------------------- NA23_RS09565_Fervidobacterium_islandicum_701167223 
---------------------------------------------------------------------------M----------------------------RDLSREIE-FK-PEI-TSVWSFPDRG-KWSTH--RGNYRGNFAPQVARNLLLKYTQEGDLVLDPMMGSG-TTLIEAKLLKRKAIGIDINPESVELTRKNLDFNCD------N-------------------------------C-----YKPEVLLGDARKMS---------FLNDEVVDFIILHPPYLNIIKYS-NG-N-IVGDLSTISDVKTFCLELEKVVHELFRVLKQNKYCAVLIGDTRKNGHYVPLSYYVATLFLKNGFVLKEEIIKV----QHNCS-STPFW---EK-----------KVQ---EH--NFYLIMHEHLFVFRK|-----------------------P-----AQDENL--SRIKY----S-SGRF TTHE_RS09375_Thermoanaerobacterium_thermosaccharolyticum_503063371 ---------------------------------------------------------------------------M----------------------------ENI----N-FK-KEI-TTVWSFPERG-DWATH--NGKYRGNFAPQVPRNIILRYSKENDIVLDPMVGSG-TTLVEAKLLNRRGIGFDINPDAIDITKRNLNFGAN------FSGK---------------------------CK-----FEPDAKIGDIRNLK---------EIDDNSIDLIITHPPYLNIIKYS-NG-N-IEGDLSNISGVKKFLNELEKGVSELFRVLKNNRYCAILIGDTRKSGHYVPLAFYVMQLFLKNGFILKEDIIKV----QHNCK-STPYW---ES-----------QVE---KY--NFYLIMHEHLFVFRK|-----------------------P-----DIDEDV--SKVRY----S----- HMPREF9131_RS04375_Peptoniphilus_490963643 ------------------------------------------------MTS----------K---I---TK----W----------------------------EPE----N-FE-LEM-TTHWSFPERG-DWATH--DAKWRGNWSPYIPRNIILRYSKEKDLILDQFAGGG-TTLVEAKLLKRNIIGLDVNDVALNRCREKIDFEHE---------G----------------------------AD----GKVFLRKGDARNLD---------FISDNSIDLICTHPPYANIIKYS-EN---IKEDLSQL-KINDFLDEMKKVASESYRVLKKDKFCAVLMGDTRKNGHMIPLSFYVMQVFENAGFKMKEMIIKE----QHNCR-ATGFW---KT-----------NSI---KY--NFLLIAHEHLFIFRK|---------------------------------------------------- SPICO_RS02800_Sphaerochaeta_coccoides_503504517 
-----------------------------------------------------------M-T---I---RK----W----------------------------EPD----N-FE-LET-NTVWSFPDRG-NWATH--DAKWRGNWSPYIPRNILLRYSGEGDWVLDQFVGGG-TTLVEAKLLNRNIIGIDVNPDALNRCKAKIDFEC----------P----------------------------NA----GTVKLYQNSAGNLS---------FIEANSIDLICTHPPYADIIHYS-ED---IEGDLSLM-SVRDFLGAMKPVAEECYRVLKKGKFCAVLMGDTRKKGCVIPMSFDVMKIFEAAGFVTKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KH--NFLLLAHEYLFVFRK|---------------------------------------------------- P159_RS0116605_Selenomonas_ruminantium_657829373 -----------------------------------------------------------------M---IK----W----------------------------EPE----D-FE-LRM-TTHWTFPKRG-DWATH--DAKWRGNWSPYIPRNIMLRYSKEGDCVLDQFAGGG-TTLVEAKLLNRNIIGVDVNDMALERCREKTDFEHE---------G----------------------------AE----GRVTLQKGDARNLD---------FLKDEQIDLICTHPPYANIIQYS-ED---IPADLSRM-AIADFLEEMKKVAKESYRVLKKDKFCAILMGDTRKKGCMVPMSFDVMKIFEEAGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- BN820_00869_Acetobacter_sp_CAG:977_547265226 ------------------------------------------------MKR--------------I---KC----W----------------------------QPE----N-FE-LEM-TSVWSFPNRG-KWATH--DAKWRGNWSPYIPRNLILRYSQEGDVVLDQFVGGG-TTLVEAKLLNRNAIGVDINDAALERCAEKTSFQYE---------G----------------------------SE----GEISIVKADARDLS---------FISNESIDLICTHPPYANIIQYS-DD---LENDLSRL-SLKDFLAEIQKVASESYRVLKKGKYCAVLMGDLRKKGHVFPLGMNVMQIFESVGFSLKEIIIKE----QHNCK-ATGYW---KT-----------SSI---KY--NFFLLAHEYLFVFKK|--K------------------------------------------------- Q388_RS0120175_Ruminococcus_albus_503262746 
-------------------------------------------------MK----------K---I---KK----W----------------------------EPD----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLNRDIIGVDCNDEALTRCREKIDFDYP---------P----------------------------AH----GKVFLYKGDARDLY---------FQSDESVDLICTHPPYADIIKYS-DG---IPEDLSQL-KVKDFLEAMKPVAAECYRVLKKGKFCAVLMGDTRQKGCMIPMSFDVMKIFQEAGFTLKELIIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- I825_RS0102520_Atribacteria_bacterium_SCGC_AAA255-G05_658522169 -------------------------------------------------MS--------K-K---I---KT----F----------------------------YPK----D-FK-EKQ-STVWSFKQRG-NWATH--SGEYRGNWSPFIPRNVILKYSNPGELILDYFCGAG-TTAVECKLLNRKCIAIDINDKAIELAKENVNFNTE---------S--RQLTF--E------------------KNHTQIYEPELLVGDARDLS---------SLKDNSVDLICAHPPYSNIIHYT-DS---KEGDLSFF-DIDEFLKEMEKVAKESFRVLKPGRQCAILIGDTRRKKHIIPLGFKLINIYLEAGFKLRELVIKR----QHNCK-TTGFW---YT-----------NSI---KY--NFLLLAHEYLPIFEK|---------------------------------------------------- _[Eubacterium]_siraeum_505332319 -----------------------------------------------MANK----------K---I---TK----W----------------------------EPE----N-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLNRDIIGIDINDVALERCREKTDFDYE---------P----------------------------AK----GKVYINKGDARHLD---------SIPDDSIDLICTHPPYADIIKYS-DG---IDGDLSQL-KVKEFLEQMKPVAEESYRVLKKGKFCAILMGDTRQKGCMIPMSFDVMKIFQDAGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- N469_RS0107485_Marinimicrobia_703200955 
---------------------------------------------M--KVS--------K-K---I---KR----L----------------------------YPE----D-FK-EEL-TTVWSFKQRG-NWATH--SGEYRGNWSPYIPRNVILKYSKPDELVLDYFCGAG-TTAVECKLLGRKCIAFDINDKAIELARRNLNFIVE---------S--QQLSLIDE------------------KLHPQIYEPALSVGDARELS---------LLQDNSIDLICAHPPYANIIHYT-DS---KEGDLSFC-DIDEFLKEMGKVAKESFRVLKPGRQCVILIGDIRKKKHVIPLGFKLINVYLNAGFKLRELVIKR----QHNCK-TTGFW---YA-----------NSI---KY--NFLLLAHEYLPIFEK|---------------------------------------------------- BN720_00766_Eubacterium_sp_CAG:581_548315511 ------------------------------------------------MNK----------R---I---TK----W----------------------------GPD----D-FE-LEM-TTHWSFPDRG-KWATH--DAKWRGNWSPYIPRNILLRYSNEGDLVLDQFAGGG-TTLVEAKLLNRNIIGVDCNDVALKRCKEKIDFDYE---------P----------------------------AK----GKVYIRKGDARNLD---------FIKDDSIDLICTHPPYANIIQYS-DD---ITEDLSLL-KINDFLEQMKKVASESYRVLKKGKFCAVLMGDTRQKGHMIPMSFDVMNIFQNTGFKLKELIIKE----QHNCK-ATGFW---KT-----------NSV---KY--NFLLIAHEYLFVFHK|--I------------------------------------------------- Q355_RS15275_Meiothermus_cerbereus_738305516 --------------------------------------------MT--RRK--------S-K---I---TQ----W----------------------------EPK----G-FQ-LET-TTVWSFKNRG-KWATH--DGRYRGNWSPYIPRNLILRYSQPHEVVLDYFSGGG-TTAVEAKLLTRRCIARDINPDALALTKENLDFQLP---------Q--DMFS---G------------------NGH---FPIQIELGDARDLS---------SIEDESIDLICAHPPYAGIISYSANA---VDGDLSTL-CVPEFIDEMQKVARESYRVLKAGRQCAILIGDSRKSKHIVPIGFLTIRAFLNAGFVLRELIIKR----QHNCK-TTGFW---YS-----------NSI---RY--NFLLLAHEFLPVFEK|---------------------------------------------------- HMPREF1497_RS08290_Fusobacterium_492656844 
------------------------------------------------MNK----------K---N---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEEDLILDQFIGGG-TTLVEAKLLNRNIIGVDVNNVAIERCKEKINFNFE---------N----------------------------S-----GKVYIHKGDARKLD---------FIKDETIDFICTHPPYANIIEYS-ED---IEEDLSHL-KISEFLKEMKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEAEGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFVFKK|---------------------------------------------------- L21TH_RS00440_Caldisalinibacter_kiritimatiensis_493349121 --------------------------MDK--------KLG----KYEDKKS--------H-K---V---KE----W----------------------------EPK----D-FK-LEA-TTVWSFPNRG-KWATH--SGKYRGNWSPYIPRNIILRYSKKDDTVLDQFLGSG-TTLIETKLLHRNGIGVDVNQNAINIAKENLEFTKN---------K----------------------------E-----YEPKIIKGDARDLD---------FISDESIDLICTHPPYANIIKYS-DN---IKEDLSRY-DINQFLVEMKKVASECYRVLKKDKYCAILIGDTRRKKHMIPLGFKVMEVFLDAGFVLKENIIKE----QHNCK-ATGFW---YK-----------RSI---EY--NFLLIAHEYLLVFRK|-PVDNEDKKV------------------------------------------ HMPREF9093_RS05275_Fusobacterium_sp_oral_taxon_370_496969638 ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTQEKDLILDQFAGGG-TTLVEAKLLNRNIIGIDINDVAIERCKEKIDFEFE---------N----------------------------S-----GKVYIRKGDARNLD---------FIKDETIDFVCTHPPYANIIEYS-ED---IEGDLSHL-KIPEFLKEIEKVATESYRVLKKDNFCAILMGDTRIKGHIQPLGFEVMKVFEKVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|---------------------------------------------------- HMPREF1504_RS04555_Veillonella_sp_ICM51a_740284293 
------------------------------------------------MAK-------NIKK---I---KK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNLLLRYSQEGDLILDQFAGGG-TTLVEAKLLNRDIIGIDINEVALERCKEKIDFDYE---------S----------------------------AK----GRVELHKGDARNLD---------FISDDSIDFVCTHPPYANIIKYS-EG---IEGDLSQL-KVPEFLEEMKLVASESYRVLKKGRFCAILMGDTRQKGHMVPMSFDVMRIFEEAGFKLKELIIKE----QHNCR-ATGYW---KT-----------NSI---KY--NFLLIAHEYLFVFKK|---------------------------------------------------- MSUIS_RS03915_Mycoplasma_suis_503374808 --------------------------------------------MS--SKV--------K-K---F---TK----W----------------------------GPD----N-FE-LET-STIWNFPNRG-KWATH--DAKYRGNWSPYIPRNILLRYSSEGDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNEESLKRCREKTSFEFN---------G---------P------------------K-----GQVEIVKGDARDLN---------FIKSESIDLICTHPPYANIIHYS-EG-QVIEEDLSNL-KVSEFLEEMKKVAQECCRVLKKNKYCVILMGDTRKNGHMIPLSFDVMKLFEDVGFKLKELIIKA----QHNCK-ATGFW---KT-----------NSV---KH--NFLLIAHEYLFVFRK|---------------------------------------------------- COPRO5265_RS06650_Coprothermobacter_proteolyticus_754097257 ----------------------------------------------------------------------M----W----------------------------EPE----D-FS-LET-TTVWGFPDRG-DWATH--SGKYRGNWSPYIPRNVILRYSNENDVVLDQFVGSG-TTLVEAKLLGRRGLGVDINPDAVKLALSNVNFEHK--------------------------------------C-----GLADVHIGDARNLD---------FVKDSSIDLICTHPPYSNIIKYS-DN---IEGDLSHY-DIPEFLKEMYKVASESYRVLKRGRFCAVLMGDTRRKGNIIPLGFRVMEVFCKAGLTLKEIVIKE----QHNCT-STGYW---KK-----------QSI---KY--NFLLIAHEYLFIFKK|---------------------------------------------------- SELSP_RS03130_Selenomonas_sputigena_493205676 
------------------------------------------------MVK----------K---I---TK----W----------------------------EPE----E-FE-LEM-TTHWSFPKRG-NWATH--DAKWRGNWSPYIPRNILLRYSEEKDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNDTALERCKEKIDFEHD---------G----------------------------AD----GKVYIHKGDARNLD---------FIPDGSIDLICTHPPYADIIKYS-ED---IEADLSHL-KVKDFLEEMNAVAAESYRVMKKGKFCVVLMGDTRQKGHMIPMSFQVMRIFEDAGFTLKELIIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- MSU_RS04145_Mycoplasma_suis_762897706 -----------------------------------------------------------------M---LK----L----------------------------GPD----N-FE-LET-STIWNFPNRG-KWATH--DAKYRGNWSPYIPRNILLRYSSEGDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNEESLKRCKEKTSFEFN---------G---------P------------------Q-----GQVEIVKGDARDLN---------FIKSESIDLICTHPPYANIIHYS-EG-QVIEEDLSNL-KVSEFLEEMKKVAQECYRVLKKNKYCVILMGDTRKNGHMIPLSFDVMKLFEDVGFKLKELIIKA----QHNCK-ATGFW---KT-----------NSV---KH--NFLLIAHEYLFVFRK|---------------------------------------------------- J145_RS0109710_Fusobacterium_hwasookii_657692329 ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFIGGE-TTLVEAKLLNRNIIGIDVNDVAIERCKEKINFEFE---------N----------------------------S-----GKVYINKGDARKLD---------FIKDESIDFVCTHPPYANIIEYS-EN---IDEDLSHL-KIPEFLKEMKKVASESHRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEGVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|---------------------------------------------------- MSU_0848_Mycoplasma_suis_str_Illinois_323652279 
---------------------------------------------------------------------MK----L----------------------------GPD----N-FE-LET-STIWNFPNRG-KWATH--DAKYRGNWSPYIPRNILLRYSSEGDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNEESLKRCKEKTSFEFN---------G---------P------------------Q-----GQVEIVKGDARDLN---------FIKSESIDLICTHPPYANIIHYS-EG-QVIEEDLSNL-KVSEFLEEMKKVAQECYRVLKKNKYCVILMGDTRKNGHMIPLSFDVMKLFEDVGFKLKELIIKA----QHNCK-ATGFW---KT-----------NSV---KH--NFLLIAHEYLFVFRK|---------------------------------------------------- G497_RS0101670_Desulfovibrio_cuneatus_652933168 ------------------------------------------------MLK--------N-K---E---QK----W----------------------------GPD----N-FE-LEM-NTVWSFPQRG-NWATH--DAKYRGNWSPYIPRNILLRYSSEGDYVLDQFAGGG-TTLVEAKLLKRNVLGVDVNESALECCRVKCDFESE---------N----------------------------A-----GRVVIRHGDARNLN---------FIKDECIDLVCTHPPYANIIQYS-EN---NLNDLSHL-DVTSFLEQMKLVAAESYRVLKKDKFCAILMGDTRKKGHIIPMSFEVMRIFEHAKFKTKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFLLLAHEYLFIFKK|---------------------------------------------------- FSCG_RS00585_Fusobacterium_nucleatum_496076017 ------------------------------------------------MN-------------------KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEEDLILDQFIGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKIDFEFE---------N----------------------------S-----GKVYIHKGDARRLD---------FIKDETIDFICTHPPYANIIEYS-ED---IEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEAEGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFVFKK|---------------------------------------------------- HMPREF1583_RS06235_Gardnerella_vaginalis_515155565 
------------------------------------------------MTV--------T-A---I---KR----W----------------------------EPE----N-FE-LEM-TTHWSFPDRG-NWATH--DSKWRGNWSPYVVRNLLLRYSAEKDLVLDQFVGGG-TTLVEAKLLNRDVIGVDVNDIAINRCREKVSFNHE---------G----------------------------AD----GRVYIRKGDARNLD---------FLDDESIDFICTHPPYANIIKYS-EN---IPEDLSLL-KVDAFLSQMKKVAEESYRVLKTNKFCAVLMGDTRQKGCMIPMSFDVMKIFQNAGFTLKEIIIKE----QHNCR-ATGYW---KT-----------NSI---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- BN788_01674_Eubacterium_siraeum_CAG:80_547865125 ------------------------------------------------MHK----------K---I---TK----W----------------------------QPD----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLNRDIIGIDINDVALERCREKTDFDYE---------P----------------------------AK----GKVYINKGDARHLD---------SIPDDSIDLICTHPPYADIIKYS-DG---VDGDLSQL-KVKDFLEQMKPVAEESYRVLKKGKFCAILMGDTRQKGCMIPMSFDVMKIFQDAGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- FUSO3_RS09465_Fusobacterium_necrophorum_737952516 ------------------------------------------------MVK----------K---I---KK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNILLRYSKEEDLVLDQFAGGG-TTLVEAKLLNRDVIGVDVNEFALERCQEKISFEYE---------T----------------------------AK----GKVYLRKGDARKLD---------FIPDESVDLICTHPPYANIIQYS-ED---IEEDLSHL-KIKDFLEEMKKVAGESYRVLKKDKFCAILMGDTRQKGHMMPMSFEVMKIFEEVGFKLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- BN656_01315_Bacteroides_pectinophilus_CAG:437_547961389 
-------------------------------------------------------------------------------------------------------MQPN----N-FQ-LEP-TTVWSFPDRG-SWATH--SGKYRGNWSPYVPRNLILRYSKPGEWVLDQFMGSG-TTLVEAKLLGRNAVGIDINPQSVSISETNLKFQCE---------T----------------------------K-----SKIFTKNADATNLH---------FIKDEHIDFICTHPPYADIIKYS-KG---ISGDISLL-CVDKFLGEMNKVAAESYRVLKRGKMCAVMIGDVREHGKVIPLGFRMMEGFLNAGFSNKEIIIKE----QHNCR-STKYW---EN-----------HNN---S----FLMLAHEYIFVFQK|---------------------------------------------------- HMPREF0405_RS07885_Fusobacterium_nucleatum_496073207 ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEEDLILDQFIGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKIDFEFE---------N----------------------------S-----GKVYIHKGDARRLD---------FIKDETIDFICTHPPYANIIEYS-EE---IEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCVILMGDTRIKGHIQPLGFEVMKVFEAEGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFVFKK|---------------------------------------------------- CLOHIR_RS00470_[Clostridium]_hiranonis_493484190 -----------------------------------------------------------------M---IN----W----------------------------EPS----N-FK-LET-GTVWIFPERG-SWATH--TPKYRGNFSPYVPRNLILRYSKKGDMILDQFAGGG-TTLIEAKLLGRNIIGVDVNIQALALCRSSTNFEYK---------N----------------------------S-----SKVYLRRGDARNLN---------FIPDEKIDFICTHPPYADAIKYS-KD---IVEDISLL-DYKSFLKEMEKVAKESYRVLKKGKYCAILMGDIRKNGNVIPLGFEVMNIFKNVGFINKEIIIKE----QYNCK-STDYW---IK-----------KSF---ER--NFLLLEHEYLFVFRK|---------------------------------------------------- HMPREF1090_RS26280_[Clostridium]_clostridioforme_488666942 
------------------------------------------------MAK----------K---I---TK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLRRNIIGVDVNDVALARCREKIDFEHE---------G----------------------------AD----GKVYIHKGDARHLD---------FIPDGSIDLICTHPPYADIIRYS-ED---IDEDLSHL-KVKDFLEEMKTVAQESYRVLKKDKFCAVLMGDTRQKGHMVPMSFEVMRIFEDAGFKLKELIIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- HMPREF1501_RS03285_Fusobacterium_sp_OBRC1_492605690 ------------------------------------------------MKK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFIGGG-TTLVEAKLLNRNIIGIDINDIAIERCKEKIDFEFE---------N----------------------------S-----GKVYIHKGDARKLD---------FIKDESIDFICTHPPYANIIEYS-ED---IDEDLSHL-KIPEFLKEIKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEGEGFKLKEIIIKE----QHNCK-ATGYW---KT-----------SSI---KY--NFFLIAHEYLFIFKK|---------------------------------------------------- BN647_00380_Firmicutes_bacterium_CAG:41_547820585 -----------------------------------------------MENK----------K---I---KK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLNRDIIGVDVNDAALDRCREKTDFDYE---------P----------------------------AK----GKVYIKKGDARNLD---------FVPDESIDLICTHPPYADIIKYS-DG---LKNDLSQL-KVKDFFEEMKKVASESYRVLKKDKFCAILMGDTRQKGCMIPMSFDVMKIFQDAGFKLKELIIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- HGRM_RS15055_Ruminococcus_sp_JC304_517992866 
--------------------------------------------------------------------------------------------MSKKKGEILIREAPE----K-FK-LED-TTIWSFPERG-SWATH--SGKYRGNWSPYIPRNLILRYSKKNDWILDQFLGSG-TTLIEAKLLGRNAIGVDINSEAIKLSNTNLNFTCQ---------E----------------------------S-----SKIFTKQGNATELS---------FIKDESINLICTHPPYADIIRYS-NK---IPGDISHL-KYEEFLKALEQVAREAYRVLKKQGICAFMIGDIRRAGYVLPLGMNSMQKFVNAGFKLKEIVIKE----QHNCR-SADYW---DG-----------KER---N----FLMLAHEYIFILKK|-----------------TDDYKS----------------------------- HMPREF9124_RS05925_Oribacterium_sp_oral_taxon_108_738699070 -----------------------------------------------------------------------------------------------------MLLQPN----S-FN-LEQ-TSIWSFPERG-KWATH--SGKYRGNWSPYIPRNLILRYSKPGDWVLDQFLGSG-TTLVEAKLLNRNGIGIDINPKALSLSRTNLSFHSN---------S----------------------------R-----AQIFLKKGNAAKLA---------FIKDNRIDFICTHPPYSNIISYS-SD---LAGDISLC-NEKEFIIAMKKVAAESFRVLKKGKYCAVMIGDKRIHGNVIPLGFQLLTCFLETGFVLKEIIIKV----QHNCR-ATSNW---QN-----------KNR---N----FLMLAHEYIFVFYK|---------------PNF---------------------------------- HMPREF9124_1256_Oribacterium_sp_oral_taxon_108_str_F0425_333759614 ------------------------------------------------------------------------M--YSKNSIDFVYFQRVIKLNYYKTGGDFILLQPN----S-FN-LEQ-TSIWSFPERG-KWATH--SGKYRGNWSPYIPRNLILRYSKPGDWVLDQFLGSG-TTLVEAKLLNRNGIGIDINPKALSLSRTNLSFHSN---------S----------------------------R-----AQIFLKKGNAAKLA---------FIKDNRIDFICTHPPYSNIISYS-SD---LAGDISLC-NEKEFIIAMKKVAAESFRVLKKGKYCAVMIGDKRIHGNVIPLGFQLLTCFLETGFVLKEIIIKV----QHNCR-ATSNW---QN-----------KNR---N----FLMLAHEYIFVFYK|---------------PNF---------------------------------- J142_RS0109970_Fusobacterium_hwasookii_657695114 
------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFIGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKINFEFE---------N----------------------------S-----GKVYINKGDARKLD---------FIKDESIDFVCTHPPYANIIEYS-EN---IDEDLSHL-KIPEFLKEMKKVASESHRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEGVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|---------------------------------------------------- _Fusobacterium_nucleatum_496296625 ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFIGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKIDFEFE---------N----------------------------S-----GKVYIHKGDARKLN---------FIKNETIDFICTHPPYANIIKYS-ED---IEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEKVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|---------------------------------------------------- G397_RS0107800_[Eubacterium]_siraeum_491499778 -----------------------------------------------MANK----------K---I---TK----W----------------------------EPE----N-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLNRDIIGIDINDVALERCSEKTAFDYE---------P----------------------------AK----GKVYINKGDARCLD---------SIPDDSIDLICTHPPYADIIKYS-DG---IDGDLSQL-KVKDFLEQMKPVAEESYRVLKKGKFCAILMGDTRQKGCMIPMSFDVMKIFQDAGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- BN678_01434_Dialister_sp_CAG:486_547523036 
------------------------------------------------MKK--------I-K-------------W----------------------------EPE----N-FE-LEM-NTVWDFPERG-SWATH--DAKYRGNWSPYIPRNLLLRYSKEGDWVLDQFAGGG-TTLVEAKLLHRNCIGLDVNPAALSRCHEKCEFPFE---------N----------------------------A-----GKIIIREGDARHLD---------FLPDASIDFICTHPPYADIIRYS-ED---LAGDLSHL-RGEAFLAEMEKVAGESYRVLKKDKFCAVLMGDMRQKGCMIPLSFQVMERFLAAGFTLKELIVKT----QHNCR-ATGFW---KT-----------NSV---KY--NFLLIAHEHLFVFRK|---------------------------------------------------- FSDG_RS00930_Fusobacterium_nucleatum_495977000 ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFIGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKIDFEFG---------N----------------------------S-----GKVYIHKGDARKLN---------FIKNETIDFICTHPPYANIIKYS-ED---VEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEKVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|---------------------------------------------------- FSAG_RS07575_Fusobacterium_periodonticum_496069501 ------------------------------------------------MKK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTQEKDLILDQFAGGG-TTLVEAKLLNRNIIGIDVNDIAIERCREKIDFEFE---------N----------------------------S-----GKVYIHKGDARNLD---------FIKNETIDFICTHPPYANIIEYS-ED---IEEDLSHL-KIPEFLKEIEKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEKVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|---------------------------------------------------- CTHBC1_RS07545_Ruminiclostridium_thermocellum_490598869 
TAANNDVTCAFVKKVAKES----TICLEE--------KSK----SY-FADK--------L-N---I---KS----W----------------------------EPE----N-FN-LET-TTVWSFPDRG-DWATH--SGKYRGNWSPFIPRNVILRYSKEGETVLDQFVGSG-TTLVEAKLLKRKGIGVDINPEAVNLTCRNINFEKE---------D----------------------------C-----GETEVHVGDARHLG---------FIKDESVDLICTHPPYSNIIKYS-ED---IEGDLSHC-DINEFLVEMEKVAKESYRVLKKGRFCAILIGDTRRKGHMIPIGFNVMQTFLRAGFKLKEIVIKE----QHNCS-STGYW---RN-----------QSI---KY--NFLLIAHEYLFIFRK|---------------------------------------------------- JCM21531_RS04440_[Clostridium]_straminisolvens_740456070 TAANNDVTQAFVKKVAKES----RIYFEE--------NAK----GY-FVGK--------P-N---I---KL----W----------------------------EPE----N-FN-LET-STVWSFPDRG-NWATH--SGKYRGNWSPFIPRNVIMRYTKEGETVLDQFVGSG-TTIVEAKLLKRKGIGVDINPEAVNLTSRNINFEKE---------D----------------------------C-----GEVEVHVGDARHLG---------FIKDESIDLICTHPPYSNIIKYS-ED---IQGDLSHY-DIDDFLVEMEKVAKESYRVLKKDRFCAILMGDTRRKGHMIPIGFNVMQTFLRAGFKLKEIVIKE----QHNCS-STGYW---RN-----------QSI---KY--NFLLIAHEYLFIFRK|---------------------------------------------------- CLOCLA_RS0112375_[Clostridium]_clariflavum_653611723 TAYRNDVTQLFVKKIMKES----MIYIKE--------KES----DY-YVNK--------L-N---I---KS----W----------------------------EPE----N-FS-LET-TTIWGFPDRG-SWATH--SGKYRGNWSPFIPRNVILRYSTEGEIVLDQFVGSG-TTLVEAKLLNRKSIGIDINPEAVNIARHNTNFERD---------G----------------------------S-----GEVEVHVGDARHLE---------FIDDESIDLICTHPPYSNIIRYS-EN---IQGDLSHC-DIKEFYKEMEKVAIECYRVLKKNKFCAILIGDTRKKGHMIPIGFNVMEIFLRTGFKLKEIVIKE----QHNCS-STGYW---RN-----------QSI---KY--NFLLIAHEYLFIFRK|---------------------------------------------------- K364_RS0114940_Desulfitibacter_alkalitolerans_654856343 
--------------------------MTS--------NKK----L----K-------------------------W----------------------------EPD----K-FE-LQT-NTVWSFPDRG-NWATH--NSKYRGNWSPYIPRNLILRYSKEGDTILDQFAGSG-TTLIEAKLLNRNCIGVDINAVSIELCRENTDFERE---------N----------------------------C-----GHVTIKRGDARDLS---------FINDKSIDFICTHPPYANIIKYS-ND---IIGDLSCY-EVGDFLKEMKKVAAECYRILKEDKYCAILIGDTRKKGHIVPIGFEVMKVFEAIGLKIKEIIIKE----QHNCT-STCYW---RN-----------KST---KY--NFLLLAHEYLFVFKK|---------------------------------------------------- TVG_RS02300_Thermoplasma_volcanium_499219160 -------HTTELARHYANE-VY-NMAFIS-R------KLP--------YSD--------I-D---L---SR----W------------R---------------EYD----F----VIT-DSLWLFDKRD-YRGSK--LGWYWGNFVPQIPRQLILRFSRKDEWVLDPFSGSG-TTLIEAKKLGRNSLGIEINEEVCKKSLEILNSIDG---------D----------------------------G-----FSTAIS-GDSASVN-LTKVME--YYGIPEFNLVIMHPPYHDIIKFT-DI----GGDLSNARDTKEFLSMLGKVTRNVSKYLQKGRFLALVIGDKYSNGEWIPLGFYSMQKVMDQGFRLKSTIVKNFEYTRGKAS-SSDLW---RY-----------RAL---AG--GFYVFKHEYIFVFQK|----T----------------------------------------------- AMDU3_IPLC00003G0029_Thermoplasmatales_archaeon_I-plasma_546149073 -------KSSKLPKL-----------------------AP--------FSD--------I-D---L---SR----W------------K---------------DYS----D----VWT-DSLWVIDRRD-TSGAH--INSYHGNFIPQIPHQLMMRYTKKGDWVLDPFLGSG-TTLIECIRMGRNGIGVELNESVADKSKSIIASEPN---------S-F--------------------------G-----VTTVTSVGDSSTLP-FRDLLD--SINVKSVQLAVLHPPYHGIIKFS-DN----PADLSNAGTVDDFLLQLSRVVSNTMQVLDEGRYMALVIGDKYEKGKWIPLGFYAMQKVMDAGMELKSIVVKNYGETRGKSH-QQSLW---RY-----------RAL---QG--GYYLFKHEYVFIFRK|----A----------------------------------------------- AMDU1_APLC00004G0008_environmental_samples_546147902 
-------RGIDRAHYYVRS-VI-RDIGSS-A------SAS--------SLE--------F-N---I---NL----W------------K---------------AYD----E----IRT-DSLWILGKRD-REAGH--KGWYWGNFVPQIPHQLMMRYTRTSDWILDPFCGSG-TTLIEAIRMERNSVGIEINPEVYSRTREAVQSLPH---------D----------------------------G-----TRAEIILGDSYTVD-LVPVME--RNGVSSFDMVLLHPPYWDIIRFS-DD----QGDLSTSPDMESFISRFTQIAKKSIAVLKSGGYMGLVIGDAYRDGEIVPLGFRCMDAVASLNMKIRGIIVKDIQNTRGKRS-SENLW---RY-----------RAL---KS--GFYVFKHEYVFVFQK|----P----------------------------------------------- FFONT_0867_Fervidicoccus_fontis_504370902 RRPLKEITLEDFESIAKRK----KY-VTI--------GCK----K---IEL----------E---I---EG----F--KEL-----------------------QPK----E-FV-VEK-TSVWSFPERG-KWATHKYNAKFRGNWSPQVARNLLLLYSKSGDTVLDPFLGSG-TSMIECILLKRRCYGVDINIDSVMLSWSRIKPIYS---------S----------------------------D-----SFVKLFEGDAEYLD---------AFEDEKFDFILGHPPYASIIKYS-KG---SDGDLSKM-SIQEYLEKMRRIARELYRILKKDKYLAIMVGDIRRKKHVIPLGYMVMKIFLEEGFIIKEHIIKV----QHNMI-GTAFW---KN-----------KKN---D----FLLLKHEHIFVFRK|----PLDSSD-YEK---FQYYMLY---------------------------- _Taylorella_asinigenitalis_505364599 ------------------------------------------------MQK--------------S---KKIKT-W----------------------------EPE----N-FE-LEM-TTHWSFPKRG-NWATH--DAKWRGNWSPYIPRNVILRYSKEGDLILDQFAGGG-TTLVEAKLLNRNIIGIDINQDSIDRCKEKTDFKLT---------L----------------------------EL----GNVDIKKGDARELT---------NIKDESIDLICTHPPYADIIKYS-ED---IPEDLSRL-KIKEFLNEMTKVADESYRVLKKGKFCAILIGDMRKNGNVIPLSTKVMNVFTDAGFVLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFLLLAHEYLYIFKK|--T------------------------------------------------- Y919_RS08595_Caloranaerobacter_azorensis_737178046 
IEEYTYLPEKLVLEKLEN--------LKE--------DRE----LYRVKKP--------L-R---I---QS----W----------------------------EPK----D-FA-LEA-TTVWSFPDRG-KWATH--NGKYRGNWSPYIPRNIILSYTKKGDIVLDQFLGSG-TTLVETKLLERRGIGVDINLDAIKVARANLRFNKN---------K----------------------------E-----YEPKIYKGDARNLD---------FIPDNSIDLICTHPPYANIIKYS-ND---IEGDLSLC-NIDEFINEMKKVAKEAFRVLKENKYCAILIGDTRKNKHMIPLGFKVMQVFLDAGFILKEIIIKE----QHNCK-ATGFW---YK-----------RSI---EY--NFLLIAHEYLFVFRK|-PKSK----------------------------------------------- HMPREF1630_RS01030_Anaerococcus_lactolyticus_490965414 ------------------------------------------------MSK--------------I---KK----W----------------------------EPD----D-FE-LEM-TTHWSFPQRG-NWATH--DAKWRGNWSPYIPRNIILRYSEEGDLVLDQFAGGG-TTLVEAKLLNRNIIGLDVNDVALNRCKEKIDFNLTD--------R----------------------------PL----GKVKLLKGDARNLD---------FLTDESIDLVCTHPPYADIIKYS-DG---IENDLSLL-KINDFLKEMNKVAAEAYRVLKKDKFCAILMGDTRKNKHMIHLGFDVLKVFEDEGFKLKELIIKE----QHNTR-ATGFW---KK-----------RSV---DY--NFLLIAHEYLFILKK|---------------------------------------------------- CAAU_RS08905_Caloramator_australicus_496184606 CVEKYNFTKDFIKKIFSGN----DMLLKE--------SKQ----LY-LTN-------------------------L----------------------------SPE----K-FE-LET-TTIWSFPDRG-NWATH--SGKYRGNWSPYIPRNILLRYSNEGELILDQFVGSG-TTLVEAKLLNRNAVGVDINPIALEITRENLKFNYE---------Y----------------------------N-----PKIDIKLGDARNLY---------FIEDNSIDLICTHPPYANIIKYS-EN---INGDLSHL-DVKEFLLEIEKVAKECYRVLKKDKYCAILIGDTRKKGHIIPIGFSVMQKFIDAGFKLKEIIIKE----QHNCN-STPYW---KN-----------KSL---KY--NFLLIMHEYLFVFRK|---------------------------------------------------- MB41_RS06705_Anaerosalibacter_sp_ND1_757126172 
-----------------------------------------------MREP--------L-K---V---DS----W----------------------------APE----N-FA-LEA-TTVWSFPERG-KWATH--NGKYRGNWSPYIPRNIILRYSEENDTVLDQFLGSG-TTLIETKLLKRRGIGVDINSETVKLSKENLRFNKN---------K----------------------------E-----YQPIIYNADARNLN---------FISDNSIDLICTHPPYANIIKYS-KN---INGDLSHL-NIDKFIIEIRKVAEESFRVLKNNRYCAILIGDTRKDKHIIPLGFKVMEEFLNAGFVLKETIIKE----QHNCK-ATGFW---YN-----------KSL---QY--NFLLIAHEYLFVFRK|-NCNY----------------------------------------------- CTM_RS16890_Clostridium_tetanomorphum_737163966 -----------------------------------------------------------------M---DE----------------------------------------A-FK-LET-KSLWSFKERG-EWATH--KGDYPGNWSPFVPKNIILRYSKQNDVVLDQFLGSG-TTLIEAKLLNRRAIGCDINPKALEIAKDRISRVKA--------------------------------------N-----TAIQLMECNARNLD---------CIKDNSIDLICTHPPYSNIIKYS-KN---IQGDLSLL-NLNEFYEAIKEVSIECFRVLKKTKYCTIMMGDIRKNGCVIPLGFNVMNLFLNQGFKLKEIIIKE----QHNCN-STKFW---KE-----------ISL---KK--NFYLLAHEYLFVFFK|---------------------------------------------------- LEBU_RS11230_Leptotrichia_buccalis_506250654 ------------------------------------------------MVK--------K-----I---KK----W----------------------------EPE----E-FE-LEM-NTVWSFPDRG-KWATH--DAKYRGNWSPYIPRNLLLRYSNEGDLILDQFAGGG-TTLVEAKLLNRNIIGVDINSNALKRCKEKCDFEYE---------N----------------------------L-----GKVYFYEADARNLN---------FIPDENIDFICTHPPYANIIKYS-ED---IENDLSHL-KVKDFLIEMEKVASESYRVLKKDKFCAILMGDTRQKGHIIPMSFEVMKIFEKVGFKTKEIIIKE----QHNCK-ATGFW---KT-----------NSV---KY--NFLLIAHEYLFVFKK|---------------------------------------------------- VE20218_RS15590_Clostridiales_bacterium_VE202-18_657680312 
---------------------------------------------------------------------------------------------------------MK----T-YE-LQN-TTIWSFPDRG-SWATH--KGDYRGNWSPHIPKNLILKYTKQNDLILDCFAGSG-TTLIEAKLLNRNAIGVDINNDALNISKKRLHFDCH---------N----------------------------S-----AKIELYQCDAKKMT---------MLKDNSIDFICTHPPYTNIIKYS-KN---LENDLSLL-DYKDFLLHMDKVSKELYRVLKQGHNCSFMIGDIRKNGNVIPLGFNTMQVFLNNGFTLKEIIIKE----QHNCS-STKYW---QN-----------KIQ---NL--NFYLLAHEYIFVLSK|---------------------------------------------------- CC89_RS03170_Clostridium_sp_KNHs214_737306053 -----------------------------------------------------------------M---EK----------------------------------------N-FK-LET-DTIWNFEERG-NWCTH--RGDYPGNWSPYVPKNIILRYSKEGEFVLDQFVGSG-TTLIEACLLNRKIIGCDINDRALNICSDRIKNLSK---------K----------------------------------DNVFLKKRDARNLY---------DIKDESIDLICTHPPYANIIKYS-KN---IDGDISLL-DIEEFYEAMKDVAKECYRVLKKEKYCSILMGDTRKRGFIIPLAFNVLNIFMNSGFKLKEIIIKQ----QHNCK-STEYW---RD-----------ISI---KR--NFYLIAHEYLFVFKK|---------------------------------------------------- MELS_RS05115_Megasphaera_elsdenii_503781935 -----------------------------------------------------------M-K-------------W----------------------------QPD----D-FT-LEM-TSVWSFPQRG-KWATH--DGNYRGNWSPYIPRNLILRYSGEGDRILDCFVGGG-TTLVEAKLLSRNCIGVDVNEQALDRCRKKCDFSC----------P----------------------------NM----GKIYLKQGDARNLH---------FIQDASIDFICTHPPYANIIQYS-QD---IEQDLSRL-DVDSFLAEIKKVVCECYRVLKKGKFCAILMGDIRKKGHVIPLSFWVMDLFLQQGFSLKEMIIKE----QHNCR-ATGFW---KT-----------NSV---KY--NFLLLAHEHLFVFHK|-IS------------------------------------------------- HMPREF1580_RS03105_Gardnerella_vaginalis_515278394 
------------------------------------------------MTV--------T-S---I---KR----W----------------------------EPE----N-FE-LEM-TTHWSFPDRG-NWATH--DSKWRGNWSPYVVRNLLLRYSAEKDLVLDQFVGGG-TTLVEAKLLNRDVIGVDVNDIAINRCREKVSFNHE---------G----------------------------AD----GRVYIRKGDARNLD---------FLDDESIDFICTHPPYANIIKYS-EN---IPEDLSLL-KVDAFLSQMKKVAEESYRVLKTNKFCAVLMGDTRQKGCMIPMSFDVMKIFQNAGFTLKEIIIKE----QHNCR-ATGYW---KT-----------NSI---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- HGPG_RS01070_Peptoniphilus_grossensis_517953989 ------------------------------------------------MTN----------K---I---TK----W----------------------------EPE----N-FE-LEM-TTHWSFPERG-DWATH--DAKWRGNWSPYIPRNIILRYSKEKDLILDQFAGGG-TTLVEAKLLNRDIIGIDVNDVALNRCKEKIDFHHE---------G----------------------------AD----GKVFLRKGDARNLD---------FIPDNSIDLICTHPPYANIIEYS-EN---IEEDLSHL-KTNEFLEEMKKVASESYRVLKKDKFCAVLMGDTRKNGHMIPLSFYVMQVFENAGFKLKEMIIKE----QHNCK-ATGFW---KT-----------NSI---KY--NFLLIAHEHLFIFRK|---------------------------------------------------- QSI_RS08630_Clostridiales_496091935 ---------------------------------------------------------------------------------------------------------MN----T-YK-LQN-TTIWNFPDRG-NWATH--KGDYRGNWSPHVPKNLILKYTEQKDLVLDCFVGSG-TTLIEAKLLDRNAIGIDINKKALEITRNRLNFDCN---------N----------------------------N-----AHIQLHLGDAQNLK---------MVKDNSIDFICTHPPYADIIKYS-KN---IENDISNL-EYNEFLAHMNQVSKELYRVLKPSHFCSFMIGDIRKKGNVIPLGFLTMQTFINNGFTLKEIIIKE----QHNCS-STSYW---ND-----------KSK---TL--GFYLLAHEYIFVLYK|---------------------------------------------------- BN748_01131_Fusobacterium_sp_CAG:649_547450181 
------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFMGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKIDFEFE---------N----------------------------S-----GKVYIHKGDARRLD---------FIKDETIDFICTHPPYTNIIQYS-ED---IEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCVILMGDTRIKGHIQPLGFEVMKVFEAEGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFVFKK|---------------------------------------------------- HMPREF1586_RS06065_Gardnerella_vaginalis_490232946 ------------------------------------------------MTV--------T-S---I---KR----W----------------------------EPE----N-FE-LEM-TTHWSFPDRG-NWATH--DSKWRGNWSPYVVRNLLLRYSAEKDLVLDQFVGGG-TTLVEAKLLNRDVIGVDVNDIAINRCREKVSFNHE---------G----------------------------AD----GRVYIRKGDARNLD---------FLDDESIDFICTHPPYANIIKYS-EN---IPEDLSLL-KVDAFLSQMKKVAEESYRVLKANKFCAVLMGDTRQKGCMIPMSFDVMKIFQNAGFTLKEIIIKE----QHNCR-ATGYW---KT-----------NSI---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- HMPREF0993_RS14405_Lachnospiraceae_bacterium_5_1_57FAA_496543443 ------------------------------------------------------------------------M--FLLSELGIILFYFN---GQKKKGEIFIREAPE----K-FK-LED-TTIWSFPERG-SWATH--SGKYRGNWSPYIPRNLILRYSKKKDWILDQFLGSG-TTLIEAKLLGRNAIGVDINSEAVKLSNTNLNFTCQ---------E----------------------------R-----SKIFTKQGNANNLS---------FIKDESIDLICTHPPYADIIRYS-KE---IPGDISHL-KYEEFLKELEQVARESYRVLKRQGICAIMIGDIRRKGYVLPLGMNSMQKFVEAGFKLKEIIIKE----QHNCR-SAYYW---EG-----------RER---K----FLMLAHEYIFILEK|-----------------TDCYNSM---------------------------- BN715_00862_Megasphaera_elsdenii_CAG:570_548306764 
-----------------------------------------------------------M-K-------------W----------------------------QPD----E-FT-LEM-TSVWSFPQRG-KWATH--DGKYRGNWSPYIPRNLILRYSSEGDRILDCFVGGG-TTLVEAKLLNRNCIGIDINAKALDRCREKCDFSC----------P----------------------------NM----GKIYLKQGDARKLH---------FIQDEAIDFICTHPPYANIIQYS-QD---IEQDLSRL-EVELFLEEMKKVVCECYRVLKKGKFCAILMGDIRKKGYVIPLSFFVMDLFLRQGFSLKEMIIKE----QHNCK-ATGFW---KT-----------NSV---KY--NFLLLAHEHLFVFHK|-IS------------------------------------------------- F553_RS0104430_Megamonas_rupellensis_648605175 --------------------------------------------------------------------------------------------------------MKD----N-FK-LEM-TTVWSFPKRG-NWATH--SGMYRGNWSPYVPRNLILKFTAEHDWILDQFMGSG-TTLIEAKLLNRNIIGIDVNEKAYKITEKNLNFECK---------T----------------------------S-----SHIHIRLCSAENIY---------FIKNNSIDCICTHPPYANIIKYS-KD---NQYDISLL-SVEKYLLAMKNVAKESYRVLKSNHICAIMVGDIRKEGILIPLGFYVMNIFKQQGFILKDIIIKE----QHNCK-STSKW---VN-----------IKH---S----FYLLAHEYIFIFEK|---------------------K------------------------------ LEPGO_RS0100700_Leptotrichia_goodfellowii_652339649 ------------------------------------------------MGK--------K-KF--I---KK----S----------------------------EPE----N-FE-LEM-NTVWSFPNRG-KWGTH--DAKYRGNWSPYIPRNLLLRYSNENDLILDQFAGGG-TTLVEAKLLNRNIIGIDVNDEALNRCKEKCNFEYE---------N----------------------------S-----GKVKICKGDARNLD---------FISNESIDFICTHPPYANIIQYS-ET---IENDLSHL-KVKDFLVEMKKVAEESYRVLKKNKFCAVLMGDIRQKGHIIPMSFEVMKIFESVGFKTKEIIIKE----QHNCK-ATGFW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- HMPREF1040_RS02760_Megasphaera_sp_UPII_135-E_494634458 
------------------------------------------------MGK----------K---I---VK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNILLRYSKENDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNEVALARCREKINFDHS---------G----------------------------AN----GKVYLYKGDARTLD---------FIKDNSIDLICTHPPYADIIKYS-ED---IETDLSHL-KVKDFLIAMRDVAAESYRVLKKDKFCAVLMGDTRQKGHMIPMSFEVMKLFQSAGFKLKELVIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFIFRK|---------------------------------------------------- FNV_RS04020_Fusobacterium_nucleatum_492571686 ------------------------------------------------MNK----------K---N---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEEDLILDQFIGGG-TTLVEAKLLNRNIIGVDVNNVAIERCKEKINFNFE---------N----------------------------S-----GKVYIHKGDARKLD---------FIKDETIDFICTHPPYANIIEYS-ED---IEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEAEGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFVFKK|---------------------------------------------------- HMPREF9454_RS05820_Megamonas_funiformis_495813929 --------------------------------------------------------------------------------------------------------MKD----N-FK-LEM-TTVWSFPKRG-NWATH--SGMYRGNWSPYVPRNLILKFTAEHDWILDQFMGSG-TTLIEAKLLNRNIIGIDVNEKAYKITEKNLNFECK---------T----------------------------S-----SHIHIRLCSAENIY---------FIKNNSIDCICTHPPYANIIKYS-KD---NRYDISLL-SVEKYLLAMKNVAKESYRVLKSNHICAIMVGDIRKKGILIPLGFYVMNIFKQQGFILKDIIIKE----QHNCK-STLKW---VN-----------IKH---S----FYLLAHEYIFIFEK|---------------------K------------------------------ NW74_RS05595_Parvimonas_micra_754560689 
------------------------------------------------MKK--------------E---KK----W----------------------------EPE----D-FE-LEM-TTHWSFPNRG-NWATH--DAKWRGNWSPYIPRNIILRYSKENDVVLDQFVGGG-TTLVEAKLLNRNIIGVDVNDIAIQRCKEKVDFEYK---------Q----------------------------SN----SKVIIKKGDARNLF---------FLENESIDLICTHPPYANIINYS-DD---LENDLSRL-NIKDFLIQMEEVANESYRVLKKGKFCAILMGDTRQKGNMIPMSFKVMEIFKKTGFTLKEIIIKE----QHNCK-ATGFW---KT-----------NSI---KY--NFLLIAHEYLFIFKK|---------------------------------------------------- CLPA_RS15010_Clostridium_pasteurianum_489540792 -----------------------------------------------------------------M---SE----------------------------------------V-FN-LET-KTLWSFKERG-DWGTH--KGDYPGNWSPFVPRNIILRYSKDNELILDQFLGSG-TTLIEAKLLNRRGIGCDVNSTALETSKNRIEGVNG---------N----------------------------------NSIKLVKGSAKNMN---------FIKNESIDLICTHPPYSNIIKYS-KD---IDEDLSLL-NIDEFYESIKEVSKEAFRVLKKGKYCAIMMGDIRRNGCVIPLGFNVMNLFLNQGFRLKEIIIKE----QHNCS-STKYW---EE-----------ISL---KK--NFYLLAHEYLFVFLK|---------------------------------------------------- Smon_1038_Streptobacillus_moniliformis_DSM_12112_268315129 ------------------------------------------------MIN--------K-K---L---TK----W----------------------------EPE----N-FE-LEM-NTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNLLLRYSKENDLVLDQFAGGG-TTLVEAKLLNRDIIGVDINEVSLERCREKVNFEHE---------G----------------------------SN----GKVYIHKGDARNLD---------FISDESIDFICTHPPYANIIQYS-DN---IEEDLSHL-KIPQFLEEMKKVAFESYRVLKNDKFCAVLMGDTRIKGYMQPMSFEVMKIFESEGFKLKEIIIKE----QHNCR-ATGYW---KT-----------NSI---KY--NFLLIAHEYLFIFKK|---------------------------------------------------- T504_RS0104020_Selenomonas_sp_ND2010_697204360 
-------------------------------------------------MK----------K---I---KK----W----------------------------EPE----E-FE-LRM-TTHWTFPKRG-DWATH--DAKWRGNWSPYIPRNIMLRYSKEGDCVLDQFAGGG-TTLVEAKLLNRNIIGVDVNDVALERCREKTDFEHE---------G----------------------------AN----GKVYLKKGDARNLS---------FIPDEHVDLICTHPPYADIIKYS-ED---IEEDLSRL-KIADFLEEMKKVAGECYRVLKKDKFCAILMGDTRKKGCMVPMSFDVMKIFEEVGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- NZ47_RS07170_Anaerovibrio_lipolyticus_746146226 ------------------------------------------------MAE----------KI--I---RK----W----------------------------EPD----D-FN-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSTENDLIIDQFAGGG-TTLVEAKLLNRDIIGVDVNENAITRCKEKIAFEHE---------G----------------------------AN----GKVSLYKGDARNLD---------FIDDESIDLICTHPPYADIIKYS-ED---IPEDLSLL-KVKDFLEEMKKVAAESYRILKKDKFCAILMGDTRKKGNMVPMSFGVMKIFEEAGFKLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- G598_RS0111930_Selenomonas_ruminantium_652371152 ------------------------------------------------MAK----------K---I---TK----W----------------------------EPD----D-FE-LEM-TTHWTFPKRG-DWATH--DAKWRGNWSPYIPRNIMLRYSKEGDCVLDQFAGGG-TTLVEAKLLNRNIIGVDVNDVALERCREKTNFEHE---------G----------------------------AN----GKVYLKKGDARNLS---------FISNEHVDLICTHPPYADIIKYS-ED---IEEDLSRL-KIADFLEEMKKVAGECYRVLKKDKFCAILMGDTRKKGCMVPMSFDVMKIFEEAGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFKK|TMK------------------------------------------------- Q428_RS06840_Fervidicella_metallireducens_737398024 
-----------------------------------------------------------------M---LD----------------------------------------N-FK-LET-TTIWSFKDRG-NYYTH--KGDYPGNWSPYVPRNIILRYSKENDVVLDQFAGSG-TTLIECRLLNRIGVGCDVNEVALKMAWERTKCIQS---------K----------------------------------SKTILLKRDARNLY---------DIKDSSIDLICTHPPYSNAIKYS-ED---IEEDISLL-EYDKFLNEIVKVASECYRVLKKGKYCALLIGDIRKNGYIKPLGYETLNKFLNQSFKLKEIIIKE----QHNCR-KTEYW---KE-----------ISI---KN--NFYLIAHEYLFVFQK|---------------------------------------------------- HMPREF9629_RS00455_unclassified_Peptostreptococcaceae_497210069 ------------------------------------------------MTK--------T-K---I---TK----W----------------------------EPE----N-FE-LEM-NTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNLILRYSKENDLVLDQFAGGG-TTLVEAKLLNRNIIGVDINDIALERCKEKTSFNYD---------G----------------------------AN----GKVYINKGDARNLS---------FIQDESIDFICTHPPYANIIRYS-EN---IEGDLSCC-KIPEFLKEMQKVANESYRVLKKEKFCAILMGDTRIKGNVQPMSFEVMKIFENTGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFLLLAHEYLFIFKK|V--------------------------------------------------- DP68_RS13195_Clostridium_sp_HMP27_737327616 -----------------------------------------------------------------M---NS----------------------------------------I-KEELQC-TTIWSFKDRG-DWATH--KGDYPGNWSPYVPRNIILRYSREGDLVLDQFLGSG-TTAVEAALLNRKFIGIDINDNALLLASKRCSNYI----------N----------------------------------KNISIIKGDAKDLK---------DIKDETINLICTHPPYSNIIKYS-KY---NKDDISLL-SLEGYYKAMDKVAKECFRVLKGNSYCAILIGDTRKNGFIQPLGFNVMNSFINAGFILKEIIIKE----QHNCS-STKKW---IE-----------ISK---KR--NFLLIAHEYLFVFKK|---------ILKELPQ------------------------------------ HMPREF0889_RS01300_Megasphaera_genomosp_type_1_496777936 
------------------------------------------------MGK----------K---I---VK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNILLRYSEENDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNETALARCREKINFEHS---------G----------------------------AN----GKVYLYKGDARTLD---------FIKDNSIDLICTHPPYADIIKYS-ED---IEADLSHL-KVKDFLIAMRDVAAESYRVLKKDKFCAVLMGDTRQKGHMIPMSFEVMKLFQSAGFKLKELVIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFIFRK|---------------------------------------------------- QX51_RS12035_Terrisporobacter_othiniensis_746722773 -----------------------------------------------------------------M---NN----------------------------------------N----IET-TTIWSFPDRG-NWLTH--KGDYPGNWSPHIPKNIILRYSKEKDKVLDQFIGSG-TTLIETNRLNRIGIGSDINIEALKLCQIRVPQ------------N----------------------------------NKTYIRKQDARYLK---------LIKDNTIDLICTHPPYANIVKYS-DS---IKEDISLL-DFESYYESMKFVAQSCYRVLKPQKHCAILIGDTRKNGLIEPLGFNVMNIFLKEGFKLKEIIIKE----QHNCK-CTDKW---KE-----------LSK---QR--NFLLIAHEYLFIFKK|-----------D---------------------------------------- G598_RS0113740_Selenomonas_ruminantium_652371446 ------------------------------------------------MAK----------K---I---TK----W----------------------------EPD----D-FE-LEM-TTHWTFPKRG-DWATH--DAKWRGNWSPYIPRNIMLRYSKEGDCVLDQFAGGG-TTLVEAKLLNRNIIGVDVNDVALERCREKTNFEHE---------G----------------------------AN----GKVYLKKGDARNLS---------FISNEHVDLICTHPPYADIIKYS-ED---IEEDLSRL-KIADFLEEMKKVAGECYRVLKKDKFCAILMGDTRKKGCMVPMSFDVMKIFEEAGFTLKELIIKE----QHNCR-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- HMPREF1039_RS02295_Megasphaera_sp_UPII_199-6_494632851 
------------------------------------------------MGK----------K---I---VK----W----------------------------EPD----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNILLRYSEENDLVLDQFAGGG-TTLVEAKLLNRNIIGVDVNETALARCREKINFEHS---------G----------------------------AN----GKVYLYKGDARTLD---------FIKDNSIDLICTHPPYADIIKYS-ED---IEADLSHL-KVKDFLIAMRDVAAESYRVLKKDKFCAVLMGDTRQKGHMIPMSFEVMKLFQSAGFKLKELVIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFIFRK|---------------------------------------------------- HMPREF9286_RS08365_Peptoniphilus_harei_492766054 ------------------------------------------------MAN----------K---I---TK----W----------------------------EPE----N-FE-LEM-TTHWSFPQRG-NWATH--DAKWRGNWSPYIPRNIILRYSNEKDLILDQFAGGG-TTLVEAKLLNRNIFGIDVNDVALNRCKEKVDFEHV---------G----------------------------AD----GKVFLRKGDARNLD---------FIPDNSIDLICTHPPYANIIEYS-ED---IEEDLSRL-KIKDFLAEMKKVAAESYRVLKKDKFCAVLIGDTRQKGHMIPLSFYVMQIFEEAGFKMKEMIIKE----QHNCK-ATGFW---KT-----------NSI---KY--NFLLIAHEHLFIFRK|---------------------------------------------------- SMON_RS05265_Streptobacillus_moniliformis_754175633 -------------------------------------------------MN--------K-K---L---TK----W----------------------------EPE----N-FE-LEM-NTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNLLLRYSKENDLVLDQFAGGG-TTLVEAKLLNRDIIGVDINEVSLERCREKVNFEHE---------G----------------------------SN----GKVYIHKGDARNLD---------FISDESIDFICTHPPYANIIQYS-DN---IEEDLSHL-KIPQFLEEMKKVAFESYRVLKNDKFCAVLMGDTRIKGYMQPMSFEVMKIFESEGFKLKEIIIKE----QHNCR-ATGYW---KT-----------NSI---KY--NFLLIAHEYLFIFKK|---------------------------------------------------- CD05_RS0101160_Ruminococcus_sp_NK3A76_655060651 
-------------------------------------------------MK----------K---I---KK----W----------------------------EPD----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQEGDLVLDQFAGGG-TTLVEAKLLNRNIIGVDCNDEALTRCREKIDFDYPP--------------------------------------AQ---GKVFLYKGDARDLY---------FQSDESVDLICTHPPYADIIKYS-DG---IPEDLSQL-KVKDFLEAMKPVAAECYRVLKKGKFCAVLMGDTRQKGCMIPMSFDVMKIFQEAGFTLKELIIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|---------------------------------------------------- MR07_RS03745_Mycoplasma_ovis_568197217 --------------------------------------------MI--NRK--------------I---TK----W----------------------------EPE----N-FQ-LQT-NTLWSFPDRG-SWATH--DAKWRGNWSPYIPRNILLRYSKEGDLVLDQFAGGG-TTLVEAKLLNRNIIGIDVNGEAIKRCKEKIDFDYS---------N---------A------------------N-----GKVTTMKGDVRNLC---------FLDSDSIDLVCTHPPYADIIRYS-EG-KEIIEDLSNL-EINEFLSQMQLVAFECYRVLKKGKFCVILMEDTRKNGHMIPLSYKVMKIFEDKGFKLKELIIKV----QHNCK-TTGYW---AT-----------NSV---KY--NFLLIAHEYLFVFKK|---------------------------------------------------- J144_RS0109740_Fusobacterium_hwasookii_657696170 ------------------------------------------------MNK----------K---I---KK----W----------------------------EPD----N-FE-LEL-NSVWSFKERG-DWATH--DAKWRGNWSPYIPRNLILRYTNEKDLILDQFIGGG-TTLVEAKLLNRNIIGIDVNDVAIERCKEKINFEFE---------N----------------------------S-----GKVYINKGDARKLD---------FIKDESIDFVCTHPPYANIIEYS-EN---IEEDLSHL-KIPEFLKEMKKVASESYRVLKKDKFCAILMGDTRIKGHIQPLGFEVMKVFEGVGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFFLIAHEYLFIFKK|---------------------------------------------------- MD85_RS04325_[Clostridium]_cellulosi_736835888 
------------------------------------------------MKR--------------I---VK----W----------------------------EPD----N-FE-LEM-NTVWSFPERG-NWATH--DAKYRGNWSPYIPRNLLLRYSSKGDLVLDQFVGGG-TTLVEAKLLGRNIIGVDVNPRALERCQEKIDFDYD---------N----------------------------A-----GEVYLYNGDARNLY---------FIKNESIDFICTHPPYANIIRYS-ED---IEADLSHL-NVKDFLVEMHKVASESFRVLKKNKFCAILMGDTRKRGHVIPMSFEVMKIFESAGFKLKEIIIKE----QHNCK-ATGYW---KT-----------NSI---KY--NFLLLAHEYLFVFKK|---------------------------------------------------- Q346_RS0100700_Mycoplasma_gallinarum_653082102 -------------------------------------------------MK----------K---I---KK----W----------------------------EPE----D-FE-LEM-TTHWSFPKRG-DWATH--DAKWRGNWSPYIPRNIILRYSQENDLILDQFAGGG-TTLVEAKLLNRDIIGVDINDVAIERCKEKTAFDYQL--------------------------------------AT---GKVYIKKGDARNLD---------FIPDESIDLICTHPPYADIIKYS-EG---IDGDLSQL-KVKDFIEEMKKVASESYRVLKKDRFCAVLMGDTRQKGHMIPMSFDVMRVFEEAGFKLKELIIKE----QHNCK-ATGYW---KT-----------NSV---KY--NFLLIAHEYLFVFRK|EEK------------------------------------------------- _Candidatus_Magnetobacterium_casensis_749954601 -----------------------------------------------------------------M---KT----I----------------------------TPK----G-FA-VEK-TTVWSFKSRG-TWATH--NGNYRGNWSPYIPRNVILKYSKMHDLVLDCFCGAG-TTGVECKLLGRNFIGIDINAAAIGLATENMDFDPG---------Q--DH-----D------------------D-----ANAELFVGDARDLN---------GIGDATVDLICAHPPYADIIRYT-HD---NDKDLSAY-GVGRFLDEIDKVARESYRVLKRSGHCAILIGDMRKNKNVIPLGFRTIERYLMAGFVLKELIIKR----QHNCK-TTGFW---YN-----------NSV---KY--NFLLLAHEYLAVFVK|---------------------------------------------------- T263_RS0108015_Fusobacterium_nucleatum_696307008 
-------------------------------------MPK--------FND--------F-D---L---KN----W------------K---------------EYE----D----IYT-DTLWIIEKRD-NSGVH--TSKYHGNFVPQIPNQLFRRYTKKGEWILDPFLGSG-TSIIEAQRLGRNSIGIELQEDVLKEAYERILVEKS---------N----------------------------D-----CRGKLYIGDSKEIN-ISKILK--SNSIKKVQFIIFHPPYWDIIKFS-DK----ENDLSNSKSVEDFLSSLGKVVDNTTEYLEKNRYCSIVIGDKYENSQIVPLGFYCMNLFLERNFLLKAIIVKNFEETKGKRN-QKSIW---RY-----------RAL---AS--DFFIFKHEYIMVFKK|----I-N--------------------------------------------- HMPREF1498_RS09070_Fusobacterium_sp_CM1_696259451 -------------------------------------MPK--------FND--------F-D---L---KN----W------------K---------------EYE----D----IYT-DTLWIMEKRD-NSGVH--TSKYHGNFVPQIPNQLFRRYTKKGEWILDPFLGSG-TSIIEAQRLGRNSIGIELQEDVLKEAYERILVEKS---------N----------------------------D-----CRGKLYIGDSKEIN-ISKILK--SNSIKKVQFIIFHPPYWDIIKFS-DK----ENDLSNSKSVEDFLSSLGKVVDNTTEYLEKNRYCSIVIGDKYENSQIVPLGFYCMNLFLERNFLLKAIIVKNFEETKGKRN-QKSIW---RY-----------RAL---AS--DFFIFKHEYIMVFKK|----I-K--------------------------------------------- G550_RS0101585_Megamonas_hypermegale_738301178 -------------------------------------------------------------------------------------------MYYTRGKKQVFIMDLK----N-FK-LET-TSVWSFPDRG-NWYTH--YGDYPGNWSPYVPRNLILKYSLEKEWILDQFMGSG-TTLIEAKLLNRNIIGTDINPKAYAITKSRLNFAYD---------S----------------------------T-----SHIHIRINDAQDLS---------FIKDNSISLICTHPPYANIIKYS-AD---IKNDLSLM-SYNSYFKAMAKVAKEAHRVLKNGRICAIMVADIRKNWKFIPLGHYVINEFLKVGFILKDIIIKE----QHNCK-SLIKW---SQ-----------REH---E----CYLIKHEYIFIFQK|---------------------ESR---------------------------- BN531_00658_Eubacterium_sp_CAG:202_547826215 
-------------------------------------MAK--------YND--------I-D---M---KH----W------------K---------------EYD----D----ILT-DTLWMFDKRD-NSGVH--SASYHGNFVPQIPNQLFRRYTKKGDWILDPFMGSG-TSLIEAQRLGRNSIGIELQEDVAKNTRNLLLEEKN---------N----------------------------Y-----TKGKIIIGDSRNVN-LSEKLI--SIGIKKVQFVIYHPPYWDIIKFS-DK----KEDLSNCLSLEDFLKSFGQVIDNTVPFLEKNRYCAVVIGDKYANSEIVPLGFHCMNLFIQKGLKLKAILVKNFEETKGKAN-QKAIW---RY-----------RAL---AS--DFFIFKHEYIFVFKN|----A-QK---RGK-------------------------------------- K292_RS0108670_Anaerovorax_odorimutans_739513201 ---------------------------------------------------------------------------------------------------MILH--PN----N-FE-LER-TTIWSFSERG-SWATH--SGGYRGNWSPYVPRNLILRYSKSNDWVLDQFLGSG-TSLIEAKLLGRNAIGVDINEQAINLASSNIEFKCL---------A----------------------------N-----SKICIRLADAKKLN---------FIKSESIDLICTHPPYANIIKYS-DE---IENDLSLL-SYEEFLRAMEDVALESYRVLKRQKVCSIMIGDIRKNGNVVPLGMEVMNIFLKIGFKSKEIIIKQ----QHNCS-STPYW---RN-----------KNN---E----FLMLAHEYIFIFEK|---------------------------------------------------- H122_RS0107615_Clostridium_saccharoperbutylacetonicum_505207017 ---------------------------------------------------------------------------M------------------------------D----N-FS-QEL-TSIWSFRDRG-DWNNH--KGDYPGNCSPRVIRNLLLKYTKENDTVLDQFLGSG-TTAIEVLLLNRKIIGIDINKKALDISNCRIKDLN---------------------------------------------GNKILKVGNAEKLE----------ISNETVNFICTHPPYLDIIKYS-KD---IEGDLSLL-NKVDFYTAIKNVANECYRVLKFKSKCAIIIGDVRKKGYIEPLGFNVMNIFLSTGFLLKEIIIKE----QHNCK-STDKW---KE-----------IAK---QK--NFLLIQHEYIFVFEK|------NY---FK--------------------------------------- BN462_00619_Ruminococcus_sp_CAG:108_546656251 
-------------------------------------MGK--------YND--------L-D---L---SQ----W------------R---------------EYT----D----IET-DSLWIIDKRD-NSGAH--SGHYHGNFVPQIPHQLFSRYTKKGNWILDPFMGSG-TSLIEAQRMGRNSIGIEIQHDVAKEAYDRIYTEKN---------D----------------------------V-----VRTKVVVADSQTCD-MNKILL--SEGINKVQFVIMHPPYWDIVKFS-EN----PNDLSNCDSINEFLDSFSKVIDNSLSVLEKNRYCAVVIGDKYANSQVIPLGFYCMNLIMEKGLLLKAILVKNFGETKGKSN-KQGIW---RY-----------RAL---AS--DFYIFKHEYIFVFKK|----V-K--------------------------------------------- G594_RS0107390_Clostridium_paraputrificum_736866276 ---------------------------------------------------------------------------M------------------------------D----S-FY-VEE-TSIWSFKDRG-DWATH--RGDYPGNCSPRVVRNLLIKYTKENDVILDQFLGGG-TTAIECLLLNRKIIGIDINKNAISITQDRTRKLN---------------------------------------------GDKSLYLGDAKKLN----------LQGESVDFICTHPPYLDIIKYS-NN---IKDDLSLL-RKEEFYSAMLEVATESFRVLKKHSRCAVIIGDVRKKGYIEPLGFTVMNIFINARFLLKEIIIKE----QHNCK-NTEKW---RE-----------IAK---KK--NFLLIQHEYIFVFEK|------SY---SN--------------------------------------- BN584_00043_Clostridium_sp_CAG:277_548245492 -------------------------------------MAK--------YND--------L-D---P---KK----W------------K---------------EYS----D----INT-DSLWLIEKRD-NSGAH--SGDYHGNFVPQIPHQLFTRYTKKGDWILDPFMGSG-TSLIEAQRLGRNSIGIDLQPDVVQEAEERIRTEQR---------K----------------------------N-----CIVRTVTGDSRTVN-IEEVMS--SVGIDKLQFVMMHPPYWDIIKFS-DN----EKDLSNTSTLDEFLESFGQVIDNSTKYLEKNRYCACVIGDKYANSQVIPLGFYCMNQFMERGFLLKAILVKNFGETKGKAN-QQGIW---RY-----------RAI---TN--DFYIFKHEYIFVFKK|----V-K--------------------------------------------- CHY_RS10255_Carboxydothermus_hydrogenoformans_499664364 
IANNYFVTQKFIKEVVKHS----QSLLKE--------EEK----NY-TPVN--------N-K---P---RT----W----------------------------APE----N-FS-LET-TTVWSFPDRG-SWATH--SGKYRGNWSPFIPRNIILRYSKEGEVVLDQFVGSG-TTLVEAKLLKRKGIGVDINPEAVSLTLKNTNFEIE---------E----------------------------G-----GEIEVRVGDARNLY---------FLKDESIDLICTHPPYSNIIKYS-DN---IEGDLSHF-DVNDFLLEMEKVAKECYRVLKKGKFCAILIGDTRRKGYIIPIGFSVMEIFRKIGFKLKEIIIKE----QHNCS-STGYW---RN-----------QSI---KY--NFLLIAHEYLFVFKK|---------------------------------------------------- CLOSCI_RS12555_[Clostridium]_scindens_490745127 ------------------------------------------------------------------------M--FLLSELGIILFYFN---GQKKKGEIFIREAPE----K-FK-LED-TTIWSFPERG-SWATH--SGKYRGNWSPYIPRNLILRYSKKKDWILDQFLGSG-TTLIEAKLLGRNAIGVDINSEAVKLSNTNLNFTCQ---------E----------------------------R-----SKIFTKQGNANNLS---------FIKDESIDLICTHPPYADIIRYS-KE---IPGDISHL-KYEKFLKELEQVARESYRVLKRQGICAIMIGDIRRKGYVLPLGMNSMQKFVEAGFKLKEIIIKE----QHNCR-SAYYW---EG-----------RER---K----FLMLAHEYIFILEK|-----------------TDCYNSM---------------------------- YSBL_RS07445_Ruminiclostridium_thermocellum_489613307 TAANNDVTCAFVKKVAKES----TICLEE--------KSK----SY-FADK--------L-N---I---KS----W----------------------------EPE----N-FN-LET-TTVWSFPDRG-DWATH--SGKYRGNWSPFIPRNVILRYSKEGETVLDQFVGSG-TTLVEAKLLKRKGIGVDINPEAVNLTCRNINFEKE---------D----------------------------C-----GETEVHVGDARHLG---------FIKDESIDLICTHPPYSNIIKYS-ED---IEGDLSHC-DINEFLVEMEKVAKESYRVLKKGRFCAILIGDTRRKGHMIPIGFNVMQTFLRAGFKLKEIVIKE----QHNCS-STGYW---RN-----------QSI---KY--NFLLIAHEYLFIFRK|---------------------------------------------------- COPRO5265_1445_Coprothermobacter_proteolyticus_DSM_5265_206738469 
VAQKNSLTERFVKGVLGYR----TL-VKE--------EMT----NY-NVSK--------A-K---T---EI----W----------------------------EPE----D-FS-LET-TTVWGFPDRG-DWATH--SGKYRGNWSPYIPRNVILRYSNENDVVLDQFVGSG-TTLVEAKLLGRRGLGVDINPDAVKLALSNVNFEHK--------------------------------------C-----GLADVHIGDARNLD---------FVKDSSIDLICTHPPYSNIIKYS-DN---IEGDLSHY-DIPEFLKEMYKVASESYRVLKRGRFCAVLMGDTRRKGNIIPLGFRVMEVFCKAGLTLKEIVIKE----QHNCT-STGYW---KK-----------QSI---KY--NFLLIAHEYLFIFKK|---------------------------------------------------- consensus/100% ...........................................................................................................................a.h..ps....s+...s.a.ush.P...p..h..ao.....lls.h.G....p..bs....R..........sh..s.............................................................s....................hp.hh.HPPY.shl.as...........u.......a...h.....p....hp....hsh.h.D.........hs...h..h....h.....lhK.....b.....................................h....aE....h.p|.................................................... consensus/95% ........................................................................................................................so.ash..Ru.pa.oH...spa.Gsa.Pb.s+.hhb+ao...-.lLs.hhG.G.TshlEsbbL.Rp.huhDls..sh..s.pp.........................................................ssspph.............spphp.lh.HPPY.shl.ao.p.......DlS.h.p...a...h..hh.c..RlLK....hsl.hGD.+.p..h.Phu...hp.hb..GF.hc-.llK.....b...p............................p....Fhhh.HEalh.hbK|.................................................... consensus/90% .................................................................................................................h......so.ash..Ru.pWuTH...spa+Gsa.Pb.sRphll+Yop..-.lLs.hhG.G.TshlEsblL.Rp.lGhDlN..ul.bsbpp..h................................................p.....uDucpl.............spphc.lhsHPPY.shlpYS.p.......DlS.h.ph.pa...h.plh.E.aRlLK.sp.hsl.hGD.Rbp..h.Phua.hhp.abp.GF.LcE.llK.....Q..hp.s..b.......................p...sFhhh.HEalh.hcK|.................................................... 
consensus/85% ...........................................................................h.....................................ap.bp..so.Wsh.pRu.pWATH...upa+GNWsPblsRpllL+Yopp.-.lLD.hhGuG.TohlEsbLL.Rp.lGlDlN..ul.bsbpphpa................................................p..l..uDu+pLp..........h.spolDhlhoHPPY.sIIpYS.p....l..DLS.h.ph.cFh..hppVh.E.aRVLK.s+bhslhhGD.Rbp..h.Phua.hhp.abp.GF.L+E.lIK.....Q+php.sp.hW.................s....p...sFhlh.HEalhlFcK|.................................................... consensus/80% ...........................................................................h.............................sp....p.Fp.lp..sohWsFPpRG.pWATH..puca+GNWsPblsRsllL+Yopc.-.lLD.hhGuG.TThlEsbLL.Rp.IGlDlN..Al.bsbcphpFp...............................................c.bl..uDARpLp..........l.spolDhlhoHPPY.sIIpYS.p....lp.DLS.h.plpcFh.bhpcVh.EsaRVLK.s+bCulhhGDsRbp..hlPhua.lhp.Fbp.GF.L+E.IIKb....Q+pCp.spshW................ps....pb..sFhll.HEalalF+K|.................................................... consensus/75% ......................................................................p....h.............................sp....p.Fp.lc..oThWoFPcRG.pWATH..sucaRGNWuPblsRNllL+Yopcs-.lLDpFhGuG.TTllEsKLLsRp.IGlDlN..Al.bsbcphsFp..............................................sc.bl..GDARpLs.........blpspolDhIhTHPPYhsIIpYS.ps...lp.DLSph.plp-FhpbhpcVhpEsaRVLK.s+aCullhGDsRbps.hlPluaplhphFbp.GF.LKEbIIKb....Q+pCp.upshW...p............ps....pb..sFhLlhHEalFlF+K|.................................................... 
consensus/70% ......................................................................p....h............................pPp....p.Fp.LE..oThWSFPcRG.pWATH..sucaRGNWuPblPRNlILRYopcsDhlLDpFhGuG.TTLlEAKLLsRphIGlDlN..AlpbsbcplsFp.p............................................schbl.bGDARpLs.........blpDpSIDhIhTHPPYhsIIcYS.cs...lc.DLSph.plp-Flpchc+VupEsaRVLKbs+aCAlLhGDsRbcsahlPluFpVMplFbpsGF.LKEhIIKb....QHNC+.usshW...pp...........ps....ch..sFhLlhHEaLFlF+K|....................................................
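The consensus/N% rows above summarize each alignment column at a conservation threshold. As a rough illustration of that idea only, the following Python sketch reports a residue when a single amino acid reaches the threshold in a column and a dot otherwise. It is a simplification under stated assumptions: the residue-class symbols used in the real consensus rows (for example h, p, s) are not reproduced, gaps are counted against conservation, and the function name and toy alignment are hypothetical.

```python
from collections import Counter


def threshold_consensus(rows, threshold):
    """Return one consensus character per alignment column.

    A column yields its most common residue when that residue occurs in at
    least `threshold` (a fraction between 0 and 1) of ALL rows, so gaps count
    against conservation. Every other column becomes '.'.
    Simplification: residue classes (hydrophobic, polar, ...) are ignored.
    """
    if not rows:
        return ""
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("all aligned rows must have the same length")

    consensus = []
    for column in zip(*rows):
        counts = Counter(c for c in column if c != "-")
        if counts:
            residue, n = counts.most_common(1)[0]
            if n / len(rows) >= threshold:
                consensus.append(residue)
                continue
        consensus.append(".")
    return "".join(consensus)


# Hypothetical toy alignment, loosely modelled on the HPPY region.
toy = [
    "CTHPPYAN",
    "CTHPPYSN",
    "FHHPPYWD",
    "CAHPPY-D",
]

print(threshold_consensus(toy, 0.7))  # C.HPPY..
print(threshold_consensus(toy, 1.0))  # ..HPPY..
```

Raising the threshold leaves only the invariant positions, which is why the HPPY core persists across the consensus rows while flanking positions drop out.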
General notes
The novel fungal adenine methylase is present in a variety of early-branching fungi such as Spizellomyces punctatus (a euchytrid), Conidiobolus coronatus (Entomophthorales), Coemansia reversa (Zygomycota proper), the Mucoromycotina and Rhizophagus irregularis (Glomeromycota). The phyletic pattern suggests that the domain was acquired early in an ancestral fungus and lost independently on multiple occasions, including in Batrachochytrium, Ascomycetes and Basidiomycetes. Fungal DAMs are fused to multiple domains including Chromo, DNMT3-like finger, ZZ finger, PHD, GATA, AT-hooks and the KRI domain, suggesting a strong association with chromatin. The C-terminal methylase lacks the characteristic PPY motif in strand-4, suggesting that it is inactive. The KRI domain is inserted between strand-3 and strand-4 of the C-terminal methylase. These methylases are characterized by an HPPY motif in strand-4. Additional conserved structural elements and motifs include a very typical strand-helix unit before the methylase core, which contains a highly conserved histidine and arginine between the two elements. Additionally, they contain a D** in strand-1, an E in the helix before strand-2 and an R** before strand-2, a universal D** after strand-2, the HPPY motif in strand-4, an R**xxK motif in the helix before strand-5, a D after strand-5, a K after strand-6, an HE** before strand-7 and a K** after strand-7.
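The strand-4 motif described in these notes can be checked with a simple pattern scan. The Python sketch below is illustrative, not the procedure used for this analysis: it reports whether an ungapped sequence contains HPPY, only the PPY core, or neither. The function name and the two altered fragments are hypothetical; the first fragment is taken from the alignment above. A missing motif is only suggestive of inactivity, as the notes themselves phrase it.

```python
import re

# Strand-4 motif named in the notes, plus the looser PPY core.
STRICT_MOTIF = re.compile(r"HPPY")
CORE_MOTIF = re.compile(r"PPY")


def strand4_motif_status(seq):
    """Classify a sequence by its strand-4 motif.

    Gap characters are removed first, so the returned position is a
    0-based index into the ungapped sequence.
    Returns ("HPPY", pos), ("PPY only", pos) or ("absent", None).
    """
    ungapped = seq.replace("-", "").upper()
    match = STRICT_MOTIF.search(ungapped)
    if match:
        return ("HPPY", match.start())
    match = CORE_MOTIF.search(ungapped)
    if match:
        return ("PPY only", match.start())
    return ("absent", None)


# Fragment copied from the alignment (strand-4 region of a bacterial homolog).
print(strand4_motif_status("NSIDLICTHPPYANIIKYS"))   # ('HPPY', 8)
# Hypothetical fragments, altered by hand for illustration only.
print(strand4_motif_status("KVQFIISPPYWDIIKFS"))     # ('PPY only', 7)
print(strand4_motif_status("NSIDLICTHAAYANIIKYS"))   # ('absent', None)
```

Run on both methylase domains of a fungal protein, a scan like this would be expected to find the motif in one domain and report it absent from the C-terminal copy, matching the prediction in the notes.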
# 1; Eukaryotic versions

| GI | Architecture | Gene_name | Len | Taxonomy | Species | Genbank |
|---|---|---|---|---|---|---|
| Bcir1000010688 | DNMT3-Trebleclef+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase+N6-MTase(KRI) | Bcir1000010688 | 2188 | eukaryota>fungi>mucoromycotina | Backusella circina | estExt_fgenesh1_pm.C_3310003 |
| Uram1000000474 | GATA(frag)+N6-MTase+N6-MTase(KRI) | Uram1000000474 | 482 | eukaryota>fungi>mucoromycotina | Umbelopsis ramanniana | fgenesh1_kg.2_#_546_#_combest_scaffold_2_37280 |
| Crev1000002507 | CHROMO+CHROMO+CHROMO+AT-hook+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase+N6-MTase(KRI) | Crev1000002507 | 2200 | eukaryota>fungi>kickxellomycotina | Coemansia reversa | fgenesh1_pg.10_#_86 |
| 552908586 | GATA+N6-MTase++N6-MTase(KRI) | GLOINDRAFT_316719 | 649 | eukaryota>fungi>glomeromycota | Rhizophagus irregularis DAOM 181602 | hypothetical protein GLOINDRAFT_316719 [Rhizophagus irregularis DAOM 181602]. |
| 595436939 | CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase++N6-MTase(KRI) | RirG_262840 | 2470 | eukaryota>fungi>glomeromycota | Rhizophagus irregularis DAOM 197198w | hypothetical protein RirG_262840 [Rhizophagus irregularis DAOM 197198w]. |
| Ccor1000008322 | ZZ+GATA+N6-MTase++N6-MTase(KRI) | Ccor1000008322 | 597 | eukaryota>fungi>entomophthoromycota | Conidiobolus coronatus | CE35304_1103 |
| Spun1000004719 | CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase+N6-MTase(KRI) | Spun1000004719 | 1591 | eukaryota>fungi>chytridiomycota | Spizellomyces punctatus | Spizellomyces punctatus DAOM BR117 hypothetical protein (1591 aa) |
| Pbla1000013272 | DNMT3-Trebleclef+CHROMO+CHROMO+AT-hook+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase+N6-MTase(KRI) | Pbla1000013272 | 2382 | eukaryota>fungi>basal | Phycomyces blakesleeanus | estExt_fgeneshPB_pg.C_100091 |
| 671688888 | DNMT3-Trebleclef+CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase+N6-MTase(KRI) | LRAMOSA00608 | 2236 | eukaryota>fungi | Absidia idahoensis var. thermophila | hypothetical protein LRAMOSA00608 [Absidia idahoensis var. thermophila]. |
| 511008850 | CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+N6-MTase+MTase(KRI) | HMPREF1544_03082 | 1600 | eukaryota>fungi | Mucor circinelloides f. circinelloides 1006PhL | hypothetical protein HMPREF1544_03082 [Mucor circinelloides f. circinelloides 1006PhL]. |
| 758351301 | CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ-like+GATA+N6-MTase+MTase(KRI) | MAM1_0127c06017 | 1640 | eukaryota>fungi | Mucor ambiguus | DNA N6-MTase N-4 [Mucor ambiguus]. |
| 729708575 | DNMT3-Trebleclef+CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+SAM-N6-MTase+MTase(KRI) | RMCBS344292_09167 | 1919 | eukaryota>fungi | Rhizopus microsporus | hypothetical protein RMCBS344292_09167 [Rhizopus microsporus]. |
| 729703045 | DNMT3-Trebleclef+CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTaseMTase(KRI) | RMCBS344292_14260 | 1878 | eukaryota>fungi | Rhizopus microsporus | hypothetical protein RMCBS344292_14260 [Rhizopus microsporus]. |
| 384485890 | DNMT3-Trebleclef+CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+SAM-N6-MTase+MTase(KRI) | RO3G_02774 | 1914 | eukaryota>fungi | Rhizopus delemar RA 99-880 | hypothetical protein RO3G_02774 [Rhizopus delemar RA 99-880]. |
| 727142291 | ZZ+PHD+ZZ+GATA+N6-MTaseMTase(KRI) | RMATCC62417_10446 | 1039 | eukaryota>fungi | Rhizopus microsporus | hypothetical protein RMATCC62417_10446 [Rhizopus microsporus]. |
| 727142293 | ZZ+ZZ+GATA+N6-MTaseMTase(KRI) | RMATCC62417_10446 | 861 | eukaryota>fungi | Rhizopus microsporus | hypothetical protein RMATCC62417_10446 [Rhizopus microsporus]. |
| 727142292 | ZZ+ZZ+GATA+N6-MTaseMTase(KRI) | RMATCC62417_10446 | 856 | eukaryota>fungi | Rhizopus microsporus | hypothetical protein RMATCC62417_10446 [Rhizopus microsporus]. |
| 661176173 | DNMT3-Trebleclef+CHROMO+CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase MTase(KRI) | LCOR_11540.1 | 2275 | eukaryota>fungi | Lichtheimia corymbifera JMRC:FSU:9682 | dna N6-MTase [Lichtheimia corymbifera JMRC:FSU:9682]. |
| 672819038 | CHROMO+CHROMO+CHROMO+ZZ+PHD+PHD+ZZ+GATA+N6-MTase+MTase(KRI) | MVEG_09762 | 2147 | eukaryota>fungi | Mortierella verticillata NRRL 6337 | hypothetical protein MVEG_09762 [Mortierella verticillata NRRL 6337]. |
| 758369443 | CHROMO+CHROMO+ZZ+GATA+SAM-N6-MTase+MTase(KRI) | PARPA_01280.1 | 2216 | eukaryota>fungi | Parasitella parasitica | hypothetical protein [Parasitella parasitica]. |

# 146; Bacterial homologs
GI Gene neighborhoods Arch Pfam-arch Gene name Len Taxonomy Species name Genbank
557946003 N6-MTase*-><-DpnII<-DpnII<-DpnM-N6-MTase<-Phage_integrase N6-MTase UPF0020+N6_N4_Mtase MBMB1_0500 364 archaea>euryarchaeota Methanobacterium sp. MB1 DNA methylase N-4/N-6 domain protein [Methanobacterium sp. MB1]. <-557945996_?||557945997_?-><-557945998_?<-557945999_?<-557946000_?<-557946001_?<-557946002_?||557946003_N6-MTase*-><-557946004_DpnII<-557946005_DpnII<-557946006_DpnM-N6-MTase<-557946007_Phage_integrase<-557946008_?||557946009_?-><-557946010_?
499219160 <-N6-MTase* N6-MTase N6_N4_Mtase+Methyltransf_26 TVN0442 347 archaea>euryarchaeota Thermoplasma volcanium DNA methyltransferase [Thermoplasma volcanium]. <-13541266_?||13541267_?-><-13541268_?<-13541269_?<-13541270_?||13541271_?->13541272_?-><-499219160_N6-MTase*||13541274_?->13541275_?-><-13541276_?<-13541277_?||13541278_?->13541279_?->13541280_?->
546147902 N6-MTase*-> N6-MTase Methyltransf_26 AMDU1_APLC00004G0008 346 archaea>euryarchaeota environmental samples MULTISPECIES: DNA adenine modification methylase [environmental samples]. 546151579_?-><-546151580_?<-546147897_?<-546151581_?<-546147899_?||546147900_?->546147901_?->546147902_N6-MTase*-><-546151582_?<-546147904_?<-546147905_?<-546147906_?<-546151583_?<-546151584_?||546147910_?->
500164270 <-N6-MTase* N6-MTase N6_N4_Mtase+N6_N4_Mtase Smar_0588 342 archaea>crenarchaeota Staphylothermus marinus DNA methylase [Staphylothermus marinus]. 
126465487_?->126465488_?-><-126465489_?<-126465490_?<-126465491_?<-126465492_?<-126465493_?<-500164270_N6-MTase*||126465495_?->126465496_?->126465497_?->126465498_?-><-126465499_?||126465500_?->126465501_?-> 731481703 <-N6-MTase*||DpnII-> N6-MTase N6_N4_Mtase+Methyltransf_26 Mpt1_c10100 341 archaea>euryarchaeota Candidatus Methanoplasma termitum modification methylase MjaII [Candidatus Methanoplasma termitum]. 731481696_?-><-731481697_?<-731481698_?<-731481699_?||731481700_?->731481701_?->731481702_?-><-731481703_N6-MTase*||731481704_DpnII-><-731481705_?<-731481706_?<-731481707_?<-731481708_?<-731481709_?<-731481710_? 504550199 N6-MTase*-> N6-MTase N6_N4_Mtase+N6_N4_Mtase TCELL_0627 340 archaea>crenarchaeota Thermogladius cellulolyticus DNA methylase [Thermogladius cellulolyticus]. <-389860942_?||389860943_?->389860944_?->389860945_?-><-389860946_?<-389860947_?||389860948_?->504550199_N6-MTase*->389860950_?->389860951_?->389860952_?->389860953_?-><-389860954_?<-389860955_?||389860956_?-> 502907573 <-N6-MTase* N6-MTase N6_N4_Mtase+N6_N4_Mtase Shell_0210 338 archaea>crenarchaeota Staphylothermus hellenicus DNA methylase [Staphylothermus hellenicus]. <-297526219_?||297526220_?->297526221_?-><-297526222_?||297526223_?->297526224_?->297526225_?-><-502907573_N6-MTase*||297526227_?->297526228_?-><-297526229_?<-297526230_?<-297526231_?<-297526232_?<-297526233_? 530785168 - N6-MTase Methyltransf_26 330 archaea>crenarchaeota Thermofilum sp. 1910b hypothetical protein [Thermofilum sp. 1910b]. 737178046 <-N6-MTase*<-DpnM-N6-MTase<-?<-Spermidine-synthase N6-MTase SP+N6_N4_Mtase+Methyltransf_26 Y919_RS08595 330 bacteria>firmicutes Caloranaerobacter azorensis DNA methylase N-4 [Caloranaerobacter azorensis]. 
737178066_?-><-737178067_?<-737178040_?<-737178041_?<-737178042_?<-737178044_?<-737178046_N6-MTase*<-737178049_DpnM-N6-MTase<-737178051_?<-737178052_Spermidine-synthase<-737178053_?<-737178054_?||737178056_?->737178057_?-> 489613307 <-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase SP+Methyltransf_26 YSBL_RS07445 329 bacteria>firmicutes Ruminiclostridium thermocellum DNA methylase N-4 [Ruminiclostridium thermocellum]. 489613325_?-><-489613323_?<-489613321_?<-489613315_?<-489613313_?<-489613311_?<-489613309_?<-489613307_N6-MTase*<-489613306_DpnII<-489613301_DpnM-N6-MTase||489613300_?-><-489613298_?<-489613295_?<-489613292_?<-739434939_? 490598869 <-N6-MTase*<-DpnM-N6-MTase||?-><-?<-?<-?||?->METHYLASE-> N6-MTase SP+Methyltransf_26 CTHBC1_RS07545 329 bacteria>firmicutes Ruminiclostridium thermocellum DNA methylase N-4 [Ruminiclostridium thermocellum]. 490598858_?-><-490598859_?<-489613321_?<-500163438_?<-500163439_?<-490598866_?<-490598867_?<-490598869_N6-MTase*<-489613301_DpnM-N6-MTase||489613300_?-><-553726118_?<-490598882_?<-739435388_?||739445830_?->553726119_METHYLASE-> 499664364 <-DpnII<-N6-MTase*<-DpnM-N6-MTase N6-MTase SP+Methyltransf_26 CHY_RS10255 329 bacteria>firmicutes Carboxydothermus hydrogenoformans DNA methylase N-4 [Carboxydothermus hydrogenoformans]. <-753782109_?<-499664357_?<-499664359_?<-499664360_?<-499664361_?<-736527008_?<-753782391_DpnII<-499664364_N6-MTase*<-753782392_DpnM-N6-MTase<-499664366_?<-499664367_?||499664368_?-><-499664369_?<-499664370_?<-499664371_? 653611723 <-ABC-ATPase<-?<-?||?-><-?<-?<-?<-N6-MTase*<-DpnII<-DpnM-N6-MTase<-?<-Methylase N6-MTase SP+Methyltransf_26 CLOCLA_RS0112375 329 bacteria>firmicutes [Clostridium] clariflavum DNA methylase N-4 [[Clostridium] clariflavum]. 
<-653611718_ABC-ATPase<-653611719_?<-504022761_?||653611720_?-><-759895769_?<-653611721_?<-653611722_?<-653611723_N6-MTase*<-653611724_DpnII<-653611725_DpnM-N6-MTase<-653611726_?<-504022770_Methylase<-504022771_?||653611727_?->653611728_?-> 740456070 <-N6-MTase*<-DpnM-N6-MTase N6-MTase SP+N6_N4_Mtase+Methyltransf_26 JCM21531_RS04440 329 bacteria>firmicutes [Clostridium] straminisolvens DNA methylase N-4 [[Clostridium] straminisolvens]. <-740456184_?<-740456070_N6-MTase*<-740456072_DpnM-N6-MTase<-740456075_?<-740456077_?<-740456079_?<-740456080_?<-740456187_?<-740456082_? 206738469 <-METHYLASE<-?<-PLD+SFII-helicase<-?<-?<-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase Methyltransf_26 COPRO5265_1445 327 bacteria>firmicutes Coprothermobacter proteolyticus DSM 5265 DNA methylase N-4 [Coprothermobacter proteolyticus DSM 5265]. 639380273_?-><-206737760_?<-206739175_METHYLASE<-206737708_?<-206738620_PLD+SFII-helicase<-639380274_?<-206738701_?<-206738469_N6-MTase*<-206738522_DpnII<-206738073_DpnM-N6-MTase<-206738705_?<-639380275_?||639380276_?-><-639380277_?<-206737751_? 526884977 <-N6-MTase* N6-MTase Methyltransf_26 CSUB_C0599 325 archaea Candidatus Caldiarchaeum subterraneum DNA methylase N-4/N-6 [Candidatus Caldiarchaeum subterraneum]. 557694027_?-><-557694028_?<-557694029_?||557694030_?->557694031_?->557694032_?-><-557694033_?<-526884977_N6-MTase*<-557694035_?<-557694036_?||557694037_?-><-557694038_?<-557694039_?<-557694040_?<-557694041_? 496184606 DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase SP+N6_N4_Mtase+Methyltransf_26 CAAU_RS08905 322 bacteria>firmicutes Caloramator australicus DNA methylase N-4 [Caloramator australicus]. 
749868668_?->496184600_?->496184601_?->496184602_?->496184603_?->749868670_DpnM-N6-MTase->496184605_DpnII->496184606_N6-MTase*-><-496184607_?||749868035_?->749868780_?->496184610_?->749868781_?->496184614_?->749868037_?-> 503327799 <-N6-MTase* N6-MTase N6_N4_Mtase+N6_N4_Mtase Desmu_0935 319 archaea>crenarchaeota Desulfurococcus mucosus DNA methylase [Desulfurococcus mucosus]. 320101121_?->320101122_?->320101123_?->320101124_?->320101125_?->320101126_?->320101127_?-><-503327799_N6-MTase*||320101129_?-><-320101130_?||320101131_?-><-320101132_?||320101133_?-><-320101134_?<-320101135_? 756971970 <-N6-MTase*||?->?->?->?->REase-4-> N6-MTase Methyltransf_26 I774_RS01625 319 archaea Aigarchaeota archaeon JGI 0000106-J15 DNA methylase [Aigarchaeota archaeon JGI 0000106-J15]. 756971964_?-><-756971969_?<-756971970_N6-MTase*||756971971_?->756971965_?->756971966_?->756971967_?->756971968_REase-4->756971972_?-> 504580152 <-ABC-ATPase||N6-MTase*-><-?||?->?->?->?->?->Pribosyltran-> N6-MTase N6_N4_Mtase+Methyltransf_26 Desfe_0448 318 archaea>crenarchaeota Desulfurococcus fermentans DNA methylase [Desulfurococcus fermentans]. 390938183_?->390938184_?-><-390938185_?||390938186_?->390938187_?-><-390938188_?<-390938189_ABC-ATPase||504580152_N6-MTase*-><-390938191_?||390938192_?->390938193_?->390938194_?->390938195_?->390938196_?->390938197_Pribosyltran-> 501637311 <-Pribosyltran<-?<-?<-?<-?||?->?-><-N6-MTase*||?->ABC-ATPase-> N6-MTase N6_N4_Mtase+Methyltransf_26 DKAM_0485 317 archaea>crenarchaeota Desulfurococcus kamchatkensis DNA methylase [Desulfurococcus kamchatkensis]. 
<-218883789_Pribosyltran<-218883790_?<-218883791_?<-218883792_?<-218883793_?||218883794_?->218883795_?-><-501637311_N6-MTase*||218883797_?->218883798_ABC-ATPase->218883799_?-><-218883800_?<-218883801_?<-218883802_?||218883803_?-> 756979360 <-Pribosyltran<-?<-?<-?<-?||?-><-N6-MTase*||ABC-ATPase-> N6-MTase N6_N4_Mtase+Methyltransf_26 SPHMEL_RS03490 317 archaea>crenarchaeota Desulfurococcus amylolyticus DNA methylase [Desulfurococcus amylolyticus]. <-756979355_?<-756979356_Pribosyltran<-756979357_?<-756979358_?<-756979917_?<-501637305_?||756979359_?-><-756979360_N6-MTase*||756979361_ABC-ATPase->756979362_?-><-756979363_?<-756979918_?<-756979364_?||756979365_?-><-756979366_? 504370902 N6-MTase*-> N6-MTase N6_N4_Mtase+Methyltransf_26 FFONT_0867 314 archaea>crenarchaeota Fervidicoccus fontis DNA methylase N-4 [Fervidicoccus fontis]. 385805902_?-><-385805903_?<-385805904_?||385805905_?->385805906_?-><-385805907_?<-385805908_?||504370902_N6-MTase*-><-385805910_?||385805911_?->385805912_?->385805913_?-><-385805914_?<-385805915_?||385805916_?-> 206739986 <-ABC-ATPase<-DpnII<-N6-MTase* N6-MTase Methyltransf_26 DICTH_1800 311 bacteria>dictyoglomi Dictyoglomus thermophilum H-6-12 putative DNA methylase [Dictyoglomus thermophilum H-6-12]. <-206740402_?<-206739665_?||206740399_?-><-206739566_?<-206740100_?<-206741023_ABC-ATPase<-206740340_DpnII<-206739986_N6-MTase*<-206741073_?<-206739739_?<-206739820_?||206739604_?-><-206740154_?||206740432_?->206739563_?-> 502895170 N6-MTase*->?->DCM-> N6-MTase N6_N4_Mtase+N6_N4_Mtase Tagg_1290 311 archaea>crenarchaeota Thermosphaera aggregans DNA methylase [Thermosphaera aggregans]. <-296243011_?<-296243012_?||296243013_?->296243014_?->296243015_?-><-296243016_?<-296243017_?||502895170_N6-MTase*->296243019_?->296243020_DCM->296243021_?-><-296243022_?||296243023_?-><-296243024_?<-296243025_? 643385301 <-DpnM-N6-MTase<-DpnII<-N6-MTase* N6-MTase N6_N4_Mtase+N6_N4_Mtase D891_RS0103440 306 bacteria>proteobacteria>deltaproteobacteria Hippea sp. 
KM1 DNA methylase [Hippea sp. KM1]. 643385287_?->643385289_?->643385291_?->643385293_?->643385295_?-><-737585466_DpnM-N6-MTase<-643385299_DpnII<-643385301_N6-MTase*||643385304_?->643385305_?->643385306_?->737585802_?->661256721_?-><-643385310_?<-643385312_? 502864863 N6-MTase*-> N6-MTase N6_N4_Mtase+Methyltransf_26 Metin_0423 305 archaea>euryarchaeota Methanocaldococcus infernus DNA methylase N-4 [Methanocaldococcus infernus]. <-296109101_?<-296109102_?<-296109103_?<-296109104_?||296109105_?->296109106_?-><-296109107_?||502864863_N6-MTase*->296109109_?-><-296109110_?<-296109111_?<-296109112_?<-296109113_?<-296109114_?<-296109115_? 754082338 <-ABC-ATPase<-DpnII<-N6-MTase* N6-MTase UPF0020 DICTH_RS08720 304 bacteria>dictyoglomi Dictyoglomus thermophilum DNA methylase [Dictyoglomus thermophilum]. 754082337_?->501542015_?->501542972_?-><-501542137_?<-501542672_?<-501543600_ABC-ATPase<-501542913_DpnII<-754082338_N6-MTase*<-501543650_?<-501542311_?<-501542392_?<-501542727_?<-754082042_?<-754082049_?<-754082339_? 746331486 N6-MTase*-><-DpnII<-DpnII<-DpnM-N6-MTase<-Phage_integrase N6-MTase UPF0020+N6_N4_Mtase MBMB1_RS02390 303 archaea>euryarchaeota Methanobacterium sp. MB1 hypothetical protein [Methanobacterium sp. MB1]. 746330837_?-><-746331483_?<-566004529_?||566004530_?-><-566004531_?<-566004532_?<-566004533_?||746331486_N6-MTase*-><-746331489_DpnII<-746331493_DpnII<-746331496_DpnM-N6-MTase<-746331499_Phage_integrase<-566004541_?||566004542_?-><-566004543_? 290559536 N6-MTase*->DpnM-N6-MTase-> N6-MTase N6_N4_Mtase+Methyltransf_26 BJBARM5_0369 297 archaea Candidatus Parvarchaeum acidophilus ARMAN-5 Methyltransferase type 11 [Candidatus Parvarchaeum acidophilus ARMAN-5]. 290559529_?-><-290559530_?||290559531_?-><-290559532_?<-290559533_?<-290559534_?<-290559535_?||290559536_N6-MTase*->290559537_DpnM-N6-MTase-><-290559538_?<-290559539_?||290559540_?->290559541_?->290559542_?-><-290559543_? 
374851611 N6-MTase*-> N6-MTase Methyltransf_26 HGMM_F16H05C22 293 bacteria>aquificae uncultured Aquificae bacterium DNA methylase [uncultured Aquificae bacterium]. <-374851604_?<-374851605_?||374851606_?-><-374851607_?||374851608_?->374851609_?->374851610_?->374851611_N6-MTase*-><-374851612_?<-374851613_?<-374851614_?<-374851615_?<-374851616_?||374851617_?->374851618_?-> 502729540 N6-MTase*-> N6-MTase Methyltransf_26 HYDTH_RS09530 293 bacteria>aquificae Hydrogenobacter thermophilus DNA methylase N-4 [Hydrogenobacter thermophilus]. <-502729533_?||502729534_?-><-502729535_?||502729536_?->502729537_?->502729538_?->502729539_?->502729540_N6-MTase*-><-502729541_?<-502729542_?<-502729543_?<-502729544_?<-502729545_?<-502729546_?||502729547_?-> 551115149 - N6-MTase Methyltransf_26 292 bacteria Candidatus Calescibacterium nevadense RNA methyltransferase [Candidatus Calescibacterium nevadense]. 518679720 N6-MTase*-><-DpnM-N6-MTase N6-MTase N6_N4_Mtase+Methyltransf_26 FACI_IFERC00001G0455 284 archaea>euryarchaeota Ferroplasma acidarmanus hypothetical protein [Ferroplasma acidarmanus]. 518651809_?-><-518651810_?||518651811_?->518651812_?-><-518651813_?<-518651814_?||518651815_?->518679720_N6-MTase*-><-518651817_DpnM-N6-MTase<-518651818_?<-518651819_?<-518651820_?<-518651821_?<-518651822_?||518651823_?-> 383110164 DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase Methyltransf_26 Ferpe_1711 278 bacteria>thermotogae Fervidobacterium pennivorans DSM 9078 DNA methylase [Fervidobacterium pennivorans DSM 9078]. 383110157_?->383110158_?->383110159_?->383110160_?->383110161_?->383110162_DpnM-N6-MTase->383110163_DpnII->383110164_N6-MTase*->383110165_?->383110166_?-><-383110167_?<-383110168_?<-383110169_?<-383110170_?<-383110171_? 546149073 ASCH->?->?->?->?->?->?->N6-MTase*-> N6-MTase N6_N4_Mtase AMDU3_IPLC00003G0029 272 archaea>euryarchaeota Thermoplasmatales archaeon I-plasma hypothetical protein [Thermoplasmatales archaeon I-plasma]. 
546149066_ASCH->546149067_?->546149068_?->546149069_?->546149070_?->546149071_?->546149072_?->546149073_N6-MTase*->546149074_?->546149075_?->546149076_?-><-546149077_?||546149078_?-><-546149079_?<-546149080_? 490745127 N6-MTase*->?->DpnM-N6-MTase-> N6-MTase N6_N4_Mtase+Methyltransf_11 CLOSCI_RS12555 271 bacteria>firmicutes [Clostridium] scindens DNA methylase N-4 [[Clostridium] scindens]. 490745118_?->490745119_?->748651615_?->490745121_?->490745122_?->496543445_?->490745127_N6-MTase*->490745128_?->490745129_DpnM-N6-MTase->490745130_?->490745132_?->490745133_?->490745134_?->748651604_?-> 496543443 <-DpnM-N6-MTase<-?<-N6-MTase* N6-MTase N6_N4_Mtase+Methyltransf_11 HMPREF0993_RS14405 271 bacteria>firmicutes Lachnospiraceae bacterium 5_1_57FAA DNA methylase N-4 [Lachnospiraceae bacterium 5_1_57FAA]. <-496543439_?<-496543440_?<-496543441_?<-769170250_?<-496543442_?<-490745129_DpnM-N6-MTase<-490745128_?<-496543443_N6-MTase*<-496543445_?<-490745122_?<-496543446_?<-769170261_?<-490745119_?<-496543448_?<-496543449_? 333759614 MACRODOMAIN->?->?->?->?->?->?->N6-MTase*->DpnM-N6-MTase->?->N6-MTase->DpnII->Primase+SNF+PLD-> N6-MTase N6_N4_Mtase+Methyltransf_26 HMPREF9124_1256 270 bacteria>firmicutes Oribacterium sp. oral taxon 108 str. F0425 putative DNA (cytosine-5-)-methyltransferase [Oribacterium sp. oral taxon 108 str. F0425]. 333759038_MACRODOMAIN->333759543_?->333759315_?->333758850_?->333758658_?->333759566_?->333759020_?->333759614_N6-MTase*->333760338_DpnM-N6-MTase->333758772_?->333760010_N6-MTase->333760591_DpnII->333760705_Primase+SNF+PLD->333758828_?->333759066_?-> 701167223 DpnM-N6-MTase->DpnII->N6-MTase*-><-?<-ABC-ATPase N6-MTase N6_N4_Mtase+N6_N4_Mtase NA23_RS09565 268 bacteria>thermotogae Fervidobacterium islandicum DNA methylase N-4 [Fervidobacterium islandicum]. 
701167713_?->701167218_?->701167219_?-><-701167220_?<-701167221_?||701167714_DpnM-N6-MTase->701167222_DpnII->701167223_N6-MTase*-><-701167224_?<-701167715_ABC-ATPase||701167225_?-><-701167226_?||737436023_?->701167227_?->701167228_?-> 493349121 <-N6-MTase*<-MutH N6-MTase Methyltransf_26 L21TH_RS00440 267 bacteria>firmicutes Caldisalinibacter kiritimatiensis sensor protein fixL [Caldisalinibacter kiritimatiensis]. 493349112_?->493349115_?->736405791_?-><-493349117_?<-493349118_?<-493349119_?<-493349120_?<-493349121_N6-MTase*<-493349122_MutH 504218926 N6-MTase-><-?||?-><-?<-?<-?||DpnM-N6-MTase->N6-MTase*-><-?||?->MACRODOMAIN->?->Ploop+REase-> N6-MTase UPF0020+N6_N4_Mtase Mtc_1443 267 archaea>euryarchaeota Methanocella conradii RNA methyltransferase [Methanocella conradii]. 383319865_N6-MTase-><-383319866_?||383319867_?-><-383319868_?<-383319869_?<-383319870_?||383319871_DpnM-N6-MTase->504218926_N6-MTase*-><-383319873_?||383319874_?->383319875_MACRODOMAIN->383319876_?->383319877_Ploop+REase->383319878_?->383319879_?-> 503063371 <-ABC-ATPase||?-><-?<-?<-?<-?<-N6-MTase*<-DpnII<-DpnII<-DpnM-N6-MTase N6-MTase N6_N4_Mtase+N6_N4_Mtase TTHE_RS09375 266 bacteria>firmicutes Thermoanaerobacterium thermosaccharolyticum RNA methyltransferase [Thermoanaerobacterium thermosaccharolyticum]. <-503063364_?<-503063365_ABC-ATPase||503063366_?-><-503063367_?<-503063368_?<-503063369_?<-753831842_?<-503063371_N6-MTase*<-753831957_DpnII<-753831958_DpnII<-503063372_DpnM-N6-MTase<-503063373_?<-503063374_?||753831959_?->503063376_?-> 547826215 <-N6-MTase*<-?<-DpnII-like-REase||?-><-?<-ABC-ATPase N6-MTase N6_N4_Mtase+Methyltransf_26 BN531_00658 266 bacteria>firmicutes Eubacterium sp. CAG:202 DNA modification methylase [Eubacterium sp. CAG:202]. 547826208_?->547826209_?->547826210_?->547826211_?->547826212_?->547826213_?->547826214_?-><-547826215_N6-MTase*<-547826216_?<-547826217_DpnII-like-REase||547826218_?-><-547826219_?<-547826220_ABC-ATPase<-547826221_?<-547826222_? 
752594110 DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase Methyltransf_26 FERPE_RS08390 263 bacteria>thermotogae Fervidobacterium pennivorans DNA methylase N-4, partial [Fervidobacterium pennivorans]. 504265091_?->504265092_?->504265093_?->504265094_?->504265095_?->752594108_DpnM-N6-MTase->752594109_DpnII->752594110_N6-MTase*->504265099_?->504265100_?-><-504265101_?<-504265102_?<-504265103_?<-504265104_?<-504265105_? 546656251 <-N6-MTase*<-DpnII-like-REase N6-MTase Methyltransf_26 BN462_00619 262 bacteria>firmicutes Ruminococcus sp. CAG:108 DNA methylase N-4/N-6 domain protein [Ruminococcus sp. CAG:108]. 546656244_?-><-546656245_?<-546656246_?<-546656247_?||546656248_?->546656249_?->546656250_?-><-546656251_N6-MTase*<-546656252_DpnII-like-REase<-546656253_?<-546656254_?<-546656255_?<-546656256_?<-546656257_?<-546656258_? 548245492 DpnII-like-REase->N6-MTase*-> N6-MTase N6_N4_Mtase BN584_00043 262 bacteria>firmicutes Clostridium sp. CAG:277 DNA adenine modification methylase [Clostridium sp. CAG:277]. <-548245485_?<-548245486_?<-548245487_?<-548245488_?<-548245489_?<-548245490_?||548245491_DpnII-like-REase->548245492_N6-MTase*-><-548245493_?<-548245494_?<-548245495_?<-548245496_?||548245497_?->548245498_?-><-548245499_? 696259451 <-N6-MTase*<-DpnII-like-REase N6-MTase N6_N4_Mtase HMPREF1498_RS09070 262 bacteria>fusobacteria Fusobacterium sp. CM1 hypothetical protein [Fusobacterium sp. CM1]. <-696259449_?||696259450_?-><-696259451_N6-MTase*<-696259452_DpnII-like-REase 696307008 <-N6-MTase*<-DpnII-like-REase N6-MTase N6_N4_Mtase T263_RS0108015 262 bacteria>fusobacteria Fusobacterium nucleatum hypothetical protein [Fusobacterium nucleatum]. <-658616850_?<-658616852_?<-658616855_?<-658616858_?<-492627213_?<-696307007_?<-658616862_?<-696307008_N6-MTase*<-658616866_DpnII-like-REase 703200955 <-N6-MTase* N6-MTase SP+Methyltransf_26 N469_RS0107485 262 bacteria Marinimicrobia MULTISPECIES: DNA methylase N-4, partial [Marinimicrobia]. 
<-703200955_N6-MTase*||551208956_?->551208955_?->655262737_?-><-551208953_?<-661311063_?<-703200953_?<-661311064_? 489963634 <-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase Methyltransf_26 TTHWC1_RS07155 259 bacteria>firmicutes Thermoanaerobacter MULTISPECIES: RNA methyltransferase [Thermoanaerobacter]. 490535680_?-><-490535682_?<-490535684_?<-490535685_?<-490535686_?<-489966042_?||490535687_?-><-489963634_N6-MTase*<-490535690_DpnII<-490535692_DpnM-N6-MTase||489963637_?-><-490535694_?<-490535698_?<-490535699_?<-490535700_? 490206260 <-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase Methyltransf_26 H17AP60334_RS10630 259 bacteria>thermotogae Thermosipho africanus RNA methyltransferase [Thermosipho africanus]. 740200843_?->490206258_?-><-490206260_N6-MTase*<-490206262_DpnII<-490206263_DpnM-N6-MTase||490206264_?->490206265_?->490206266_?->490206268_?->490206269_?-> 501003459 <-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase Methyltransf_26 TMEL_RS01630 259 bacteria>thermotogae Thermosipho melanesiensis RNA methyltransferase [Thermosipho melanesiensis]. <-501003453_?<-501003454_?||752786045_?->501003455_?->501003456_?->501003457_?->752786296_?-><-501003459_N6-MTase*<-501003460_DpnII<-501003461_DpnM-N6-MTase<-501003463_?<-752786297_?<-501003465_?<-501003466_?<-501003467_? 502759633 DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase Methyltransf_26 THIT_RS02440 259 bacteria>firmicutes Thermoanaerobacter italicus RNA methyltransferase [Thermoanaerobacter italicus]. 502759626_?->502759627_?->502759628_?->502759629_?->502759630_?->502759631_DpnM-N6-MTase->502759632_DpnII->502759633_N6-MTase*-><-502759634_?||502759635_?->502759636_?->502759637_?->502759638_?->502759639_?->502759640_?-> 658480004 <-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase Methyltransf_26 M663_RS0111020 259 bacteria>firmicutes Thermoanaerobacter sp. A7A DNA methylase N-4 [Thermoanaerobacter sp. A7A]. 
<-658479990_?<-658479992_?<-658479994_?<-658479996_?<-658479998_?<-658480000_?||658480002_?-><-658480004_N6-MTase*<-658480006_DpnII<-658480009_DpnM-N6-MTase<-658480011_?<-658480013_?<-658480016_?<-658480018_?<-658480020_? 658542249 - N6-MTase Methyltransf_26 259 bacteria EM3 bacterium JGI 0000106-B10 hypothetical protein [EM3 bacterium JGI 0000106-B10]. 694165517 DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase Methyltransf_26 TKV_c04890 259 bacteria>firmicutes Thermoanaerobacter kivui DNA methylase N-4/N-6 domain protein [Thermoanaerobacter kivui]. 694165510_?->694165511_?->694165512_?->694165513_?->694165514_?->694165515_DpnM-N6-MTase->694165516_DpnII->694165517_N6-MTase*-><-694165518_?||694165519_?->694165520_?->694165521_?->694165522_?->694165523_?-><-694165524_? 757582161 DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase Methyltransf_26 THYS13_RS12835 259 bacteria>firmicutes Thermoanaerobacter sp. YS13 DNA methylase N-4 [Thermoanaerobacter sp. YS13]. 757582157_?->757582158_?->757582317_?->757582318_?->757582319_?->757582159_DpnM-N6-MTase->757582160_DpnII->757582161_N6-MTase*-><-757582162_?||757582163_?->757582164_?->757582165_?->757582166_?->757582320_?-><-757582167_? 658522169 N6-MTase*->?->?->?->?->N6-MTase-> N6-MTase Methyltransf_26 I825_RS0102520 258 bacteria Atribacteria bacterium SCGC AAA255-G05 DNA methylase N-4, partial [Atribacteria bacterium SCGC AAA255-G05]. <-658522163_?<-658522164_?||658522165_?->658522166_?-><-658522167_?||658522168_?->658522169_N6-MTase*->658522170_?->658522171_?->658522172_?->658522173_?->658522174_N6-MTase->658522176_?->658522177_?-> 738305516 <-N6-MTase* N6-MTase SP+Methyltransf_26 Q355_RS15275 258 bacteria>deinococci Meiothermus cerbereus DNA methylase N-4, partial [Meiothermus cerbereus]. 654400345_?->654400346_?->654400347_?->654400348_?->654400349_?->654400350_?-><-738305515_?<-738305516_N6-MTase*<-654400352_?<-738305505_?<-654400354_?<-738305517_?||738305518_?->654400372_?-><-654400356_? 
504218923 DpnM-N6-MTase->N6-MTase*-><-?||?-><-?<-?<-?||DpnM-N6-MTase->N6-MTase-> N6-MTase UPF0020+N6_N4_Mtase Mtc_1434 256 archaea>euryarchaeota Methanocella conradii RNA methyltransferase [Methanocella conradii]. 383319858_?->383319859_?-><-383319860_?<-383319861_?<-383319862_?<-383319863_?||383319864_DpnM-N6-MTase->504218923_N6-MTase*-><-383319866_?||383319867_?-><-383319868_?<-383319869_?<-383319870_?||383319871_DpnM-N6-MTase->383319872_N6-MTase-> 517992866 <-DpnM-N6-MTase<-?<-N6-MTase* N6-MTase N6_N4_Mtase+Methyltransf_11 HGRM_RS15055 255 bacteria>firmicutes Ruminococcus sp. JC304 DNA methylase N-4 [Ruminococcus sp. JC304]. <-517992859_?<-517992860_?<-517992861_?<-517992862_?<-517992863_?<-517992864_DpnM-N6-MTase<-517992865_?<-517992866_N6-MTase*<-769258210_?<-517992871_?<-517992872_?<-517992873_?||769258211_?-><-517992878_?<-517992879_? 503374808 <-N6-MTase* N6-MTase SP+Methyltransf_26 MSUIS_RS03915 253 bacteria>tenericutes Mycoplasma suis DNA methylase N-4 [Mycoplasma suis]. <-503374801_?<-503374802_?<-503374803_?<-503374804_?<-503374805_?<-503374806_?<-503374807_?<-503374808_N6-MTase*<-503374809_?<-503374810_?<-763029221_?||503374812_?->503374813_?-><-503374814_?<-503374815_? 738301178 N6-MTase*->DpnM-N6-MTase->DpnII-><-?<-?||METHYLASE-> N6-MTase Methyltransf_26 G550_RS0101585 253 bacteria>firmicutes Megamonas hypermegale DNA methylase N-4 [Megamonas hypermegale]. 654417979_?->654417980_?->654417981_?->654417982_?-><-654417983_?<-654417984_?<-738301202_?||738301178_N6-MTase*->654417986_DpnM-N6-MTase->738301180_DpnII-><-738301182_?<-738301184_?||654417990_METHYLASE->654417991_?->654417992_?-> 742671763 - N6-MTase Methyltransf_26 253 bacteria Gracilibacteria bacterium JGI 0000069-K10 hypothetical protein [Gracilibacteria bacterium JGI 0000069-K10]. 757126172 <-N6-MTase*<-DpnII<-DpnM-N6-MTase||?-><-?<-?<-ABC-ATPase N6-MTase Methyltransf_26 MB41_RS06705 253 bacteria>firmicutes Anaerosalibacter sp. ND1 DNA methylase N-4 [Anaerosalibacter sp. ND1]. 
<-757126039_?<-757126041_?<-757126043_?<-757126045_?<-757126047_?||757126049_?->757126051_?-><-757126172_N6-MTase*<-757126053_DpnII<-757126055_DpnM-N6-MTase||757126057_?-><-757126059_?<-757126061_?<-757126063_ABC-ATPase<-757126065_? 406901678 <-N6-MTase*<-?<-?<-DpnII<-?<-?<-DpnM-N6-MTase N6-MTase Methyltransf_26 ACD_71C00187G0001 251 bacteria uncultured bacterium (gcode 4) hypothetical protein ACD_71C00187G0001 [uncultured bacterium (gcode 4)]. <-406901678_N6-MTase*<-406901679_?<-406901680_?<-406901681_DpnII<-406901682_?<-406901683_?<-406901684_DpnM-N6-MTase||406901685_?-> 505364599 - N6-MTase SP+Methyltransf_26 251 bacteria>proteobacteria>betaproteobacteria Taylorella asinigenitalis DNA methylase [Taylorella asinigenitalis]. 568197217 N6-MTase*-> N6-MTase Methyltransf_26 MR07_RS03745 251 bacteria>tenericutes Mycoplasma ovis DNA methylase N-4 [Mycoplasma ovis]. <-568197213_?<-568197214_?<-763016747_?<-763016748_?<-763016749_?||763016750_?-><-568197216_?||568197217_N6-MTase*-><-763016751_?||568197219_?->763016752_?->568197221_?-><-568197222_?||568197223_?-><-568197224_? 652371152 <-N6-MTase*<-DpnII N6-MTase UPF0020+N6_N4_Mtase G598_RS0111930 251 bacteria>firmicutes Selenomonas ruminantium DNA methylase N-4 [Selenomonas ruminantium]. <-739504480_?<-739504482_?<-652371147_?<-652371148_?<-652371149_?||652371150_?-><-652371151_?<-652371152_N6-MTase*<-652371153_DpnII 740284293 <-N6-MTase*<-DpnII<-HTH+DpnM-N6-MTase N6-MTase SP+Methyltransf_26 HMPREF1504_RS04555 251 bacteria>firmicutes Veillonella sp. ICM51a DNA methylase N-4 [Veillonella sp. ICM51a]. <-740284278_?<-740284279_?<-740284280_?||740284281_?->740284286_?->740284289_?-><-740284291_?<-740284293_N6-MTase*<-740284295_DpnII<-740284300_HTH+DpnM-N6-MTase<-740284301_?<-740284302_?<-740284306_?<-491520967_?<-491525280_? 754482757 <-N6-MTase*||?->?-><-DpnM-N6-MTase N6-MTase UPF0020 I759_RS06660 251 archaea>euryarchaeota Euryarchaeota archaeon SCGC AAA252-I15 hypothetical protein [Euryarchaeota archaeon SCGC AAA252-I15]. 
754482742_?->754482744_?->754482746_?->754482749_?->754482751_?->754482753_?->754482755_?-><-754482757_N6-MTase*||754482934_?->754482760_?-><-754482937_DpnM-N6-MTase<-754482763_?||754482765_?->754482767_?->754482770_?-> 497210069 <-ParB<-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase N6_N4_Mtase+Methyltransf_26 HMPREF9629_RS00455 250 bacteria>firmicutes unclassified Peptostreptococcaceae (miscellaneous) MULTISPECIES: DNA methylase N-4 [unclassified Peptostreptococcaceae (miscellaneous)]. <-497210062_?<-497210063_?<-497210064_?<-497210065_?<-497210066_?<-497210067_?<-497210068_ParB<-497210069_N6-MTase*<-497210070_DpnII<-497210071_DpnM-N6-MTase<-497210072_?<-497210073_?<-497210074_?<-497210075_?<-497210076_? 653082102 <-N6-MTase*<-?<-?<-MutH<-HTH+DpnM-N6-MTase N6-MTase Methyltransf_26 Q346_RS0100700 250 bacteria>tenericutes Mycoplasma gallinarum DNA methylase N-4 [Mycoplasma gallinarum]. 653082096_?->653082097_?->653082098_?->738487736_?->653082099_?->653082100_?-><-653082101_?<-653082102_N6-MTase*<-653082103_?<-738487739_?<-653082105_MutH<-653082106_HTH+DpnM-N6-MTase||653082107_?->653082108_?->653082109_?-> 268315129 <-N6-MTase* N6-MTase N6_N4_Mtase+Methyltransf_26 Smon_1038 249 bacteria>fusobacteria Streptobacillus moniliformis DSM 12112 putative RNA methylase [Streptobacillus moniliformis DSM 12112]. <-268315122_?<-268315123_?||268315124_?-><-268315125_?<-268315126_?<-268315127_?<-268315128_?<-268315129_N6-MTase*<-268315130_?<-268315131_?<-268315132_?<-268315133_?<-268315134_?<-268315135_?<-268315136_? 490232946 <-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase N6_N4_Mtase+Methyltransf_26 HMPREF1586_RS06065 249 bacteria>actinobacteria Gardnerella vaginalis DNA methylase N-4 [Gardnerella vaginalis]. <-514910845_?<-514910846_?<-490232946_N6-MTase*<-490232945_DpnII<-514910847_DpnM-N6-MTase<-696236856_? 
491499778 <-ABC-ATPase<-?<-Pribosyltran<-?||?->?->?-><-N6-MTase*<-?<-DpnII<-?<-DpnM-N6-MTase N6-MTase N6_N4_Mtase+Methyltransf_26 G397_RS0107800 249 bacteria>firmicutes [Eubacterium] siraeum DNA methylase N-4 [[Eubacterium] siraeum]. <-491494962_ABC-ATPase<-491494960_?<-491494958_Pribosyltran<-491494955_?||518492361_?->518492362_?->491499780_?-><-491499778_N6-MTase*<-491499771_?<-491499769_DpnII<-491499767_?<-491499766_DpnM-N6-MTase<-769258050_?||491499757_?->491499748_?-> 505332319 - N6-MTase N6_N4_Mtase+Methyltransf_26 249 bacteria>firmicutes [Eubacterium] siraeum DNA methylase [[Eubacterium] siraeum]. 515155565 <-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase N6_N4_Mtase+Methyltransf_26 HMPREF1583_RS06235 249 bacteria>actinobacteria Gardnerella vaginalis hypothetical protein [Gardnerella vaginalis]. <-696236014_?<-515155563_?<-515155565_N6-MTase*<-515155567_DpnII<-490232944_DpnM-N6-MTase<-515155570_?||696235466_?->515155571_?-><-515155573_?<-515155575_? 515278394 DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase N6_N4_Mtase+Methyltransf_26 HMPREF1580_RS03105 249 bacteria>actinobacteria Gardnerella vaginalis hypothetical protein [Gardnerella vaginalis]. 515278393_?->490232944_DpnM-N6-MTase->490232945_DpnII->515278394_N6-MTase*->490232948_?->515278396_?-> 547820585 <-N6-MTase*<-?<-DpnII<-Mrr_cat<-HTH+DpnM-N6-MTase N6-MTase N6_N4_Mtase+Methyltransf_26 BN647_00380 249 bacteria>firmicutes Firmicutes bacterium CAG:41 DNA methylase [Firmicutes bacterium CAG:41]. <-547820578_?<-547820579_?<-547820580_?||547820581_?->547820582_?->547820583_?-><-547820584_?<-547820585_N6-MTase*<-547820586_?<-547820587_DpnII<-547820588_Mrr_cat<-547820589_HTH+DpnM-N6-MTase<-547820590_?<-547820591_?<-547820592_? 548315511 <-N6-MTase*<-DpnII<-HTH+DpnM-N6-MTase N6-MTase Methyltransf_26 BN720_00766 249 bacteria>firmicutes Eubacterium sp. CAG:581 DNA methylase [Eubacterium sp. CAG:581]. 
548315508_?->548315509_?-><-548315510_?<-548315511_N6-MTase*<-548315512_DpnII<-548315513_HTH+DpnM-N6-MTase 652339649 <-N6-MTase*||DpnM-N6-MTase->wHTH+REase-DpnII-> N6-MTase SP+Methyltransf_26 LEPGO_RS0100700 249 bacteria>fusobacteria Leptotrichia goodfellowii DNA methylase N-4 [Leptotrichia goodfellowii]. 652339642_?->652339643_?->652339644_?->652339645_?->652339646_?->652339647_?->738097603_?-><-652339649_N6-MTase*||652339650_DpnM-N6-MTase->652339651_wHTH+REase-DpnII->652339652_?-><-652339653_?<-652339654_?<-652339655_?<-652339656_? 746146226 <-METHYLASE<-?<-?<-?||?-><-?<-?<-N6-MTase*<-DpnII<-DpnM-N6-MTase<-?<-ABC-ATPase<-?<-ABC-ATPase N6-MTase Methyltransf_26 NZ47_RS07170 249 bacteria>firmicutes Anaerovibrio lipolyticus DNA methylase N-4 [Anaerovibrio lipolyticus]. <-746146211_METHYLASE<-746146240_?<-746146214_?<-746146217_?||653148288_?-><-746146221_?<-653148286_?<-746146226_N6-MTase*<-746146229_DpnII<-746146242_DpnM-N6-MTase<-746146231_?<-746146245_ABC-ATPase<-746146233_?<-746146235_ABC-ATPase<-746146249_? 406967845 DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase Methyltransf_26 ACD_28C00322G0004 248 bacteria uncultured bacterium hypothetical protein ACD_28C00322G0004 [uncultured bacterium]. 406967842_?->406967843_DpnM-N6-MTase->406967844_DpnII->406967845_N6-MTase*->406967846_?-> 488666942 <-N6-MTase*<-HNH<-?<-DpnII<-?<-HTH+DpnM-N6-MTase N6-MTase N6_N4_Mtase+Methyltransf_26 HMPREF1090_RS26280 248 bacteria>firmicutes [Clostridium] clostridioforme hypothetical protein [[Clostridium] clostridioforme]. 488666936_?-><-488666938_?||740461489_?->740461506_?->740461507_?->740461509_?-><-740461490_?<-488666942_N6-MTase*<-740461510_HNH<-488666944_?<-488666945_DpnII<-488666946_?<-488666947_HTH+DpnM-N6-MTase<-488666948_?<-488666949_? 490963643 DpnM-N6-MTase->DpnII->N6-MTase*-><-?||ABC-ATPase-> N6-MTase Methyltransf_26 HMPREF9131_RS04375 248 bacteria>firmicutes Peptoniphilus MULTISPECIES: DNA methylase N-4 [Peptoniphilus]. 
496704092_?->496703981_?->496704035_?->496704093_?-><-496704027_?||738855338_DpnM-N6-MTase->496703967_DpnII->490963643_N6-MTase*-><-490963751_?||496704011_ABC-ATPase->496704063_?->496704052_?->496703991_?-> 490965414 DpnM-N6-MTase->wHTH+REase-DpnII->N6-MTase*-> N6-MTase Methyltransf_26 HMPREF1630_RS01030 248 bacteria>firmicutes Anaerococcus lactolyticus DNA methylase N-4 [Anaerococcus lactolyticus]. 739466286_?->490965420_?->490965419_?->490965418_?->739466289_?->739466291_DpnM-N6-MTase->739466292_wHTH+REase-DpnII->490965414_N6-MTase*-> 492766054 <-N6-MTase*<-wHTH+REase-DpnII<-DpnM-N6-MTase N6-MTase Methyltransf_26 HMPREF9286_RS08365 248 bacteria>firmicutes Peptoniphilus harei DNA methylase N-4 [Peptoniphilus harei]. <-492766020_?<-492766045_?<-492765924_?<-492766041_?<-492766025_?<-750245934_?||492766021_?-><-492766054_N6-MTase*<-492766035_wHTH+REase-DpnII<-492766033_DpnM-N6-MTase<-492766005_?||492766011_?->492766051_?-><-492765968_?<-492766003_? 493205676 HTH+DpnM-N6-MTase->DpnII->?->N6-MTase*-> N6-MTase Methyltransf_26 SELSP_RS03130 248 bacteria>firmicutes Selenomonas sputigena DNA methylase N-4 [Selenomonas sputigena]. <-493205689_?||493205688_?->493205687_?->493205686_?->493205685_HTH+DpnM-N6-MTase->739508203_DpnII->493205681_?->493205676_N6-MTase*->503505981_?-><-503505982_?||493205670_?->493205663_?-><-493205364_?<-493205363_?<-493205362_? 494632851 <-N6-MTase*<-DpnII<-DpnM-N6-MTase<-Radical_SAM N6-MTase Methyltransf_26 HMPREF1039_RS02295 248 bacteria>firmicutes Megasphaera sp. UPII 199-6 DNA methylase N-4 [Megasphaera sp. UPII 199-6]. 494632843_?-><-494632817_?||494632828_?->494632849_?-><-494632855_?<-494632825_?<-494632827_?<-494632851_N6-MTase*<-494632821_DpnII<-494632835_DpnM-N6-MTase<-494632831_Radical_SAM<-494632848_?<-494632839_?<-494632818_?<-494632853_? 494634458 DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase N6_N4_Mtase+Methyltransf_26 HMPREF1040_RS02760 248 bacteria>firmicutes Megasphaera sp. UPII 135-E DNA methylase N-4 [Megasphaera sp. UPII 135-E]. 
494634456_?->494634450_?-><-738304381_?||738304383_?-><-738304385_?||494634454_DpnM-N6-MTase->494634445_DpnII->494634458_N6-MTase*->494634451_?->738304386_?->494634457_?-><-494634453_?<-494634452_?||494634448_?-> 496777936 <-N6-MTase*<-DpnII<-DpnM-N6-MTase<-Radical_SAM N6-MTase N6_N4_Mtase+Methyltransf_26 HMPREF0889_RS01300 248 bacteria>firmicutes Megasphaera genomosp. type_1 DNA methylase N-4 [Megasphaera genomosp. type_1]. 494632843_?-><-496777933_?||494632828_?->496777934_?-><-496777935_?<-494632825_?<-494632827_?<-496777936_N6-MTase*<-496777937_DpnII<-496777938_DpnM-N6-MTase<-494632831_Radical_SAM<-494632848_?<-494632839_?<-496777939_?<-494632853_? 517953989 DpnM-N6-MTase->wHTH+REase-DpnII->N6-MTase*-> N6-MTase Methyltransf_26 HGPG_RS01070 248 bacteria>firmicutes Peptoniphilus grossensis DNA methylase N-4, partial [Peptoniphilus grossensis]. 517953982_?-><-517953983_?<-517953984_?<-517953985_?||517953986_?->517953987_DpnM-N6-MTase->517953988_wHTH+REase-DpnII->517953989_N6-MTase*-><-517953990_?||517953991_?->517953992_?->738841560_?->738841562_?->738841762_?->517953996_?-> 547265226 Radical_SAM-><-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase Methyltransf_26 BN820_00869 248 bacteria>proteobacteria>alphaproteobacteria Acetobacter sp. CAG:977 sensor protein fixL [Acetobacter sp. CAG:977]. 547265219_?->547265220_?->547265221_?->547265222_?-><-547265223_?<-547265224_?||547265225_Radical_SAM-><-547265226_N6-MTase*<-547265227_DpnII<-547265228_DpnM-N6-MTase||547265229_?-><-547265230_?<-547265231_?||547265232_?-> 547865125 DpnM-N6-MTase->?->DpnII->?->N6-MTase*-><-?<-?<-?||?->Pribosyltran->?->ABC-ATPase-> N6-MTase Methyltransf_26 BN788_01674 248 bacteria>firmicutes Eubacterium siraeum CAG:80 DNA methylase [Eubacterium siraeum CAG:80]. 
<-547865118_?<-547865119_?||547865120_?->547865121_DpnM-N6-MTase->547865122_?->547865123_DpnII->547865124_?->547865125_N6-MTase*-><-547865126_?<-547865127_?<-547865128_?||491494955_?->547865129_Pribosyltran->547865130_?->505332324_ABC-ATPase-> 652371446 <-N6-MTase* N6-MTase UPF0020+N6_N4_Mtase G598_RS0113740 248 bacteria>firmicutes Selenomonas ruminantium DNA methylase N-4 [Selenomonas ruminantium]. 652371439_?-><-652371440_?<-652371441_?<-652371442_?<-739504782_?<-652371444_?<-652371445_?<-652371446_N6-MTase* 652933168 <-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase N6_N4_Mtase+Methyltransf_26 G497_RS0101670 248 bacteria>proteobacteria>deltaproteobacteria Desulfovibrio cuneatus DNA methylase N-4 [Desulfovibrio cuneatus]. 652933161_?->652933162_?->652933163_?-><-652933164_?<-652933165_?||652933166_?->652933167_?-><-652933168_N6-MTase*<-652933169_DpnII<-652933170_DpnM-N6-MTase<-652933171_?<-652933172_?<-652933173_?<-652933174_?<-652933175_? 654856343 <-N6-MTase*<-wHTH+REase-DpnII<-DpnM-N6-MTase N6-MTase UPF0020+N6_N4_Mtase K364_RS0114940 248 bacteria>firmicutes Desulfitibacter alkalitolerans DNA methylase N-4 [Desulfitibacter alkalitolerans]. <-654856335_?||654856336_?-><-654856337_?<-654856338_?<-737286368_?<-654856340_?<-654856341_?<-654856343_N6-MTase*<-737286550_wHTH+REase-DpnII<-654856345_DpnM-N6-MTase<-654856346_?<-654856347_?<-654856348_?<-654856349_?<-654856350_? 737952516 HTH+DpnM-N6-MTase->DpnII->N6-MTase*-><-?<-?<-?<-?||N6-MTase-> N6-MTase Methyltransf_26 FUSO3_RS09465 248 bacteria>fusobacteria Fusobacterium necrophorum DNA methylase N-4 [Fusobacterium necrophorum]. 737971124_?->737952519_HTH+DpnM-N6-MTase->737952518_DpnII->737952516_N6-MTase*-><-737952514_?<-737952511_?<-737952508_?<-492764123_?||492765822_N6-MTase->492764117_?->737971127_?-> 754175633 <-N6-MTase* N6-MTase N6_N4_Mtase+Methyltransf_26 SMON_RS05265 248 bacteria>fusobacteria Streptobacillus moniliformis DNA methylase N-4 [Streptobacillus moniliformis]. 
<-502622237_?<-502622238_?||502622239_?-><-502622240_?<-502622241_?<-502622242_?<-502622243_?<-754175633_N6-MTase*<-754175102_?<-502622246_?<-502622247_?<-502622248_?<-502622249_?<-502622250_?<-502622251_? 406986924 DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase Methyltransf_26 ACD_18C00096G0009 247 bacteria uncultured bacterium hypothetical protein ACD_18C00096G0009 [uncultured bacterium]. <-406986917_?<-406986918_?||406986919_?->406986920_?->406986921_?->406986922_DpnM-N6-MTase->406986923_DpnII->406986924_N6-MTase*->406986925_?-> 492571686 <-N6-MTase*<-Mrr_cat<-DpnII N6-MTase UPF0020+N6_N4_Mtase FNV_RS04020 247 bacteria>fusobacteria Fusobacterium nucleatum DNA methylase N-4 [Fusobacterium nucleatum]. 740585784_?->492571673_?-><-492571675_?<-492571678_?<-492571680_?<-492571682_?<-740585788_?<-492571686_N6-MTase*<-492571690_Mrr_cat<-740585790_DpnII 492605690 <-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase SP+Methyltransf_26 HMPREF1501_RS03285 247 bacteria>fusobacteria Fusobacterium sp. OBRC1 DNA methylase N-4 [Fusobacterium sp. OBRC1]. 492596573_?-><-696257839_?<-696257842_?<-696257846_?<-696257853_?<-696257855_?<-696257856_?<-492605690_N6-MTase*<-696257867_DpnII<-696257868_DpnM-N6-MTase<-696257859_?<-492605693_?||696257861_?-> 492656844 <-N6-MTase*<-Mrr_cat<-DpnII<-DpnM-N6-MTase<-PHP+DNApolIIIa<-?<-?<-SNF N6-MTase UPF0020+N6_N4_Mtase HMPREF1497_RS08290 247 bacteria>fusobacteria Fusobacterium MULTISPECIES: DNA methylase N-4 [Fusobacterium]. 492652534_?-><-496079217_?<-552904728_?<-552904729_?<-552904730_?<-552904731_?<-696263596_?<-492656844_N6-MTase*<-492571690_Mrr_cat<-492656846_DpnII<-552904732_DpnM-N6-MTase<-492653383_PHP+DNApolIIIa<-552904733_?<-492653385_?<-492653388_SNF 495977000 SNF->?->PHP+DNApolIIIa->?->?->DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase N6_N4_Mtase+UPF0020+N6_N4_Mtase FSDG_RS00930 247 bacteria>fusobacteria Fusobacterium nucleatum DNA methylase N-4 [Fusobacterium nucleatum]. 
495977008_SNF->495977006_?->495977005_PHP+DNApolIIIa->511541204_?->495977003_?->495977002_DpnM-N6-MTase->492632699_DpnII->495977000_N6-MTase*->696266057_?->495976997_?->495976996_?->495976995_?->495976994_?->511541205_?-><-495976992_? 496069501 <-Ploop+REase||DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase SP+Methyltransf_26 FSAG_RS07575 247 bacteria>fusobacteria Fusobacterium periodonticum DNA methylase N-4 [Fusobacterium periodonticum]. 496069507_?->496069506_?->496069505_?->492793635_?-><-496069504_Ploop+REase||496069503_DpnM-N6-MTase->496069502_DpnII->496069501_N6-MTase*->737488074_?->496069499_?->496069498_?->496069497_?->492810497_?->496069496_?->496069495_?-> 496073207 <-N6-MTase*<-DpnII<-DpnM-N6-MTase<-PHP+DNApolIIIa<-?<-?<-SNF N6-MTase N6_N4_Mtase+Methyltransf_26 HMPREF0405_RS07885 247 bacteria>fusobacteria Fusobacterium nucleatum DNA methylase N-4 [Fusobacterium nucleatum]. 496073200_?-><-496073201_?<-496073202_?<-496073203_?<-496073204_?<-496073205_?<-740654570_?<-496073207_N6-MTase*<-496073208_DpnII<-496073209_DpnM-N6-MTase<-496073210_PHP+DNApolIIIa<-496073211_?<-644872653_?<-496073213_SNF<-492573569_? 496296625 - N6-MTase N6_N4_Mtase+UPF0020+N6_N4_Mtase 247 bacteria>fusobacteria Fusobacterium nucleatum DNA methylase N-4 [Fusobacterium nucleatum]. 496969638 Mrr_cat->N6-MTase*-> N6-MTase N6_N4_Mtase+Methyltransf_26 HMPREF9093_RS05275 247 bacteria>fusobacteria Fusobacterium sp. oral taxon 370 DNA methylase N-4 [Fusobacterium sp. oral taxon 370]. <-496969577_?<-496969579_?<-737521001_?<-737521008_?<-496969609_?||737521003_?->496969633_Mrr_cat->496969638_N6-MTase*->496969641_?->496969644_?->496969660_?-><-496969666_?<-496969669_?<-496969672_?<-737521006_? 503262746 DpnM-N6-MTase->DpnII->?->N6-MTase*-> N6-MTase Methyltransf_26 Q388_RS0120175 247 bacteria>firmicutes Ruminococcus albus DNA methylase N-4 [Ruminococcus albus]. 
<-503262738_?<-503262739_?<-503262740_?||503262741_?->503262742_DpnM-N6-MTase->503262744_DpnII->739445280_?->503262746_N6-MTase*-><-739443706_? 506250654 <-N6-MTase*<-DpnM-N6-MTase N6-MTase Methyltransf_26 LEBU_RS11230 247 bacteria>fusobacteria Leptotrichia buccalis DNA methylase N-4 [Leptotrichia buccalis]. <-506250648_?<-506250649_?<-506250650_?<-506250651_?<-506250652_?<-506250653_?<-493858211_?<-506250654_N6-MTase*<-754128986_DpnM-N6-MTase<-506250656_?<-506250657_?<-506250658_?<-506250659_?<-506250660_?<-754128988_? 547450181 <-N6-MTase*<-DpnII<-DpnII<-DpnM-N6-MTase<-PHP+DNApolIIIa<-?<-?<-SNF N6-MTase N6_N4_Mtase+Methyltransf_26 BN748_01131 247 bacteria>fusobacteria Fusobacterium sp. CAG:649 sensor protein fixL [Fusobacterium sp. CAG:649]. 547450176_?->547450177_?->547450178_?->492574489_?->492574493_?->547450179_?-><-547450180_?<-547450181_N6-MTase*<-547450182_DpnII<-547450183_DpnII<-547450184_DpnM-N6-MTase<-547450185_PHP+DNApolIIIa<-492676764_?<-547450186_?<-547450187_SNF 655060651 <-MutH<-?<-N6-MTase*<-?<-DpnII<-DpnM-N6-MTase N6-MTase Methyltransf_26 CD05_RS0101160 247 bacteria>firmicutes Ruminococcus sp. NK3A76 DNA methylase N-4 [Ruminococcus sp. NK3A76]. 655060645_?->739463238_?->739463240_?->655060647_?->655060648_?-><-655060649_MutH<-655060650_?<-655060651_N6-MTase*<-739463242_?<-655060653_DpnII<-655060654_DpnM-N6-MTase<-655060655_?||655060656_?->655060657_?->655060658_?-> 657692329 DpnM-N6-MTase->?->DpnII->N6-MTase*-> N6-MTase N6_N4_Mtase+Methyltransf_11 J145_RS0109710 247 bacteria>fusobacteria Fusobacterium hwasookii DNA methylase N-4 [Fusobacterium hwasookii]. 657692322_?->657692323_?->657692324_?->657692325_?->657692326_DpnM-N6-MTase->657692327_?->657692328_DpnII->657692329_N6-MTase*->696306406_?->657692331_?->657692332_?->492676742_?->657692333_?->657692334_?-><-492676735_? 
657695114 DpnM-N6-MTase->?->DpnII->N6-MTase*-> N6-MTase N6_N4_Mtase+Methyltransf_26 J142_RS0109970 247 bacteria>fusobacteria Fusobacterium hwasookii DNA methylase N-4 [Fusobacterium hwasookii]. 657692322_?->657695109_?->657695111_?->657695112_?->657692326_DpnM-N6-MTase->657692327_?->657695113_DpnII->657695114_N6-MTase*->696306406_?->657692331_?->657695115_?->492676742_?->657695116_?->657695117_?-><-657695118_? 657696170 DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase N6_N4_Mtase+Methyltransf_11 J144_RS0109740 247 bacteria>fusobacteria Fusobacterium hwasookii DNA methylase N-4 [Fusobacterium hwasookii]. 657692321_?->657692322_?->657692323_?->657692324_?->657692325_?->657692326_DpnM-N6-MTase->657696169_DpnII->657696170_N6-MTase*->696311518_?->657696172_?-> 697204360 <-N6-MTase*<-?<-?<-HTH+DpnM-N6-MTase N6-MTase Methyltransf_26 T504_RS0104020 247 bacteria>firmicutes Selenomonas sp. ND2010 DNA methylase N-4 [Selenomonas sp. ND2010]. <-697204304_?<-697204305_?||697204359_?-><-697204306_?<-697204307_?||697204308_?->697204309_?-><-697204360_N6-MTase*<-697204310_?<-697204311_?<-697204361_HTH+DpnM-N6-MTase<-697204312_?<-697204313_?||697204314_?-><-697204315_? 754560689 ABC-ATPase->?->?->?-><-?<-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase N6_N4_Mtase+Methyltransf_26 NW74_RS05595 247 bacteria>firmicutes Parvimonas micra DNA methylase N-4 [Parvimonas micra]. <-754560678_?<-754560680_?||754560681_ABC-ATPase->754560683_?->754560684_?->754560686_?-><-754560688_?<-754560689_N6-MTase*<-754560690_DpnII<-754560692_DpnM-N6-MTase<-754560693_?<-754560694_?<-754560696_?||754560698_?->754561342_?-> 736835888 DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase Methyltransf_26 MD85_RS04325 246 bacteria>firmicutes [Clostridium] cellulosi DNA methylase N-4 [[Clostridium] cellulosi]. 
<-736835869_?<-736835872_?||736835875_?->736835877_?->736835880_?->736835882_DpnM-N6-MTase->736835885_DpnII->736835888_N6-MTase*->736835890_?->736836198_?->736835893_?-><-736835895_?||736835898_?->736835900_?->736835903_?-> 749954601 - N6-MTase N6_N4_Mtase+N6_N4_Mtase 246 bacteria>nitrospirae Candidatus Magnetobacterium casensis DNA methylase N-4, partial [Candidatus Magnetobacterium casensis]. 762897706 <-N6-MTase*<-DpnII N6-MTase Methyltransf_26 MSU_RS04145 246 bacteria>tenericutes Mycoplasma suis DNA methylase N-4 [Mycoplasma suis]. <-503375509_?<-503375510_?<-503375511_?||762897703_?-><-503375512_?<-503375513_?<-503374807_?<-762897706_N6-MTase*<-762897832_DpnII<-503375516_?<-503375517_?<-762897716_?||503375520_?->503374813_?-><-503375521_? 323652279 <-N6-MTase*<-DpnII N6-MTase Methyltransf_26 MSU_0848 245 bacteria>tenericutes Mycoplasma suis str. Illinois DNA methylase N-4/N-6 domain-containing protein [Mycoplasma suis str. Illinois]. <-323652272_?<-323652273_?<-323652274_?<-323652275_?<-323652276_?<-323652277_?<-323652278_?<-323652279_N6-MTase*<-323652280_DpnII<-323652281_?<-323652282_?<-323652283_?<-323652284_?||323652285_?->323652286_?-> 503504517 HTH+DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase N6_N4_Mtase+Methyltransf_26 SPICO_RS02800 245 bacteria>spirochaetes Sphaerochaeta coccoides DNA methylase N-4 [Sphaerochaeta coccoides]. 503504509_?-><-503504510_?||752748369_?->503504512_?->503504513_?->503504515_HTH+DpnM-N6-MTase->503504516_DpnII->503504517_N6-MTase*->752747888_?->503504518_?-><-503504519_?||752748371_?->752748373_?->752747890_?->752748375_?-> 547523036 DpnM-N6-MTase->DpnII->N6-MTase*-> N6-MTase N6_N4_Mtase+Methyltransf_26 BN678_01434 245 bacteria>firmicutes Dialister sp. CAG:486 putative RNA methylase, partial [Dialister sp. CAG:486]. 
<-547523009_?<-547523013_?<-547523017_?||547523021_?->547523025_?->547523028_DpnM-N6-MTase->547523032_DpnII->547523036_N6-MTase*->547523040_?->547523043_?-><-547523047_?<-547523051_?||547523055_?->547523059_?->547523063_?-> 737327616 <-DpnII<-DpnM-N6-MTase<-N6-MTase<-N6-MTase*<-?<-?<-?<-?<-PLD+SFII-helicase N6-MTase Methyltransf_26 DP68_RS13195 245 bacteria>firmicutes Clostridium sp. HMP27 DNA methylase N-4 [Clostridium sp. HMP27]. <-737327604_?<-737327606_?||737327608_?-><-737327610_?<-737327613_DpnII<-737327716_DpnM-N6-MTase<-737327615_N6-MTase<-737327616_N6-MTase*<-737327617_?<-737327619_?<-737327621_?<-737327623_?<-737327625_PLD+SFII-helicase||737327628_?-><-737327630_? 496076017 SNF->?->PHP+DNApolIIIa->DpnM-N6-MTase->DpnII->Mrr_cat->N6-MTase*-> N6-MTase Methyltransf_26+N6_N4_Mtase HMPREF0946_RS02930 244 bacteria>fusobacteria Fusobacterium nucleatum DNA methylase N-4 [Fusobacterium nucleatum]. 496076024_?->496076023_SNF->496076022_?->496076021_PHP+DNApolIIIa->496076020_DpnM-N6-MTase->496076019_DpnII->496076018_Mrr_cat->496076017_N6-MTase*->696308233_?->496076014_?->496076013_?->496076012_?->496076011_?->496076010_?-><-492652534_? 503781935 N6-MTase*->DpnM-N6-MTase->N6-MTase->DpnII->PHP+DNApolIIIa-> N6-MTase UPF0020+N6_N4_Mtase MELS_RS05115 244 bacteria>firmicutes Megasphaera elsdenii DNA methylase N-4 [Megasphaera elsdenii]. 503781928_?->503781929_?->503781930_?-><-503781931_?||753929833_?->503781933_?-><-503781934_?||503781935_N6-MTase*->503781936_DpnM-N6-MTase->503781937_N6-MTase->503781938_DpnII->503781939_PHP+DNApolIIIa-><-503781940_?||503781941_?->503781942_?-> 548306764 <-PHP+DNApolIIIa||?-><-DpnII<-N6-MTase<-DpnM-N6-MTase<-N6-MTase* N6-MTase UPF0020 BN715_00862 244 bacteria>firmicutes Megasphaera elsdenii CAG:570 putative RNA methylase [Megasphaera elsdenii CAG:570]. 
<-548306758_?||503781940_?-><-548306759_PHP+DNApolIIIa||548306760_?-><-548306761_DpnII<-548306762_N6-MTase<-548306763_DpnM-N6-MTase<-548306764_N6-MTase*||548306765_?-><-548306766_?<-548306767_?<-548306768_?<-548306769_?<-548306770_?<-548306771_? 657829373 <-N6-MTase* N6-MTase N6_N4_Mtase+Methyltransf_26 P159_RS0116605 244 bacteria>firmicutes Selenomonas ruminantium DNA methylase N-4 [Selenomonas ruminantium]. <-657829360_?<-657829361_?<-739517657_?||657829365_?->739517658_?-><-657829369_?<-739517500_?<-657829373_N6-MTase*<-657829375_?<-657829377_?<-657829379_?<-657829381_?<-657829383_?<-657829385_?<-657829387_? 493484190 N6-MTase*->DpnM-N6-MTase->N6-MTase->DpnII->METHYLASE-> N6-MTase Methyltransf_26 CLOHIR_RS00470 243 bacteria>firmicutes [Clostridium] hiranonis DNA methylase N-4 [[Clostridium] hiranonis]. <-493484183_?||493484184_?->750105496_?->493484186_?->493484187_?->493484188_?->750105505_?->493484190_N6-MTase*->493484191_DpnM-N6-MTase->493484192_N6-MTase->493484193_DpnII->750105506_METHYLASE->750105497_?->493484196_?-><-493484197_? 738699070 MACRODOMAIN->?->?->?->?->N6-MTase*->N6-MTase->DpnII->Primase+SNF+PLD-> N6-MTase N6_N4_Mtase+N6_N4_Mtase HMPREF9124_RS05925 243 bacteria>firmicutes Oribacterium sp. oral taxon 108 DNA methylase N-4 [Oribacterium sp. oral taxon 108]. 738698697_?->738698699_?->496987335_MACRODOMAIN->496986404_?->496988548_?->738698704_?->738698705_?->738699070_N6-MTase*->496990603_N6-MTase->496992270_DpnII->496992383_Primase+SNF+PLD->496987124_?->496987391_?->496992206_?-><-496987099_? 739513201 <-MutH<-?||N6-MTase*->DpnM-N6-MTase->N6-MTase->DpnII-> N6-MTase UPF0020+N6_N4_Mtase K292_RS0108670 240 bacteria>firmicutes Anaerovorax odorimutans DNA methylase N-4 [Anaerovorax odorimutans]. 
739513208_?->653150344_?->653150345_?-><-653150346_?||653150347_?-><-653150348_MutH<-653150349_?||739513201_N6-MTase*->653150351_DpnM-N6-MTase->653150352_N6-MTase->653150353_DpnII->739513202_?->739513209_?->653150354_?->739513203_?-> 754097257 <-PLD+SFII-helicase<-?<-?<-N6-MTase*<-DpnII<-DpnM-N6-MTase N6-MTase Methyltransf_26 COPRO5265_RS06650 240 bacteria>firmicutes Coprothermobacter proteolyticus DNA methylase N-4 [Coprothermobacter proteolyticus]. 754096854_?-><-501537781_?<-754097253_?<-501537729_?<-501538863_PLD+SFII-helicase<-754097255_?<-501538944_?<-754097257_N6-MTase*<-501538765_DpnII<-501538213_DpnM-N6-MTase<-501538948_?<-754096859_?||501539125_?-><-501538866_?<-501537772_? 489540792 <-DpnII<-DpnM-N6-MTase<-N6-MTase<-N6-MTase*<-?<-?<-?<-?<-PLD+SFII-helicase N6-MTase N6_N4_Mtase+N6_N4_Mtase CLPA_RS15010 238 bacteria>firmicutes Clostridium pasteurianum DNA methylase N-4/N-6 domain-containing protein [Clostridium pasteurianum]. <-489540779_?<-489540781_?||489540782_?-><-736828614_?<-489540787_DpnII<-736828612_DpnM-N6-MTase<-489540790_N6-MTase<-489540792_N6-MTase*<-489540794_?<-489540795_?<-489540797_?<-489540799_?<-489540801_PLD+SFII-helicase<-736828393_?<-489540806_? 495813929 <-DpnII<-DpnM-N6-MTase<-N6-MTase* N6-MTase N6_N4_Mtase+N6_N4_Mtase HMPREF9454_RS05820 238 bacteria>firmicutes Megamonas funiformis DNA methylase N-4 [Megamonas funiformis]. <-748623258_?<-495813923_?<-495813924_?<-495813925_?<-495813926_?<-495813927_DpnII<-495813928_DpnM-N6-MTase<-495813929_N6-MTase*<-495813930_?<-495813931_?<-495813932_?<-748623259_?<-495813934_?<-495813935_?<-495813936_? 496091935 <-wHTH+REase-DpnII<-DpnM-N6-MTase||N6-MTase*-> N6-MTase Methyltransf_26 QSI_RS08630 238 bacteria>firmicutes Clostridiales MULTISPECIES: DNA methylase N-4 [Clostridiales]. <-738856073_?<-496091933_wHTH+REase-DpnII<-497274454_DpnM-N6-MTase||496091935_N6-MTase*->496091936_?->738856076_?->496091938_?->488677428_?->488677427_?->545045779_?-><-497274460_? 
505207017 HNH->?->?->?->?->?->N6-MTase*->N6-MTase->DpnM-N6-MTase->DpnII-> N6-MTase N6_N4_Mtase+N6_N4_Mtase H122_RS0107615 238 bacteria>firmicutes Clostridium saccharoperbutylacetonicum DNA methylase, partial [Clostridium saccharoperbutylacetonicum]. 516421054_?->505207023_HNH->505207022_?->505207021_?->505207020_?->505207019_?->505207018_?->505207017_N6-MTase*->505207016_N6-MTase->505207015_DpnM-N6-MTase->505207014_DpnII->505207013_?->505207012_?->505207011_?->505207010_?-> 547961389 <-DpnII<-N6-MTase<-DpnM-N6-MTase<-N6-MTase* N6-MTase N6_N4_Mtase+N6_N4_Mtase BN656_01315 238 bacteria>bacteroidetes Bacteroides pectinophilus CAG:437 putative DNA (Cytosine-5-)-methyltransferase [Bacteroides pectinophilus CAG:437]. <-547961385_?<-547961386_DpnII<-547961387_N6-MTase<-547961388_DpnM-N6-MTase<-547961389_N6-MTase*||547961390_?->547961391_?-> 648605175 N6-MTase*->DpnM-N6-MTase->DpnII-> N6-MTase N6_N4_Mtase+N6_N4_Mtase F553_RS0104430 238 bacteria>firmicutes Megamonas rupellensis DNA methylase N-4 [Megamonas rupellensis]. <-517828949_?||517828950_?->517828951_?->648605173_?->495813932_?->517828953_?->517828954_?->648605175_N6-MTase*->517828956_DpnM-N6-MTase->517828957_DpnII->517828958_?->495813924_?->517828959_?->517828960_?->517828961_?-> 657680312 <-N6-MTase*||DpnM-N6-MTase-> N6-MTase N6_N4_Mtase+N6_N4_Mtase VE20218_RS15590 238 bacteria>firmicutes Clostridiales bacterium VE202-18 DNA methylase N-4 [Clostridiales bacterium VE202-18]. <-657680300_?||657680302_?->657680304_?->736546666_?->657680307_?->657680309_?->657680311_?-><-657680312_N6-MTase*||657680314_DpnM-N6-MTase->657680316_?->657680318_?->657680320_?->657680322_?->657680324_?->489634871_?-> 736866276 <-DpnII<-DpnM-N6-MTase<-N6-MTase<-N6-MTase*<-?<-?<-HNH N6-MTase N6_N4_Mtase+N6_N4_Mtase G594_RS0107390 238 bacteria>firmicutes Clostridium paraputrificum DNA methylase N-4 [Clostridium paraputrificum]. 
652800875_?-><-652800876_?<-652800877_?<-736866318_?<-652800878_DpnII<-652800879_DpnM-N6-MTase<-652800880_N6-MTase<-736866276_N6-MTase*<-652800882_?<-652800883_?<-652800884_HNH<-652800885_?<-652800886_?<-736866322_?<-652800888_? 737163966 <-MutH||N6-MTase*->N6-MTase->DpnM-N6-MTase-> N6-MTase Methyltransf_26 CTM_RS16890 238 bacteria>firmicutes Clostridium tetanomorphum DNA methylase N-4 [Clostridium tetanomorphum]. 737164062_?->737164063_?->737163956_?->737163958_?->737163960_?->737163962_?-><-737163964_MutH||737163966_N6-MTase*->737163967_N6-MTase->737164065_DpnM-N6-MTase->737163969_?->737163971_?->737163974_?->737163975_?->737163977_?-> 737306053 PLD+SFII-helicase->?->?->McrC-NTD->METHYLASE-><-?||N6-MTase*->N6-MTase->DpnM-N6-MTase->DpnII->Mrr_cat-> N6-MTase UPF0020+N6_N4_Mtase CC89_RS03170 238 bacteria>firmicutes Clostridium sp. KNHs214 DNA methylase N-4 [Clostridium sp. KNHs214]. 737308255_?->737306041_PLD+SFII-helicase->737306043_?->737306045_?->737306048_McrC-NTD->737308257_METHYLASE-><-737306051_?||737306053_N6-MTase*->737306055_N6-MTase->737306058_DpnM-N6-MTase->737306060_DpnII->737306062_Mrr_cat->737306064_?->737306066_?->737306068_?-> 737398024 <-SNF<-?<-DpnM-N6-MTase<-N6-MTase<-N6-MTase*||DpnII-><-ABC-ATPase<-ABC-ATPase N6-MTase Methyltransf_26 Q428_RS06840 238 bacteria>firmicutes Fervidicella metallireducens DNA methylase N-4 [Fervidicella metallireducens]. <-737398017_?<-737398043_?||737398019_?-><-737398045_SNF<-737398047_?<-737398021_DpnM-N6-MTase<-737398022_N6-MTase<-737398024_N6-MTase*||737398026_DpnII-><-737398049_ABC-ATPase<-737398027_ABC-ATPase<-737398029_?<-737398031_?||737398033_?-><-737398035_? 746722773 N6-MTase*->DpnII->N6-MTase->?->?->?-><-?||PLD+SFII-helicase-> N6-MTase Methyltransf_26 QX51_RS12035 234 bacteria>firmicutes Terrisporobacter othiniensis DNA methylase N-4 [Terrisporobacter othiniensis]. 
746722766_?->746722767_?->746722768_?->746722769_?->746722770_?->746722771_?->746722772_?->746722773_N6-MTase*->746722774_DpnII->746722865_N6-MTase->746722775_?->746722776_?->746722777_?-><-746722778_?||746722866_PLD+SFII-helicase-> # 1; 406873648 DpnM-N6-MTase->wHTH+REase-DpnII->N6-MTase*-> N6-MTase Methyltransf_11+Peptidase_S24 ACD_81C00186G0010 395 bacteria uncultured bacterium Sensor protein fixL [uncultured bacterium]. 406873641_?->406873642_?-><-406873643_?||406873644_?->406873645_?->406873646_DpnM-N6-MTase->406873647_wHTH+REase-DpnII->406873648_N6-MTase*->406873649_?->406873650_?->406873651_?-><-406873652_?<-406873653_?||406873654_?->
General notes
The Group I/Clade 5/Ot12g00270 group of methylases is found in dinoflagellates, chlorophytes and Emiliania. The Symbiodinium version is fused to PPR repeats. Perhaps there is a common plastid function across these. This group and its prokaryotic homologs are distinguished by an H before the Rossmann methylase, a D in strand-1*, a D after strand-2*, an S at the end of strand-3, a T before the characteristic SPPY motif in strand-4, and an E and an R flanking strand-7. They also seem to have large inserts between strands 2 and 3 and between strands 3 and 4. The SPPY motif suggests that they are N4 methylases. Operons suggest that they are derived from R-M systems.
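The operon-based inference above relies on the gene-neighborhood strings listed in the tables. As a minimal sketch of how such a string could be read programmatically, the Python below assumes (this is an interpretation of the notation, not something stated in the source) that `<-GI_label` denotes a gene on the reverse strand, `GI_label->` a gene on the forward strand, `||` a divergent junction, and a trailing `*` the query gene. The names `Gene`, `parse_neighborhood` and `same_strand_run` are hypothetical helpers introduced only for illustration.

```python
import re
from typing import List, NamedTuple


class Gene(NamedTuple):
    gi: str         # numeric identifier before the underscore
    label: str      # domain annotation; "?" when unannotated
    strand: str     # "-" for "<-GI_label", "+" for "GI_label->"
    is_query: bool  # True when the label carried a trailing "*"


# Assumed notation: reverse-strand genes are written "<-GI_label" and
# forward-strand genes "GI_label->". Labels may contain "-", "+" and "_".
_TOKEN = re.compile(
    r"<-(?P<rgi>\d+)_(?P<rlabel>.+?)(?=<-|\|\||$)"  # reverse-strand gene
    r"|(?P<fgi>\d+)_(?P<flabel>.+?)->"              # forward-strand gene
)


def parse_neighborhood(text: str) -> List[Gene]:
    """Split one neighborhood string into genes, in the order written."""
    genes = []
    for m in _TOKEN.finditer(text.strip()):
        if m.group("rgi") is not None:
            gi, label, strand = m.group("rgi"), m.group("rlabel"), "-"
        else:
            gi, label, strand = m.group("fgi"), m.group("flabel"), "+"
        genes.append(Gene(gi, label.rstrip("*"), strand, label.endswith("*")))
    return genes


def same_strand_run(genes: List[Gene]) -> List[Gene]:
    """Return the contiguous same-strand block that contains the query gene."""
    idx = next(i for i, g in enumerate(genes) if g.is_query)
    strand = genes[idx].strand
    lo = idx
    while lo > 0 and genes[lo - 1].strand == strand:
        lo -= 1
    hi = idx
    while hi + 1 < len(genes) and genes[hi + 1].strand == strand:
        hi += 1
    return genes[lo:hi + 1]


if __name__ == "__main__":
    # Neighborhood string taken from the Bacteroides pectinophilus CAG:437 entry.
    example = ("<-547961385_?<-547961386_DpnII<-547961387_N6-MTase"
               "<-547961388_DpnM-N6-MTase<-547961389_N6-MTase*"
               "||547961390_?->547961391_?->")
    for gene in same_strand_run(parse_neighborhood(example)):
        print(gene.gi, gene.label, gene.strand)
```

The same-strand block is only a rough proxy for a candidate operon: it ignores the intergenic-distance cutoff and the conservation filter described in the methods, since neither is recoverable from the neighborhood string alone.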
GI Gene neighborhoods Archs Pfam architectures Gene name Len Taxonomy Species Genbank #; Eukaryotic versions 551572077 <-FHA+N6-MTase* FHA+N6-MTase FAP+Methyltransf_26 EMIHUDRAFT_464003 688 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_464003 [Emiliania huxleyi CCMP1516]. 551572063_?-><-551572065_?||551572067_?-><-551572069_?||551572071_?-><-551572073_?||551572075_?-><-551572077_FHA+N6-MTase*||551572079_?-><-551572081_?||551572083_?->551572085_?-><-551572087_?<-551572089_?<-551572091_? 551554922 FHA+N6-MTase*-> FHA+N6-MTase Methyltransf_26 EMIHUDRAFT_459692 604 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_459692 [Emiliania huxleyi CCMP1516]. 551554918_?-><-551554920_?||551554922_FHA+N6-MTase*-><-551554924_?||551554926_?-><-551554928_?||551554930_?->551554932_?-><-551554934_?<-551554936_? 551585076 FHA+N6-MTase*-> FHA+N6-MTase FHA+Methyltransf_26 EMIHUDRAFT_115516 683 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_115516 [Emiliania huxleyi CCMP1516]. 551585076_FHA+N6-MTase*-><-551585078_?||551585080_?-><-551585082_?||551585084_?-><-551585086_?||551585088_?-><-551585090_? 551629083 FHA+N6-MTase*-> FHA+N6-MTase FHA+Methyltransf_26 EMIHUDRAFT_95251 658 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_95251 [Emiliania huxleyi CCMP1516]. 551629071_?-><-551629073_?<-551629075_?||551629077_?-><-551629263_?||551629079_?-><-551629081_?||551629083_FHA+N6-MTase*->551629085_?-><-551629087_?<-551629089_?||551629091_?-><-551629093_?||551629095_?-><-551629097_? 551572199 <-N6-MTase*<-?||?-><-?||?-><-?||N6-MTase-> N6-MTase SP EMIHUDRAFT_209088 136 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_209088 [Emiliania huxleyi CCMP1516]. 
<-551572185_?<-551572187_?<-551572189_?<-551572191_?||551572193_?-><-551572195_?||551572197_?-><-551572199_N6-MTase*<-551572201_?||551572203_?-><-551572205_?||551572207_?-><-551572209_?||551572211_N6-MTase->551572213_?-> 551605992 N6-MTase*-> N6-MTase DUF3597+Methyltransf_26 EMIHUDRAFT_231186 572 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_231186 [Emiliania huxleyi CCMP1516]. 551605978_?->551605980_?-><-551605982_?||551605984_?->551605986_?->551605988_?-><-551605990_?||551605992_N6-MTase*-><-551605994_?<-551605996_?<-551605998_?||551606000_?-><-551606002_?||551606004_?-><-551606006_? # 73; Prokaryotic homologs 509139481 DnaJ->?->?->?->?->?->ParB+N6-MTase*->N6-MTase->?->?->?->?->?->N6-MTase-> ParB+N6-MTase ParBc+Methyltransf_26 M201_gp11 526 viruses>dsdna viruses, no rna stage Halovirus HCTV-2 hypothetical protein HCTV2_11 [Halovirus HCTV-2]. 509139476_?->509139477_DnaJ->509139478_?->509139479_?->509139480_?->509139548_?->509139549_?->509139481_ParB+N6-MTase*->509139482_N6-MTase->509139483_?->509139484_?->509139485_?->509139550_?->509139486_?->509139487_N6-MTase-> 505358723 <-MuF+ART<-?<-Phage_portal<-Terminase_LS<-HTH<-ParB+N6-MTase*<-?<-VRRNUC ParB+N6-MTase Methyltransf_26 HGGM_RS12685 523 bacteria>bacteroidetes Alistipes shahii hypothetical protein [Alistipes shahii]. 505358716_?-><-505358717_?<-505358718_MuF+ART<-505358719_?<-505358720_Phage_portal<-648293960_Terminase_LS<-505358722_HTH<-505358723_ParB+N6-MTase*<-505358724_?<-505358725_VRRNUC<-505358726_?<-736777341_?<-505358727_?<-505358728_?<-505358729_? 740824887 VRRNUC->ParB+N6-MTase*->HTH->Terminase_LS->Phage_portal->MuF-> ParB+N6-MTase Methyltransf_26 EL88_RS12025 518 bacteria>bacteroidetes Bacteroides dorei hypothetical protein [Bacteroides dorei]. 
740827101_?->740827104_?->740824875_?->740824878_?->740824881_?->740824884_?->740827107_VRRNUC->740824887_ParB+N6-MTase*->740827110_HTH->740827113_Terminase_LS->740824891_Phage_portal->740827116_MuF-><-740824894_?<-740824899_?<-740824902_? 407249874 <-Antirestrict<-?<-?<-MuF<-?<-Terminase_LS<-?<-ParB+N6-MTase*<-?<-?<-?<-?<-VRRNUC ParB+N6-MTase SP+ParBc+Methyltransf_26 AMBLS11_12430 517 bacteria>proteobacteria>gammaproteobacteria Alteromonas macleodii str. 'Black Sea 11' hypothetical protein AMBLS11_12430 [Alteromonas macleodii str. 'Black Sea 11']. <-407249867_Antirestrict<-407249868_?<-407249869_?<-407249870_MuF<-407249871_?<-407249872_Terminase_LS<-407249873_?<-407249874_ParB+N6-MTase*<-407249875_?<-407249876_?<-407249877_?<-407249878_?<-407249879_VRRNUC<-407249880_?<-407249881_? 495813435 <-MuF<-Phage_portal<-Terminase_LS<-?<-?<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 HMPREF9454_RS03645 481 bacteria>firmicutes Megamonas funiformis ParB-like partition protein [Megamonas funiformis]. <-495813426_?<-748623224_?<-748623225_MuF<-495813429_Phage_portal<-495813430_Terminase_LS<-495813431_?<-495813433_?<-495813435_ParB+N6-MTase*<-495813436_?<-748623226_?<-495813439_?<-495813441_?<-495813442_?<-495813443_?<-495813444_? 655611541 <-AHH||?->ParB+N6-MTase*->?->Terminase_LS-> ParB+N6-MTase ParBc+Methyltransf_26 EM93_14610 478 bacteria>proteobacteria>gammaproteobacteria Vibrio parahaemolyticus hypothetical protein EM93_14610 [Vibrio parahaemolyticus]. 655611534_?->655611535_?-><-655611536_?<-655611537_?<-655611538_?<-655611539_AHH||655611540_?->655611541_ParB+N6-MTase*->655611542_?->655611543_Terminase_LS->655611544_?->655611545_?->655611546_?->655611547_?->655611548_?-> 502738489 ParB+N6-MTase*-> ParB+N6-MTase ParBc+Methyltransf_26 AZL_RS04495 469 bacteria>proteobacteria>alphaproteobacteria Azospirillum lipoferum hypothetical protein [Azospirillum lipoferum]. 
755092447_?->755092445_?->755092443_?->755091482_?->755091480_?->755091478_?->502738490_?->502738489_ParB+N6-MTase*->502738488_?->502738487_?-><-502738486_?<-755091476_?<-755092441_?||755091475_?->755091474_?-> 665395990 ParB+N6-MTase*->?->URI1->HTH->Terminase_LS->?->?->Phage_portal-> ParB+N6-MTase Methyltransf_26 JCM15093_3233 467 bacteria>bacteroidetes Bacteroides graminisolvens DSM 19988 = JCM 15093 sensor protein FixL [Bacteroides graminisolvens DSM 19988 = JCM 15093]. 665395983_?->665395984_?->665395985_?->665395986_?-><-665395987_?<-665395988_?<-665395989_?||665395990_ParB+N6-MTase*->665395991_?->665395992_URI1->665395993_HTH->665395994_Terminase_LS->665395995_?->665395996_?->665395997_Phage_portal-> 646357048 <-AHH||?->?->ParB+N6-MTase*->?->?->Terminase_LS-> ParB+N6-MTase SP+ParBc+Methyltransf_26 EM98_RS02575 465 bacteria>proteobacteria>gammaproteobacteria Vibrio parahaemolyticus hypothetical protein [Vibrio parahaemolyticus]. 639555748_?-><-491625684_?<-639577452_?<-545083456_?<-646357052_AHH||491637856_?->686139500_?->646357048_ParB+N6-MTase*->757509302_?->686140497_?->686140496_Terminase_LS->545125087_?->545125048_?->639577449_?->686140495_?-> 686232662 <-Terminase_LS<-ParB+N6-MTase*<-?<-?||AHH-> ParB+N6-MTase SP+ParBc+Methyltransf_26 EM79_05715 465 bacteria>proteobacteria>gammaproteobacteria Vibrio parahaemolyticus hypothetical protein [Vibrio parahaemolyticus]. <-545084109_?<-686232657_?<-686232658_?<-686232659_?<-545125048_?<-686232660_?<-686232661_Terminase_LS<-686232662_ParB+N6-MTase*<-686232663_?<-491637856_?||646357052_AHH->686232664_?->646368128_?->491625684_?-><-639555748_? 755016279 N6-MTase->?->?->?->ParB+N6-MTase*->?->HTH->Terminase_LS->Phage_portal-> ParB+N6-MTase Methyltransf_26 TY03_RS09575 463 bacteria>bacteroidetes Bacteroidaceae bacterium MS4 hypothetical protein [Bacteroidaceae bacterium MS4]. 
755016269_?->755016271_?->755016274_?->755016853_N6-MTase->755016855_?->755016275_?->755016277_?->755016279_ParB+N6-MTase*->755016281_?->755016857_HTH->755016284_Terminase_LS->755016859_Phage_portal->755016286_?->755016288_?->755016289_?-> 771670996 ParB+N6-MTase*-><-?||HTH->Terminase_LS->Phage_portal-> ParB+N6-MTase ParBc+Methyltransf_26 UB46_RS23670 461 bacteria>proteobacteria>betaproteobacteria Burkholderiaceae bacterium 16 chromosome partitioning protein ParB [Burkholderiaceae bacterium 16]. 771670978_?-><-771670981_?||771670984_?->771670986_?->771670990_?->771670992_?->771670994_?->771670996_ParB+N6-MTase*-><-771670998_?||771671000_HTH->771671002_Terminase_LS->771671004_Phage_portal->771671007_?->771671010_?->771671012_?-> 771680244 <-MuF<-Terminase_LS<-HTH<-ParB+N6-MTase*||?->?-><-?<-?||?-><-DCM ParB+N6-MTase ParBc+Methyltransf_26 UB46_RS43165 461 bacteria>proteobacteria>betaproteobacteria Burkholderiaceae bacterium 16 chromosome partitioning protein ParB [Burkholderiaceae bacterium 16]. <-771680264_MuF<-771680266_Terminase_LS<-771680268_HTH<-771680244_ParB+N6-MTase*||771680247_?->771680249_?-><-771680252_?<-771680255_?||771680257_?-><-771680259_DCM<-771680261_? 429146233 <-MuF<-Phage_portal<-Terminase_LS<-HTH<-?<-?<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 HMPREF9998_01728 460 bacteria>firmicutes Peptostreptococcus anaerobius VPI 4330 = DSM 2949 ParB-like protein [Peptostreptococcus anaerobius VPI 4330 = DSM 2949]. <-429146226_?<-429146227_MuF<-429146228_Phage_portal<-429146229_Terminase_LS<-429146230_HTH<-429146231_?<-429146232_?<-429146233_ParB+N6-MTase*<-429146234_?<-429146235_?<-429146236_?<-429146237_?<-429146238_?<-429146239_?<-429146240_? 759627761 HNH->?->HNH->ParB+N6-MTase*->HTH->Terminase_LS->?->?->P22_CoatProtein-> ParB+N6-MTase ParBc+Methyltransf_26 RR42_RS08225 460 bacteria>proteobacteria>betaproteobacteria Cupriavidus basilensis chromosome partitioning protein ParB [Cupriavidus basilensis]. 
<-759627756_?||759627757_?->759627758_?->759627759_?->759633912_HNH->759627760_?->759633914_HNH->759627761_ParB+N6-MTase*->759633917_HTH->759627762_Terminase_LS->759627763_?->759633919_?->759627764_P22_CoatProtein->759627765_?->759627768_?-> 499617803 ParB+N6-MTase*->HTH->Terminase_LS->Phage_portal->MuF-> ParB+N6-MTase ParBc+Methyltransf_26 REUT_RS12130 459 bacteria>proteobacteria>betaproteobacteria Cupriavidus pinatubonensis hypothetical protein [Cupriavidus pinatubonensis]. <-499617796_?<-754010951_?||499617798_?->499617799_?->754010954_?->499617801_?->499617802_?->499617803_ParB+N6-MTase*->499617804_HTH->499617805_Terminase_LS->499617806_Phage_portal->499617807_MuF->499617808_?->499617809_?->499617810_?-> 498505877 <-UPF0150+RHH_1<-?<-?||ParB+N6-MTase*->?->Terminase_LS->?->?->P22_CoatProtein-> ParB+N6-MTase ParBc+Methyltransf_26 C266_RS12110 457 bacteria>proteobacteria>betaproteobacteria Pandoraea sp. SD6-2 hypothetical protein [Pandoraea sp. SD6-2]. 498505870_?->498505871_?->738769516_?-><-498505873_?<-498505874_UPF0150+RHH_1<-498505875_?<-738769443_?||498505877_ParB+N6-MTase*->738769518_?->738769519_Terminase_LS->498505880_?->498505881_?->498505882_P22_CoatProtein->498505883_?->498505884_?-> 560179856 <-MuF<-Phage_portal<-Terminase_LS<-HTH<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 U875_RS13765 457 bacteria>proteobacteria>betaproteobacteria Pandoraea pnomenusa chromosome partitioning protein ParB [Pandoraea pnomenusa]. <-560179849_?<-560179850_?<-685479048_?<-685479057_MuF<-560179853_Phage_portal<-560179854_Terminase_LS<-560179855_HTH<-560179856_ParB+N6-MTase*<-560179857_?<-560179858_?<-560179860_?<-753868762_?<-753868765_?<-560179862_?<-685479087_? 564969930 HNH->?-><-?<-UPF0150+RHH_1<-?||ParB+N6-MTase*->HTH->Terminase_LS->Phage_portal->MuF-> ParB+N6-MTase ParBc+Methyltransf_26 W930_RS0102395 457 bacteria>proteobacteria>betaproteobacteria Pandoraea MULTISPECIES: DNA methyltransferase [Pandoraea]. 
738747188_?->564969915_?->738747191_HNH->564969918_?-><-564969922_?<-564969925_UPF0150+RHH_1<-498505875_?||564969930_ParB+N6-MTase*->564969934_HTH->564969935_Terminase_LS->564969937_Phage_portal->564969940_MuF->655322380_?->564969947_?->564969955_?-> 698968189 <-UPF0150+RHH_1<-?<-?||ParB+N6-MTase*->?->Terminase_LS->?->?->P22_CoatProtein-> ParB+N6-MTase ParBc+Methyltransf_26 LV28_24705 457 bacteria>proteobacteria>betaproteobacteria Pandoraea pnomenusa chromosome partitioning protein ParB [Pandoraea pnomenusa]. 698968182_?->698968183_?->698968184_?-><-698968185_?<-698968186_UPF0150+RHH_1<-698968187_?<-698968188_?||698968189_ParB+N6-MTase*->698968190_?->698968191_Terminase_LS->698968192_?->698968193_?->698968194_P22_CoatProtein->698968195_?->698968196_?-> 260093825 <-MuF+PBECR3<-Phage_portal<-Terminase_LS<-HTH||?-><-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 HIAG_01531 452 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae NT127 conserved hypothetical protein [Haemophilus influenzae NT127]. <-260093818_?<-260093819_?<-260093820_MuF+PBECR3<-260093821_Phage_portal<-260093822_Terminase_LS<-260093823_HTH||260093824_?-><-260093825_ParB+N6-MTase*<-260093826_?<-260093827_?<-260093828_?<-260093829_?<-260093830_?||260093831_?->260093832_?-> 501001894 <-MuF+PBECR3<-Phage_portal<-Terminase_LS<-HTH||?-><-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 HIBPF_RS01765 451 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. <-503290935_?<-491959484_?<-752488870_MuF+PBECR3<-503290937_Phage_portal<-503290938_Terminase_LS<-491883473_HTH||491959498_?-><-501001894_ParB+N6-MTase*<-503290939_?<-503292806_?<-503290941_?<-503290942_?<-503290943_?||494052963_?->491883430_?-> 497199068 <-Phage_capsid<-Phage_portal<-?<-?<-?<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 OPIT5_RS03245 447 bacteria>verrucomicrobia Opitutaceae bacterium TAV5 ParB domain protein nuclease [Opitutaceae bacterium TAV5]. 
<-497199061_?<-645069321_?<-763429376_Phage_capsid<-763429377_Phage_portal<-497199065_?<-645069322_?<-497199067_?<-497199068_ParB+N6-MTase*<-497199069_?<-497199070_?<-645069323_?<-763429378_?<-497199073_?<-497199074_?<-645069324_? 736778761 <-MuF<-Phage_portal<-?<-Terminase_LS<-HTH<-URI1<-?<-ParB+N6-MTase* ParB+N6-MTase Methyltransf_26 C511_RS16595 446 bacteria>bacteroidetes Bacteroides graminisolvens hypothetical protein, partial [Bacteroides graminisolvens]. <-736778755_MuF<-640566463_Phage_portal<-653066796_?<-640566462_Terminase_LS<-736778758_HTH<-640566460_URI1<-640566459_?<-736778761_ParB+N6-MTase*||640566457_?->736778764_?->640566455_?-><-640566454_?<-640566453_?<-640566452_?<-653066797_? 157325361 ParB+N6-MTase*->?->gp79-> ParB+N6-MTase ParBc+Methyltransf_26 LiPB054_gp77 445 viruses>dsdna viruses, no rna stage>caudovirales Listeria phage B054 gp77 [Listeria phage B054]. 157325353_?->157325354_?->157325355_?->157325357_?->157325358_?->157325359_?->157325360_?->157325361_ParB+N6-MTase*->157325362_?->157325363_gp79->157325364_?-> 489827744 <-MuF+MPTase<-Phage_portal<-Terminase_LS<-HTH<-?<-gp79<-?<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 HT51_04190 445 bacteria>firmicutes Listeria monocytogenes chromosome partitioning protein ParB [Listeria monocytogenes]. <-685901802_MuF+MPTase<-685901805_Phage_portal<-685901807_Terminase_LS<-685901809_HTH<-489827747_?<-489827746_gp79<-489827745_?<-489827744_ParB+N6-MTase*<-489827743_?<-489827741_?<-489827740_?<-489827739_?<-489827738_?<-489827736_?<-489827735_? 499299925 ParB+N6-MTase*->?->gp79->?->HTH->Terminase_LS->?->MuF-> ParB+N6-MTase ParBc+Methyltransf_26 LIN_RS06450 445 bacteria>firmicutes Listeria innocua chromosome partitioning protein ParB [Listeria innocua]. 
499299918_?->499299919_?-><-499299920_?||754507716_?->754507717_?->499299923_?->499299924_?->499299925_ParB+N6-MTase*->499299926_?->499299927_gp79->499299928_?->754507766_HTH->499299930_Terminase_LS->499299931_?->754507767_MuF-> 499300722 <-MuF+MPTase<-Phage_portal<-Terminase_LS<-HTH<-?<-gp79<-?<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 LIN_RS08830 445 bacteria>firmicutes Listeria innocua chromosome partitioning protein ParB [Listeria innocua]. <-489827751_MuF+MPTase<-489827750_Phage_portal<-489827749_Terminase_LS<-489827748_HTH<-499299928_?<-499300721_gp79<-489827745_?<-499300722_ParB+N6-MTase*<-489827743_?<-499300723_?<-754507717_?<-499300724_?<-499300725_?<-499299919_?<-499299918_? 518425096 <-MuF<-Phage_portal<-Terminase_LS<-HTH<-?<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 F823_RS0101690 445 bacteria>firmicutes Peptostreptococcus anaerobius chromosome partitioning protein ParB [Peptostreptococcus anaerobius]. <-518425093_?<-488934558_?<-488934559_MuF<-488934560_Phage_portal<-488934562_Terminase_LS<-518425094_HTH<-488934568_?<-518425096_ParB+N6-MTase*<-488934571_?<-518425098_?<-488934574_?<-488934575_?<-488934576_?<-488934577_?<-488934578_? 685911099 ParB+N6-MTase*->?->gp79->?->HTH->Terminase_LS->Phage_portal->MuF+MPTase-> ParB+N6-MTase ParBc+Methyltransf_26 HS95_01700 445 bacteria>firmicutes Listeria monocytogenes chromosome partitioning protein ParB [Listeria monocytogenes]. 685911096_?->685911097_?->489827738_?->489827739_?->489827740_?->489827741_?->489827743_?->685911099_ParB+N6-MTase*->489827745_?->489827746_gp79->685911100_?->489827748_HTH->685911102_Terminase_LS->685911103_Phage_portal->685911104_MuF+MPTase-> 746332729 <-MuF+MPTase<-Phage_portal<-Terminase_LS<-HTH<-?<-gp79<-?<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 I794_RS04725 445 bacteria>firmicutes Listeria monocytogenes chromosome partitioning protein ParB [Listeria monocytogenes]. 
<-746332714_MuF+MPTase<-489827750_Phage_portal<-746332716_Terminase_LS<-746332718_HTH<-746332721_?<-746332725_gp79<-489827745_?<-746332729_ParB+N6-MTase*<-746332790_?<-746332731_?<-746332734_?<-746332792_?<-746332736_?<-746332739_?<-746332741_? 488292151 <-Terminase_LS<-HTH<-gp79<-?<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 D927_RS13925 444 bacteria>firmicutes Enterococcus faecalis chromosome partitioning protein ParB [Enterococcus faecalis]. <-658549994_Terminase_LS<-488292154_HTH<-498481714_gp79<-488292152_?<-488292151_ParB+N6-MTase*<-727131312_?<-488340732_?<-514887749_?<-642973505_?<-514887750_?<-488295138_?<-488295137_? 488295148 N6-MTase->?->?->?->ParB+N6-MTase*->?->gp79->?->HTH->Terminase_LS->Phage_portal->MuF-> ParB+N6-MTase ParBc+Methyltransf_26 WUC_RS08035 444 bacteria>firmicutes Enterococcus faecalis chromosome partitioning protein ParB [Enterococcus faecalis]. 488297008_?->488297009_?->488320992_?->488297012_N6-MTase->488331601_?->488295144_?->488295146_?->488295148_ParB+N6-MTase*->488292152_?->498479911_gp79->488297017_?->488295152_HTH->504337688_Terminase_LS->727003452_Phage_portal->488295155_MuF-> 488328083 <-MuF<-Phage_portal<-Terminase_LS<-HTH<-?<-gp79<-?<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 HMPREF9518_RS16950 444 bacteria>firmicutes Enterococcus faecalis chromosome partitioning protein ParB [Enterococcus faecalis]. <-727197464_MuF<-488328077_Phage_portal<-488328078_Terminase_LS<-488328079_HTH<-488328080_?<-488328081_gp79<-488292152_?<-488328083_ParB+N6-MTase*<-488295146_?<-488328084_?<-488328086_? 498397880 N6-MTase->?->?->?->?->ParB+N6-MTase*->?->gp79->Terminase_SS->Terminase_LS->Phage_portal->MuF-> ParB+N6-MTase ParBc+Methyltransf_26 WU9_RS08245 444 bacteria>firmicutes Enterococcus faecalis hypothetical protein [Enterococcus faecalis]. 
488297009_?->488320992_?->488299640_N6-MTase->488299639_?->498398227_?->488293129_?->498397879_?->498397880_ParB+N6-MTase*->498397881_?->498397882_gp79->498397883_Terminase_SS->498397884_Terminase_LS->498397885_Phage_portal->514889620_MuF->498397887_?-> 498481713 N6-MTase->?->?->?->?->ParB+N6-MTase*->?->gp79->HTH->Terminase_LS->Phage_portal->MuF-> ParB+N6-MTase ParBc+Methyltransf_26 UMY_RS13155 444 bacteria>firmicutes Enterococcus faecalis hypothetical protein [Enterococcus faecalis]. 488298694_?->488298693_?->488298692_N6-MTase->488298691_?->498481711_?->488298687_?->498481712_?->498481713_ParB+N6-MTase*->488292152_?->498481714_gp79->498481715_HTH->727155214_Terminase_LS->727155215_Phage_portal->498481718_MuF->642980108_?-> 498526876 N6-MTase->?->?->?->?->ParB+N6-MTase*->?->gp79->?->HTH->Terminase_LS->Phage_portal->MuF-> ParB+N6-MTase ParBc+Methyltransf_26 WOK_RS09735 444 bacteria>firmicutes Enterococcus faecalis hypothetical protein [Enterococcus faecalis]. 488299642_?->488320992_?->488299640_N6-MTase->488299639_?->498398227_?->498526875_?->488295146_?->498526876_ParB+N6-MTase*->488292152_?->498479911_gp79->498526877_?->498526878_HTH->727020034_Terminase_LS->727020036_Phage_portal->488293090_MuF-> 506557932 ParB+N6-MTase*->?->gp79-> ParB+N6-MTase ParBc+Methyltransf_26 D351_RS17740 444 bacteria>firmicutes Enterococcus MULTISPECIES: DNA adenine methyltransferase [Enterococcus]. 642981009_?->498470081_?->514905202_?->514905317_?->506557932_ParB+N6-MTase*->506557933_?->488319782_gp79-> 514889617 <-MuF<-Phage_portal<-Terminase_LS<-Terminase_SS<-gp79<-?<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 D349_RS04690 444 bacteria>firmicutes Enterococcus faecalis ParB-like protein [Enterococcus faecalis]. 
<-498397887_?<-514889620_MuF<-514889619_Phage_portal<-498397884_Terminase_LS<-514889618_Terminase_SS<-498397882_gp79<-498397881_?<-514889617_ParB+N6-MTase*<-514908240_?||514908242_?-> 514907821 ParB+N6-MTase*->?->gp79->HTH->Terminase_LS->Phage_portal->?->Phage_GP20-> ParB+N6-MTase ParBc+Methyltransf_26 D350_RS16800 444 bacteria>firmicutes Enterococcus faecalis ParB-like protein [Enterococcus faecalis]. <-514907818_?||514907819_?->727198410_?->514907821_ParB+N6-MTase*->514907822_?->642980399_gp79->514907824_HTH->514907825_Terminase_LS->514907826_Phage_portal->514907827_?->514907828_Phage_GP20-> 640121481 ParB+N6-MTase*->?->gp79->?->HTH->Terminase_LS->Phage_portal->MuF-> ParB+N6-MTase ParBc+Methyltransf_26 JF27_RS10635 444 bacteria>firmicutes Enterococcus faecalis chromosome partitioning protein ParB [Enterococcus faecalis]. 640121466_?->488327247_?->640121471_?->488333040_?->488296063_?->488293129_?->488295146_?->640121481_ParB+N6-MTase*->488292152_?->498479911_gp79->488297017_?->640121490_HTH->498481013_Terminase_LS->640121493_Phage_portal->488337259_MuF-> 694245431 <-Terminase_LS<-Terminase_SS<-gp79<-?<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 ES21_06495 444 bacteria>firmicutes Enterococcus faecalis chromosome partitioning protein ParB [Enterococcus faecalis]. <-694245427_Terminase_LS<-694245428_Terminase_SS<-694245429_gp79<-694245430_?<-694245431_ParB+N6-MTase*<-694245432_?<-694245433_?<-694245434_?<-694245435_?<-694245436_?<-694245437_?<-694245438_? 727050824 MazG-Phage->?->?->?->?->?->?->ParB+N6-MTase*->?->gp79->Terminase_SS->Terminase_LS->Phage_portal->MuF-> ParB+N6-MTase ParBc+Methyltransf_26 P791_RS10720 444 bacteria>firmicutes Enterococcus faecalis chromosome partitioning protein ParB [Enterococcus faecalis]. 
488300839_MazG-Phage->488333106_?->488285809_?->488300837_?->498521068_?->498521067_?->727050818_?->727050824_ParB+N6-MTase*->488292152_?->498397882_gp79->498406766_Terminase_SS->658430911_Terminase_LS->498407969_Phage_portal->727050857_MuF->488311051_?-> 495977294 <-P22_CoatProtein<-?<-MuF<-?<-Terminase_LS<-HTH<-ParB+N6-MTase*<-?||?-><-?<-?<-?<-?<-DAM ParB+N6-MTase ParBc+Methyltransf_26 FSDG_RS10625 441 bacteria>fusobacteria Fusobacterium nucleatum hypothetical protein [Fusobacterium nucleatum]. <-696266129_?<-495977306_P22_CoatProtein<-495977303_?<-495977301_MuF<-495977299_?<-495977297_Terminase_LS<-495977295_HTH<-495977294_ParB+N6-MTase*<-495977293_?||495977291_?-><-495977289_?<-495977286_?<-495977285_?<-495977284_?<-495977283_DAM 775458612 <-MuF<-?<-Terminase_LS<-?<-ParB+N6-MTase* ParB+N6-MTase Methyltransf_26 BAQ92410.1 441 viruses uncultured Mediterranean phage uvMED DNA modification N6-MTase (COG0863) [uncultured Mediterranean phage uvMED]. <-775458605_?<-775458606_?<-775458607_?<-775458608_MuF<-775458609_?<-775458610_Terminase_LS<-775458611_?<-775458612_ParB+N6-MTase*<-775458613_?<-775458614_?<-775458615_?||775458616_?-><-775458617_?<-775458618_?||775458619_?-> 491883456 - ParB+N6-MTase ParBc+Methyltransf_26 376 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae plasmid partitioning protein ParB [Haemophilus influenzae]. 675161105 <-Phage_GP20<-MuF<-Phage_portal<-Terminase_LS<-HTH<-?<-ParB+N6-MTase* ParB+N6-MTase Methyltransf_26 HMPREF5175_00726 369 bacteria>firmicutes Lactobacillus gasseri SV-16A-US hypothetical protein HMPREF5175_00726 [Lactobacillus gasseri SV-16A-US]. <-675161098_?<-675161099_Phage_GP20<-675161100_MuF<-675161101_Phage_portal<-675161102_Terminase_LS<-675161103_HTH<-675161104_?<-675161105_ParB+N6-MTase*<-675161106_?<-675161107_?<-675161108_?<-675161109_?<-675161110_?<-675161111_?<-675161112_? 763137539 <-MuF<-?<-?<-Terminase_LS<-?<-N6-MTase* N6-MTase Methyltransf_26 QI63_RS07900 330 bacteria>spirochaetes Treponema sp. 
OMZ 838 hypothetical protein, partial [Treponema sp. OMZ 838]. <-763135633_?<-763135634_?<-763135636_MuF<-763135638_?<-763137537_?<-763135640_Terminase_LS<-763135642_?<-763137539_N6-MTase*<-763135644_?<-763135645_?<-763135647_?<-763135649_?<-763135651_?<-763135653_?<-763135656_? 738746241 <-N6-MTase* N6-MTase Methyltransf_26 CG50_RS14910 327 bacteria>proteobacteria>alphaproteobacteria Paenirhodobacter enshiensis DNA methyltransferase [Paenirhodobacter enshiensis]. <-738746247_?<-738746249_?<-738746232_?||738746234_?->738746251_?->738746236_?-><-738746238_?<-738746241_N6-MTase*||738746242_?-> 282557516 ParB+N6-MTase*->?->?->HTH->Terminase_LS->Phage_portal->MuF->Phage_GP20-> ParB+N6-MTase ParBc+Methyltransf_26 HMPREF9209_0590 325 bacteria>firmicutes Lactobacillus gasseri 224-1 ParB-like protein [Lactobacillus gasseri 224-1]. 282557509_?->282557510_?->282557511_?->282557512_?->282557513_?->282557514_?->282557515_?->282557516_ParB+N6-MTase*->282557517_?->282557518_?->282557519_HTH->282557520_Terminase_LS->282557521_Phage_portal->282557522_MuF->282557523_Phage_GP20-> 502831711 <-N6-MTase*<-ParB<-?<-?<-?<-HTH N6-MTase Methyltransf_26 U717_RS25485 324 bacteria>proteobacteria>alphaproteobacteria Rhodobacter capsulatus DNA methyltransferase [Rhodobacter capsulatus]. <-739227891_?<-502831705_?<-665954923_?<-502831707_?||502831708_?->665954925_?-><-502831710_?<-502831711_N6-MTase*<-739227894_ParB<-739227896_?<-665954928_?<-502831715_?<-502831716_HTH||665954930_?->502831718_?-> 565833115 HTH->?->?->?->ParB->N6-MTase*-> N6-MTase Methyltransf_26 U713_RS23890 322 bacteria>proteobacteria>alphaproteobacteria Rhodobacter capsulatus DNA methyltransferase [Rhodobacter capsulatus]. 
<-565833101_?<-565833103_?||565833105_HTH->665960137_?->665960138_?->739227896_?->565833113_ParB->565833115_N6-MTase*->565833117_?-><-665960140_?<-665960141_?<-665960142_?||565833125_?->665960143_?->565833129_?-> 68058078 ParB+N6-MTase*->?-><-?||HTH->Terminase_LS->Phage_portal->MuF-> ParB+N6-MTase ParBc+N6_N4_Mtase NTHI1522 319 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae 86-028NP hypothetical protein NTHI1522 [Haemophilus influenzae 86-028NP]. <-68058071_?<-68058072_?||68058073_?->68058074_?->68058075_?->68058076_?->68058077_?->68058078_ParB+N6-MTase*->68058079_?-><-68058080_?||68058081_HTH->68058082_Terminase_LS->68058083_Phage_portal->68058084_MuF->68058085_?-> 775455096 N6-MTase*->?->?-><-?<-DAMT1 N6-MTase Methyltransf_26 BAQ89246.1 316 viruses uncultured Mediterranean phage uvMED DNA modification N6-MTase (COG0863) [uncultured Mediterranean phage uvMED]. <-775455089_?||775455090_?->775455091_?->775455092_?->775455093_?->775455094_?->775455095_?->775455096_N6-MTase*->775455097_?->775455098_?-><-775455099_?<-775455100_DAMT1<-775455101_?||775455102_?->775455103_?-> 655293784 <-N6-MTase*<-?<-?<-?<-?<-DAM<-SNF<-DCM N6-MTase Methyltransf_26 H590_RS0105875 309 bacteria>actinobacteria Propionibacterium jensenii hypothetical protein [Propionibacterium jensenii]. <-655293778_?<-655293779_?<-655293780_?<-739111574_?<-655293781_?<-655293782_?<-655293783_?<-655293784_N6-MTase*<-655293785_?<-739111630_?<-655293787_?<-655293788_?<-655293789_DAM<-655293790_SNF<-655293791_DCM 768317646 N6-MTase*->?->?->?-><-?||?->?->HNH-> N6-MTase Methyltransf_26 T261_5811 309 bacteria>actinobacteria Streptomyces lydicus A02 ParB domain protein nuclease [Streptomyces lydicus A02]. 
768317639_?->768317640_?->768317641_?->768317642_?->768317643_?->768317644_?->768317645_?->768317646_N6-MTase*->768317647_?->768317648_?->768317649_?-><-768317650_?||768317651_?->768317652_?->768317653_HNH-> 499633394 <-N6-MTase*<-ParB N6-MTase Methyltransf_26 NWI_RS04225 308 bacteria>proteobacteria>alphaproteobacteria Nitrobacter winogradskyi DNA methyltransferase [Nitrobacter winogradskyi]. <-499633388_?<-752697874_?<-499633390_?||499633391_?->752697876_?->752697160_?-><-499633393_?<-499633394_N6-MTase*<-752697878_ParB<-499633396_?<-499633397_?<-499633398_?<-499633399_?<-499633400_?<-499633401_? 307772411 Phage_GPD->?->?->N6-MTase*-> N6-MTase Methyltransf_26 TRICHSKD4_2718 305 bacteria>proteobacteria>alphaproteobacteria Roseibium sp. TrichSKD4 gp77 [Roseibium sp. TrichSKD4]. 307772404_?->307772405_?->307772406_?->307772407_?->307772408_Phage_GPD->307772409_?->307772410_?->307772411_N6-MTase*->307772412_?->307772413_?->307772414_?-><-307772415_?||307772416_?->307772417_?->307772418_?-> 740017278 <-N6-MTase* N6-MTase Methyltransf_26 IG92_RS0128815 304 bacteria>actinobacteria Streptomyces sp. NRRL S-1868 chromosome partitioning protein ParB [Streptomyces sp. NRRL S-1868]. <-664361403_?<-664361406_?<-664361409_?<-664361411_?<-664361413_?<-740017278_N6-MTase*||664361418_?->664361421_?->664361424_?->664361427_?->664361430_?->664361433_?->664361436_?-> 516125430 <-N6-MTase* N6-MTase Methyltransf_26 C892_RS0103080 301 bacteria>actinobacteria Nocardiopsis baichengensis chromosome partitioning protein ParB [Nocardiopsis baichengensis]. 516125423_?-><-516125424_?||648432354_?-><-516125426_?<-750403313_?||750403314_?-><-516125429_?<-516125430_N6-MTase*<-516125431_?||516125433_?-><-516125434_?<-516125435_?||516125436_?->516125437_?->516125438_?-> 750334917 Phage_GPD->?->N6-MTase*-> N6-MTase Methyltransf_26 TRICHSKD4_RS11975 300 bacteria>proteobacteria>alphaproteobacteria Roseibium sp. TrichSKD4 hypothetical protein, partial [Roseibium sp. TrichSKD4]. 
<-497094044_?||497094045_?->497092480_?->497092478_?->497092476_?->497092474_Phage_GPD->497092472_?->750334917_N6-MTase*->497094050_?->750334871_?->750334872_?->750334873_?->750334874_?-><-750334918_?<-750334875_? 304360713 N6-MTase*-> N6-MTase Methyltransf_26 phiCTP1_gp51 299 viruses>dsdna viruses, no rna stage>caudovirales Clostridium phage phiCTP1 hypothetical protein phiCTP1_gp51 [Clostridium phage phiCTP1]. 304360706_?->304360707_?->304360708_?->304360709_?->304360710_?->304360711_?->304360712_?->304360713_N6-MTase*->304360714_?->304360715_?->304360716_?->304360717_?->304360718_?->304360719_?->304360720_?-> 686217757 N6-MTase*-> N6-MTase Methyltransf_26 T245_RS0104245 298 bacteria>proteobacteria>gammaproteobacteria Vibrio parahaemolyticus hypothetical protein, partial [Vibrio parahaemolyticus]. 686217757_N6-MTase*->686217758_?->686217759_?-> 739963074 <-MuF<-Phage_portal<-?<-HNH<-?<-?<-N6-MTase* N6-MTase Methyltransf_26 B073_RS0131580 295 bacteria>actinobacteria Streptomyces sp. MspMP-M5 chromosome partitioning protein ParB [Streptomyces sp. MspMP-M5]. <-739963064_?<-739963066_MuF<-739963069_Phage_portal<-648556155_?<-648556156_HNH<-739963071_?<-517366965_?<-739963074_N6-MTase*<-739963075_?<-517366968_?<-517366969_?<-739963077_?<-517366972_?<-517366973_?<-739963079_? 648436510 N6-MTase*->?->?->MuF->?->Terminase_LS-><-Phage_portal N6-MTase Methyltransf_26+N6_N4_Mtase D471_RS0130950 292 bacteria>actinobacteria Nocardiopsis lucentensis hypothetical protein [Nocardiopsis lucentensis]. 516189936_?->648436510_N6-MTase*->516189946_?->516189949_?->516189952_MuF->516189954_?->516189956_Terminase_LS-><-750410233_Phage_portal||750410236_?-> 752848289 ParB->?->ParB->N6-MTase*->ParB->?->Terminase_LS->Phage_portal->MuF->?->P22_CoatProtein-> N6-MTase SP+Methyltransf_26 D881_RS16175 270 bacteria>actinobacteria Corynebacterium ulcerans hypothetical protein, partial [Corynebacterium ulcerans]. 
665903459_?->504648867_?->560537641_?->504648869_?->757634954_ParB->560537654_?->757634957_ParB->752848289_N6-MTase*->757634964_ParB->503677968_?->560537665_Terminase_LS->560537671_Phage_portal->560537676_MuF->560537682_?->560537686_P22_CoatProtein-> 764990690 <-Antirestrict<-?<-?<-MuF<-?<-Terminase_LS<-?<-N6-MTase*<-?<-?<-?<-?<-VRRNUC N6-MTase Methyltransf_26 AMBLS11_RS12625 270 bacteria>proteobacteria>gammaproteobacteria Alteromonas macleodii hypothetical protein, partial [Alteromonas macleodii]. <-764990687_Antirestrict<-504811868_?<-504811869_?<-764990688_MuF<-504811871_?<-764990689_Terminase_LS<-504811873_?<-764990690_N6-MTase*<-504811875_?<-504811876_?<-764990487_?<-504811878_?<-504811879_VRRNUC<-504811880_?<-504811881_? 754504471 <-N6-MTase*<-ParB||N6-MTase->?-><-HNH N6-MTase Methyltransf_26 SINME_RS11520 256 bacteria>proteobacteria>alphaproteobacteria Sinorhizobium meliloti hypothetical protein, partial [Sinorhizobium meliloti]. <-754504418_?||754504470_?->503610625_?-><-503610626_?<-503610627_?<-503610628_?<-503610629_?<-754504471_N6-MTase*<-754504472_ParB||503610630_N6-MTase->503610631_?-><-503610632_HNH<-503610633_?<-754504473_?<-754504474_? 728810984 N6-MTase*->?->gp79->HTH->Terminase_LS->Phage_portal->MuF+MPTase-> N6-MTase Methyltransf_26 QR19_RS07135 228 bacteria>firmicutes Enterococcus faecalis chromosome partitioning protein ParB, partial [Enterococcus faecalis]. 728810984_N6-MTase*->488292152_?->498481714_gp79->728810986_HTH->728810987_Terminase_LS->728810989_Phage_portal->728810990_MuF+MPTase->488311052_?-> 738605470 N6-MTase->?->?->?->?->ParB->N6-MTase*->?-><-?||?->?->?->?->MuF-> N6-MTase Methyltransf_26 DM07_RS11075 223 bacteria>proteobacteria>alphaproteobacteria Oceanicaulis sp. HL-87 DNA methyltransferase, partial [Oceanicaulis sp. HL-87]. 
738603777_?->738603780_N6-MTase->738603783_?->738603785_?->738603788_?->738605465_?->738605468_ParB->738605470_N6-MTase*->738603791_?-><-738603794_?||738603798_?->738605473_?->738605476_?->738605479_?->738603799_MuF-> 736650700 ParB->N6-MTase*->ParB->CRISPR_assoc->?->?->?->?->N6-MTase-> N6-MTase Methyltransf_26 HMPREF1261_RS02180 222 bacteria>actinobacteria Corynebacterium sp. KPL1818 hypothetical protein, partial [Corynebacterium sp. KPL1818]. 736650695_?->552852507_?->552852510_?->736650696_?->552852517_?->552852521_?->736650698_ParB->736650700_N6-MTase*->552852530_ParB->552852532_CRISPR_assoc->552852535_?->552852539_?->552852542_?->552852545_?->736650701_N6-MTase-> 736660310 <-Thy1<-?<-?<-?<-ParB<-N6-MTase*<-ParB N6-MTase Methyltransf_26 HMPREF1267_RS11500 222 bacteria>actinobacteria Corynebacterium sp. KPL1824 hypothetical protein, partial [Corynebacterium sp. KPL1824]. <-736660162_?||736660164_?-><-552836611_Thy1<-552836618_?<-552836623_?<-552836631_?<-552836639_ParB<-736660310_N6-MTase*<-736660312_ParB<-552836652_?<-552836659_?<-552836663_?<-552836671_?<-552836675_?<-552836682_? 750124080 <-N6-MTase*<-?<-?<-?<-?<-?<-CRISPR_assoc N6-MTase N6_N4_Mtase A3EC_RS11435 206 bacteria>actinobacteria Corynebacterium ulceribovis hypothetical protein, partial [Corynebacterium ulceribovis]. <-516655108_?<-516655109_?<-516655110_?<-516655111_?<-750124076_?<-750124078_?<-516655115_?<-750124080_N6-MTase*<-516655117_?<-516655118_?<-516655119_?<-516655120_?<-516655121_?<-750124082_CRISPR_assoc<-750124084_? # 7; 189432761 <-MuF<-?<-Phage_portal<-Terminase_LS<-?<-?<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 BACCOP_01158 518 bacteria>bacteroidetes Bacteroides coprocola DSM 17136 hypothetical protein BACCOP_01158 [Bacteroides coprocola DSM 17136]. 
189432754_?-><-189432755_MuF<-189432756_?<-189432757_Phage_portal<-189432758_Terminase_LS<-189432759_?<-189432760_?<-189432761_ParB+N6-MTase*<-189432762_?<-189432763_?<-189432764_?<-189432765_?<-189432766_?<-189432767_?<-189432768_? 652946519 ParB+N6-MTase*->?->?->Terminase_LS->Phage_portal->MuF-> ParB+N6-MTase ParBc+Methyltransf_26 K252_RS0101690 518 bacteria>bacteroidetes Butyricimonas virosa hypothetical protein [Butyricimonas virosa]. 652946514_?->736480518_?->652946515_?->494838952_?->652946516_?->652946517_?->652946518_?->652946519_ParB+N6-MTase*->652946520_?->652946521_?->736480520_Terminase_LS->652946522_Phage_portal->736480522_MuF-><-652946523_?<-736480477_? 740821721 ParB+N6-MTase*->?->?->Terminase_LS->Phage_portal->MuF-> ParB+N6-MTase Methyltransf_26 GV66_RS18355 518 bacteria>bacteroidetes Bacteroides dorei chromosome partitioning protein ParB [Bacteroides dorei]. 740821704_?->652946513_?->740821707_?->740821710_?->740821713_?->740821715_?->740821718_?->740821721_ParB+N6-MTase*->652946520_?->494838940_?->740822740_Terminase_LS->740821727_Phage_portal->740822743_MuF-><-740821731_?<-494838927_? 498101954 <-MuF<-?<-Phage_portal<-Terminase_LS<-?<-?<-?<-ParB+N6-MTase*<-?<-?<-?<-?<-?<-N6-MTase ParB+N6-MTase Methyltransf_26 ATH1_RS0100585 515 bacteria>bacteroidetes Anaerophaga thermohalophila hypothetical protein [Anaerophaga thermohalophila]. <-763245641_MuF<-498101940_?<-498101943_Phage_portal<-498101946_Terminase_LS<-656214691_?<-498101951_?<-656214692_?<-498101954_ParB+N6-MTase*<-498101957_?<-498101960_?<-656214693_?<-498101969_?<-498101972_?<-498101979_N6-MTase<-498101981_? 518076130 DCM->?->?->?->?->?->ParB+N6-MTase*->?->?->Terminase_LS->Phage_portal->ParB->MuF-> ParB+N6-MTase Methyltransf_26 BN352_RS08925 512 bacteria>bacteroidetes Candidatus Alistipes marseilloanorexicus chromosome partitioning protein ParB [Candidatus Alistipes marseilloanorexicus]. 
518076122_?->518076123_DCM->518076125_?->518076126_?->518076127_?->736151789_?->518076129_?->518076130_ParB+N6-MTase*->648397961_?->518076131_?->518076132_Terminase_LS->518076133_Phage_portal->518076134_ParB->648397962_MuF-><-518076136_? 749916162 <-MuF<-Phage_portal<-Terminase_LS<-?<-?<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 BACCOP_RS00640 500 bacteria>bacteroidetes Bacteroides coprocola chromosome partitioning protein ParB, partial [Bacteroides coprocola]. 494838927_?->494838929_?-><-749916157_MuF<-494838935_Phage_portal<-749916159_Terminase_LS<-494838940_?<-749916160_?<-749916162_ParB+N6-MTase*<-494838946_?<-494838948_?<-494838950_?<-494838952_?<-494838954_?<-749916163_?<-494838962_? 754558370 ParB+N6-MTase*->?->?->Terminase_LS-> ParB+N6-MTase Methyltransf_26 B157_RS0110235 444 bacteria>bacteroidetes Spirosoma spitsbergense hypothetical protein, partial [Spirosoma spitsbergense]. <-522092021_?<-522092022_?<-522092023_?<-754558369_?<-522092025_?<-522092026_?<-522092027_?||754558370_ParB+N6-MTase*->522092029_?->522092030_?->754558371_Terminase_LS->522092032_?->522092033_?->522092034_?->522092035_?-> # 4; 393402237 ParB->?->ParB+N6-MTase*->ParB->?->Terminase_LS->Phage_portal->MuF->?->P22_CoatProtein-> ParB+N6-MTase ParBc+Methyltransf_26 CULC0102_0528 516 bacteria>actinobacteria Corynebacterium ulcerans 0102 hypothetical protein CULC0102_0528 [Corynebacterium ulcerans 0102]. 393402230_?->393402231_?->393402232_?->393402233_?->393402234_?->393402235_ParB->393402236_?->393402237_ParB+N6-MTase*->393402238_ParB->393402239_?->393402240_Terminase_LS->393402241_Phage_portal->393402242_MuF->393402243_?->393402244_P22_CoatProtein-> 550751922 <-Thy1<-?<-?<-?<-ParB<-ParB+N6-MTase* ParB+N6-MTase ParBc+Methyltransf_26 HMPREF1267_02363 505 bacteria>actinobacteria Corynebacterium sp. KPL1824 hypothetical protein HMPREF1267_02363 [Corynebacterium sp. KPL1824]. 
<-550751915_?<-550751916_?<-550751917_Thy1<-550751918_?<-550751919_?<-550751920_?<-550751921_ParB<-550751922_ParB+N6-MTase*<-550751923_?<-550751924_?<-550751925_?<-550751926_?<-550751927_?<-550751928_?<-550751929_? 550761804 ParB+N6-MTase*->ParB->CRISPR_assoc->?->?->?->?->N6-MTase-> ParB+N6-MTase ParBc+Methyltransf_26 HMPREF1261_00448 505 bacteria>actinobacteria Corynebacterium sp. KPL1818 hypothetical protein HMPREF1261_00448 [Corynebacterium sp. KPL1818]. 550761797_?->550761798_?->550761799_?->550761800_?->550761801_?->550761802_?->550761803_?->550761804_ParB+N6-MTase*->550761805_ParB->550761806_CRISPR_assoc->550761807_?->550761808_?->550761809_?->550761810_?->550761811_N6-MTase-> 806900839 ParB+N6-MTase*->?->Terminase_LS->Phage_portal->?->Phage_capsid-> ParB+N6-MTase ParBc+Methyltransf_26 ERS075618_03274 503 bacteria>actinobacteria Mycobacterium abscessus ParB-like nuclease domain [Mycobacterium abscessus]. 806900832_?->806900833_?->806900834_?->806900835_?->806900836_?->806900837_?->806900838_?->806900839_ParB+N6-MTase*->806900840_?->806900841_Terminase_LS->806900842_Phage_portal->806900843_?->806900844_Phage_capsid->806900845_?->806900846_?-> # 1; 728810915 N6-MTase*-> N6-MTase Methyltransf_26 QR19_RS04500 147 bacteria>firmicutes Enterococcus faecalis chromosome partitioning protein ParB, partial [Enterococcus faecalis]. 728810915_N6-MTase*-> 68058079 ParB+N6-MTase->?*-><-?||HTH->Terminase_LS->Phage_portal->MuF-> - - NTHI1523 139 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae 86-028NP predicted DNA modification N6-MTase [Haemophilus influenzae 86-028NP]. 
<-68058072_?||68058073_?->68058074_?->68058075_?->68058076_?->68058077_?->68058078_ParB+N6-MTase->68058079_?*-><-68058080_?||68058081_HTH->68058082_Terminase_LS->68058083_Phage_portal->68058084_MuF->68058085_?->68058086_?-> 738080913 <-MuF<-Phage_portal<-Terminase_LS<-HTH||?-><-N6-MTase*<-ParB N6-MTase N6_N4_Mtase LA03_RS30870 212 bacteria>proteobacteria>betaproteobacteria Burkholderia gladioli hypothetical protein, partial [Burkholderia gladioli]. <-738080803_?<-738080907_?<-738080804_MuF<-738080909_Phage_portal<-738080806_Terminase_LS<-738080911_HTH||738080808_?-><-738080913_N6-MTase*<-738080914_ParB||738080810_?-><-738080811_?<-738080813_?<-738080815_?<-738080817_?<-738080819_? 739526353 Radical-SAM+N6-MTase*-> Radical-SAM+N6-MTase Methyltransf_26 ER57_RS06425 739 bacteria>proteobacteria>deltaproteobacteria Smithella sp. SCADC hypothetical protein [Smithella sp. SCADC]. 739526334_?->739526335_?->739526336_?->739526338_?->739526342_?->739526345_?->739526348_?->739526353_Radical-SAM+N6-MTase*->739526355_?->739526358_?->739526361_?-> 399528699 ASCH->ParB+N6-MTase*-> ParB+N6-MTase SP+Methyltransf_26 B620_gp66 452 viruses>dsdna viruses, no rna stage>caudovirales Croceibacter phage P2559S hypothetical protein P2559S_66 [Croceibacter phage P2559S]. 399528692_?->399528693_?->399528694_?->399528695_?->399528696_?->399528697_?->399528698_ASCH->399528699_ParB+N6-MTase*->399528700_?->399528701_?->
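The ID-annotated neighborhood strings in the listing above can be split into per-gene records with a few lines of Python. The following is a minimal sketch, not the authors' tooling. It assumes, from the look of the listing rather than from any stated definition, that `ID_NAME->` is a gene on the forward strand, `<-ID_NAME` a gene on the reverse strand, a trailing `*` marks the query protein, and `||` separates a reverse-strand gene from a following forward-strand gene. The names `Gene` and `parse_neighborhood` are hypothetical, and the compact form of the string without identifiers is not handled.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Gene:
    gene_id: str    # text before the first underscore (a GI number in the listing)
    name: str       # domain/family label; "?" when the gene is unannotated
    strand: str     # "+" for "ID_NAME->", "-" for "<-ID_NAME" (assumed reading)
    is_query: bool  # True when the label carries a trailing "*" (assumed reading)


def _make_gene(token: str, strand: str) -> Gene:
    # Split on the first underscore only, so labels such as
    # "Terminase_LS" or "P22_CoatProtein" stay intact.
    gene_id, _, label = token.partition("_")
    return Gene(gene_id, label.rstrip("*"), strand, label.endswith("*"))


def parse_neighborhood(text: str) -> List[Gene]:
    """Parse one ID-annotated neighborhood string into an ordered gene list."""
    genes: List[Gene] = []
    for segment in text.strip().split("||"):
        # Within a segment, forward genes (each ending in "->") are assumed
        # to precede reverse genes (each starting with "<-").
        cut = segment.find("<-")
        forward = segment if cut == -1 else segment[:cut]
        reverse = "" if cut == -1 else segment[cut:]
        genes += [_make_gene(t, "+") for t in forward.split("->") if t]
        genes += [_make_gene(t, "-") for t in reverse.split("<-") if t]
    return genes


if __name__ == "__main__":
    # Abridged from the Haemophilus influenzae 86-028NP record in the listing.
    example = (
        "<-68058072_?||68058073_?->68058078_ParB+N6-MTase->68058079_?*->"
        "<-68058080_?||68058081_HTH->68058082_Terminase_LS->"
    )
    for gene in parse_neighborhood(example):
        marker = " (query)" if gene.is_query else ""
        print(gene.strand, gene.gene_id, gene.name + marker)
```

Labels are kept verbatim, so fused architectures such as `ParB+N6-MTase` can be split on `+` afterwards if individual domains are needed.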
Str-1 Str-1 Str-2 Str-3 Str-4 Str-5 Str-6 Str-7 FINAL --EEEEE-E-------------------------------------EEEE-HHHHHHHHHHHH----------------H------------------HH-------------------------HHHHHHHHHHHHHHHH-H---------------------------------------HHHHHHHHHH-------EEE------------------------------------------------------------------------------EE-----HHH---------HHHHHH-------HHHHHHHH--------------------------EEEEEE-----------------------HH-HHHHHHH------------EEEE-----EE------E-----------EE-------EEEEE--E----EEEEEEHHHH-----HH--------H--H--HHHHHHHHH-- ALIGN ----EEE----------------------------------------HHHHHHHHHHHHHHHH----------------H------------------HH-------------------------HHHHHHHHHHHHHHHH-H---------------------------------------HHHHHHHHH--------EE------------------------------E------------EE------------------------------------------HH---------HHHHHH-------HHHHHHHHH-------------------------EEEEEE------------------------H-HHHHHH-------------HHHHH------------H-----------H----------EE--------EEEEEEE-----------------HH--H--HHHHHHHH--- HMM --EEEEE-EE-------------------------------------HHHHHHHHHHHHHHHH-----------------------------------HH-------------------------HHHHHHHHHHHHHHHH--------------------------EEE------------HHHHHHHHHH--H-----H---HHHHHHH-------HHHH-HHHHHHHH-------------EE--E--EE--------------------------EEEE----HHH---------HHHHHH-------HHHHHHHHHH---------------H--------EEEEEE---E-------------------HH-HHHHHHH----HH-------HEEHHH--EEEE----E-----------EE--E----EEEEE--E----EEEEEEHHHH-----HH--H----HH--H--HHHHHHHHH-- FREQ 
--EEEEE-E-------------------------------------EEEE---HHHHHHHHHH----------------E------------------------------------------------HHHHHHHHHHHHH-H--------------------------------------HHEEHHHHHHH---EE--EEE----------------EEE-----------------------------------------------------------------HHHH---------HHHHHH-------HHHHHH-----------------------------EEEEE--------------------------HHHHHHH---HHHH-----HHEE----HH-------E-----------EE----H--HHHEE--H-HHHHHEEEHH---------------------H--HHHHHHHHH-- PSSM --EEEEE-E--------------------------------------EE--HHHHHHHHHHHH----------------H----------------------------------------------HHHHHHHHHHHHHHH-------------------------------------------HHHHHHHH----E--E--------------------------------------------------------------------------------EEE-----HH---------HHHHHH-------HHHHHHHHH-------------------------EEEEE-------------------------H-HHHHHH----------------------------------------------------EEEE--------EEEEE--HH-----HH--H-----H--H--HHHHHHHH--- Homo_sapiens_18034767 NNVVCIR-YK--------GE--------------------------MVKVSRNYFSKLWLLYR----------------YS----C--IDDSA-----FE-------------------------RFLPRVWCLLRRYQMM-FG-VGLY---EG------T-----GLQGSL---------PVHVFEALHRL-FGVS--FECFASPLNCYF-------RQYCSAFPDTDGYFGSRGP-------CL--D--FAPL---S-------------------GSFEANPPFCEE---------LMDAMV-------SHFERLLESS---------------P---E--PLSFIVFI---PEW-REP---------P---TP-ALTRMEQ---SRFKR----HQLILPAFEHEYRSGSQH-----------IC--K-K-EEMHYK--A-VHNTAVLFLQNDP-----GF--AKWAPTP--E--RLQELSAAYRQ Aureococcus_anophagefferens_323451439 
LPQTREL-LR--------HE--------------------------LAPASNLAYDTLGCVVI---DH---VAPA----RR-------ARDAD-----QE---------ALDA------------AAEAARLALVDDAAAL-VA-ARRG---GG------G-----GDAARLWARLDGRAADCRRLAALAAA-SAADR-ARADAARRSSLK-------APWATALDDV---FGPDGS--LEL--NV--A--LAPT---RRMKGLDTWRAVRWLGAQHPRRVSLQPPVLFE--Y------YAEALAWGDAGATKAVGRMLLDG------GEG------E------PGRHALLY-A-PPR-ARG---------P---AS-VFCALCR---AMERSG---DPSEIAASAADYVACFDW-----------VR--S-------------VCGRGAFAAALAD-R---GD--DAFGGAA-----VLHALCAATPS Tetrahymena_thermophila_89295554 LQQRYQN-FQ--T--I--QD------------------------E-QDQSDQEEVNQLLSQEE---QN-----------KD--L-S--ISDQR-SQG-NN-IAHKDT------------------IMNQKIFLTLFWYDYI--G-I------QN------------GQQWSL---------NSEVFDLLKQF-LNIN--TEVFASPFNRNL-------ENYFSLF-ESDKYFGSFGN--FNKN-YL--N-------I-Q-------------------QNFQANPPFIDN---------LFTHFA-------AQILQILEIN---------------TQNNR--EIGCVIVF---P-W-QDN---------Q------GYYQLQN---SDYFI----DEIELLKNAHYYTDQNSC--------SS-IK--S-------------KFNTYILILGNTF-FK------DKYLQCP--Y--LSDSIVQAFQI Naegleria_gruberi_284097202 NRERVQR-FS----NI--LT--------------------------SHQLNMSHYEKLKKHFH----------------HRLNIIS--ANDSRKREVLQD-------------IFFNRSNDYTTCVFDYFVYCLLLRYGAM-FG-AGEKRF-EG------T-----GLHAAC---------PVEVFETLNKN-MNVN--SENFASPLNSYF-------VDFCSACGDLDFWFGSLGG-------FF--E--FFPK---S-------------------GCFESNPPFSEE---------LMQMMV-------SHMENLLANS---------------T---E--SLSFLIVV---PNW-MDA---------L---S---LQRLCS---SSFLT----HEHVLGANVHAYITGSQH------N----EK--N-V-HKRKYN--A-VHETHFFVLQNEK-----GK--TINPITS--Q--FIDYFLKSFAS Emiliania_huxleyi_551547167 
GSGGTVT-LS--C-----GG------------------------E-SVVCKRSHLEKLRAL------------------LR----S--GDAGD-----AA-AASGER------------------LFERRAYCVLARVLAL-----------QG------GEPRAGGMQAAV---------GYRVFDALARH-HGAA--FELFASPLNARF-------SSFCSAAPDVDRAFGSVGS-------FFSPS--LDPL-LAS-------------------GAFQANPPYDPP---------LVAAMG-------ERMHALLASA---------------DARRD--ALTFIVII---PHW-QDK---------P------CWRALEQ---SCRCS----AHLRLPQAEHGFFEGGQH-----------YR--P---ALWRAA----NHDTSLFFLQSAA-----AP--R--PSEA--S--LAA-LRTAFRA Ostreococcus_tauri_116057704 RVDKDVVTLS--V-----GK------------------------S-DVRVNAEHLEKLKEMY-----------------RI----A--NGDRF-----TE-K-----------------------VFMADVYTMVARYDAA-----------QGGQYRFAG-----GHHTAL---------HGEVFDVLRDA-FFVS--CELFASPLNARW-------PTFCSAHIDVDYAFGSLGS-------YR--D--FRPS---H-------------------GSYEVNPPFDEE---------LVGDMS-------NHLFELLQNA---------------T---G--ALTFVVIT---PYW-LNR---------P------CWEDMRR---SKFCT----RCEVLSVREAGYFEGAQH-----------RK--K---SRFRFA----TSDTSVLFLQNEP-----XX--XXXXXKSSTG--RSRSPNERMPR Toxoplasma_gondii_221486586 GRG-----WR--------GG----DW--------------------TCECVRDLIDRIRSVKS----------------IG----S---------------------------------------LFASAVFALLCRYHSI-CG-CQN----QG------K-----GLQYAV---------PPNVLDVLRDD-LKVN--CELFASPFNVHF-------DNYCSIFPDVDVMFGSRGS-------FF--D----PS-F-Q------------LLE----GSFEANPPFDEV---------LMARMV-------QRLLSWLKKS------EERQRESL--------PLSFCLSL---PDW-SNG---------P---SE-FMYLLKK---SEYLR----YSEVIPEGKHVYLNGFQH-----------FC--H-T-CDLEVP--A-VCGTFFAVLQNEA-----GT--KAWPVTD--T--FINRLKEAWS- Micromonas_pusilla_226459779 
TAKAKAK-LT--C-----GK------------------------V-SVECNAAHLEKLRALHR---------AYAR---RR-GG-G--GGGRR-RAR-AR-SAADDE----R-------------IFREDAFSLLARYSSL-QG-AHYK---AG------------AMQAAL---------PPAMFDALREH-FDVS--MELCASPFNCRW-------RRYCGAALDVDAAFGSLGSGAFYFHTVF--E--FTPS---G-------------------GSFEVNPPFDPG---------FVERLV-------AHLESVLSRSATRTTDGDDDDDDDDDDDAE--ALSFVVVV---PYW-PEK---------R------AWLRLVN---SEFAR----KVLKLRAGAHGFVAGAQH-----------LR--P---DKVVPS----AAATSVVFLQNDA-----GA--KRWPVTS--E--KIDAVKEGFRG Sphaeroforma_arctica_Sarc1000002473 NEAKKGD-TA--I-----TG------------AIKRSNSSTVK-----RVTSDKHGKPIELDD---KQ---------------------------------HKNTEE------------------LFHNDLYSMLLRYNAL-----------SG------M-----GFQAAC---------SEHVFAALKSL-FRTD--FECFSSPLNTHH-------PLYCSAYIDTDFHFGSRGN-------FF--D--FYPA---S-------------------GSFQANPPFVTG---------VMERMA-------KHIETLLQKA---------------NTNSQ--PLSFIVVV---PGWVNEV---------S------YQTMLAS--PFMEGD----GPLLISKDDHGFCDGAQH-----------QR--Q---DRYRNS----PFDTAIFFLRTDAARNVYGE--RTFESDA--I--LRKAFAEALPS Nasonia_vitripennis_156553336 REQTMLR-FH--------ND--------------------------AMCINNTHLAKLEHLYR----------------HN----C--FDDRK-----FE-------------------------MFLPRVWCMLKRYQTY-LG-S--T---ES------Q-----ATQMAL---------PVTVFECLQRM-FGVT--FECFASPLNCYF-------RQYCSAFADTDAYFGSRGP-------FL--D--FRPI---S-------------------GSFQANPPYCEE---------LMEAMV-------NHFERLLSDS---------------T---E--ALSFVVFL---PEW-RDP---------A---PN-ALLKLEA---SHFKR----KQVVVPAMEHEYRHGFQH-----------VL--P-K-GEVNIR--A-IHGTLVVWLQNAA-----GH--ARWGPTE--E--RVEALLEAWRP Lottia_gigantea_Lgig1000003858 
NDITSLR-YK-------SET---------------------------VKINSSHFHKLEQLYK----------------LN----C--RDDPR-----FD-------------------------HFLCRVWCLLRRYQTY-FG-IHTN---EG------F-----GLQGAL---------PVTVFECLHRV-FGVT--FECFASPLNCYF-------RQFCSAFTDTDGYFGSRGD-------IL--N--FFPK---S-------------------GSFEANPPFCEE---------LMEAMV-------DHFENLLHES---------------N---E--PLSFIVFI---PEW-RDP---------P---TE-ALMRLES---SRFKK----KQITFPAYEHEYRNGFQH-----------IC--P-K-NDMSVK--S-LHGTVAIFLQNDA-----GF--SKWGPTP--E--RIKELLLSSKP Phytophthora_sojae_Psoj1000010133 GMRQ----LT--Y-----GN------------------------S-TVKLSAAHFAKLREMYA---RK-----------QG----L--GGDGS-----SM-APKDQR------------------SFESALFCLLLRYDSL-----------DG------G-----GFQAAL---------NEECFDVLLKE-FDCK--MECFASPLNCRY-------SRFCSAFLDTDCAFGSVGS-------FF--D--FSPR---S-------------------GCFEANPPFIPK---------VIKRMA-------DHMTALLNAA---------------D---G--PLAFIVII---PAW-QDT---------E------GWQQLNS---SRYNQ----THLLIPQKQHGYCEGKQQ-----------IR--K---TRWRIA----SFDTSVFFWQNSK-----AC--NKWPVTE--K--KLDSLKSAFKS Capitella_spI_Caps1000025183 GDVVSLR-FK-------SEH--------------------------LIKVNTSHFHKLEQLYR----------------CN----C--RDDPK-----FE-------------------------NFLPRVWCLLRRYHTY-FG-LSAD---EG------S-----GLQGAL---------PVPVFECLHRV-FNVT--FECFASPLNCYF-------KQYCSAFVDTDGYFGSRGP-------LL--D--FSPT---S-------------------GSFEANPPFGEE---------LMEAMV-------DHFESLLSES---------------N---D--PLSFIVFV---PDW-RDP---------P---TE-ALMRLES---SRFKR----KQATVVAYEHEYRQGFQH-----------IV--N-K-ADTNIR--A-SHGTVIIFLQNEA-----GF--NKWGPTR--E--RLNELLLAYNP Ostreococcus_tauri_116058754 
GLDARVV-IR--D-----TG------------PFVSFQLNQKK-P-YVKVTKTHLGKLRALYC---RT-----------CR----R--GKPLS-----EDVNSSEYI------------------VFAYCVFALLLRYGESL----------GG------A-----GYQAAL---------GEDAFDVLKER-LGVS--CECFASPLNARY-------ARFCSAFFDVDKYFGSLGN-------FF--GHGFKPR---A-------------------GSFEMNPPFVPE---------TMLAAV-------EKASALLDEA---------------QKRGA--ALSFVVVV---PAW-KEC---------K------FWHFLQS--CLHLQH----CD-IVDAESHGFCDGAQH-----------VRPMH---ERHRVS----SFDTGVCYLQTAA-----AA--AHRPCDR--A--LRDVVTSAMKR Phytophthora_infestans_262109934 GMRQ----LT--Y-----GR------------------------S-TVKLSANHFTKLREMFA---KK-----------QG----L--GGDGS-----NM-APKDQQ------------------QLECALFCLLLRYESL-----------DG------G-----GFQAAL---------NEECFDVLLKE-FDCK--MECFASPLNCRY-------SRFCSAFLDTDFAFGSVGS-------FF--D--FSPR---S-------------------GCFEANPPFIPK---------VIKRMA-------DHMTALLNAA---------------D---G--PLAFIVII---PAW-QET---------E------GWQQLNA---SRFNQ----RHLLIPQKQHGYCEGKQQ-----------IR--K---TRWRIA----SFDTSVFFWQNSK-----AC--NKWPVTE--K--KLEGLKQAFRS Helobdella_robusta_Hrob1000018333 GEVVTLR-LKLAAAGVANND--------------------------VMKINSMHFRKLEQLYM----------------LN----C--RDDPK-----ME-------------------------HFLHRTWCLLKRYNTF-FG-TKEN---EG------F-----GLQGAL---------PVSVFQCLNRS-FGVT--FECFASPLNCYF-------KQFCSAFPDTDGYFGSRGS-------IL--D--FYPI---S-------------------GSFEANPPFNEE---------LMEAMV-------DHFESLLSET---------------P---L--PLSFIIFL---PDW-KDP---------P---TE-ALIKLES---SRYKR----QQMTIPAMEHEYRHGFQH-----------IC--Q-R-KDLNVR--S-LHGTLVIFLQNDA-----GA--NKWSVNN--D--NMRELLYAYQL Phaeodactylum_tricornutum_219110361 
SRVFSLV-FH--------RK----SWKKPF----------------RVKINVSHYHKLKTAFL---RV-----------HN----S--DHQLK-PIL-LY-DHGKPT-KAIH-------------SFHLIIMSLLLRYSAL-SG-GQLL---VD-LRG--G-----GMQGAV---------HDEVFEALQTC-FPNESFLECFASPLNCYA-------ANFGSAFTDIDFHFGSVGD-------FL--D--QSIS---H-------------------GVCEANPPFSPG---------LMDTMV-------DRIEYNLTLA---------------DQTSS--CLTFVVII---PTA-STSEDVRTAKRFA---TK-SFQRMLG---SAACR----LHISLAARDHGYIEGAQH-----------LR--P---TRYKES----NFDTSVILLQSSA-----AR--KENIDEN--N--LEKRLRSAFTS Harpegnathos_saltator_307208075 REQTMLR-FH--------GD--------------------------TMCINNIHLTKLEHLYR----------------YN----C--FDDKK-----FE-------------------------MFLPRVWCMLKRYQTY-LG-I--N---EG------Q-----ATQMAL---------PVTVFECLQRS-FGVT--FECFASPLNCYF-------RQYCSAFADTDSYFGSRGP-------FL--D--FRPV---S-------------------GSFQANPPYCEE---------LMEAMV-------NHFERLLADS---------------T---E--PLSFVVFL---PEW-RDP---------A---PN-ALIKLEG---SHFKR----KQVVVPAMEHEYRHGFQH-----------IL--P-K-GEVNIR--A-AHGTLVVWLQNPA-----GA--ARWGPTE--E--RVEALLEAWRP Sphaeroforma_arctica_Sarc1000006366 ------T-AE--T-----SARARSVDCVEV--VITGKKAREAL-L-ERATTDLRAFDLTYNQM---TVRINKAHHDKLRLL----H--SRNAP-----QS-ERGDDS------------------ALNSRVFSLLVRYHTLQGG------HVQG------G-----GMQAAL---------IEDTFDALLRN-FGVN--FECFASPLNSRY-------GQYCSMFADTDGPFGSVGS-------FF--D--FYPL---S-------------------GSFEANPPFEDG---------VIHRMA-------MHIDVLLDRS---------------DRENK--PLSFVVVI---PAW-AES---------S------GWQRLNQ---STHLK----RLLTLSQRDHGFCEGTQH-----------SR--P---TRYRIS----TYVVPIVVGLAMT-----GDFFAYWVISA--V--MVGMVLSLVRL Emiliania_huxleyi_551601616 
GEEDGVG-LR--E-----AEVGGGVWCVSLPPALLAPLPQELR-K-PLKISDEWLSKLREMHT---ATVASTAPA----SA----P--ASASA-----AS-AAAAEL------------------RFRSDLARLLLRYKAL-----------GG------S-----GFQAAI---------GGGAFAVLRAS-FGAR--LECFASPLNARS-------APFCSAFPDVDAPFGSLGS-------FL--E--FEPE---A-------------------GAYEANPPFVPL---------VLRAMC-------AHMHRLLDRA---------------EASRR--PLLFVVVVGASSAL-KRH---------A------AWEDLQGLAAGRHGR----AQWLLPLHAHGYTEGHAH-----------IA--K---GGARAARRMSSCDTAVFVWASSA-----GA--EQWPVTD--G--AEAALRAAMKA Emiliania_huxleyi_551569354 GEGGLVE-VH--V-----HG--------PL--LRLSLSSAPGG-T-HVDVSHEHYAKLAALHA---KH---------------------------------APSVAG------------------NLRTRVLCMMLRYQSL-----------GA------H-----GNQCAL---------PPAGFEVLRQR-LGIR--FECFASPLNARY-------DRYCSAFADTDAAFGSIGS-------FF--G--FRPT---H-------------------GSFECNPPFVPEAPLAAVRPPVLLAAV-------KHAEALLSAA---------------EASGG--ALSFAFVV---PSW-ERV---------P------FHHQLCR-SAFLRGG----APLRLAAEAHGFVDGAQH-----------LK--AAAGDRLRVS----SFASTVGVLQTAA-----AA--ERWPVDA--A--LYSRLSAAFAG Saccoglossus_kowalevskii_291221943 GELTCLK-YK--------DQ--------------------------VCKVNSAHFQKLEQLYK----------------LH----C--LDDPR-----FD-------------------------NFLGRAWCLLKRYQTM-FG-LRTN---EG------S-----GLQGAL---------PIPVFQVLNRH-FGVT--FECFASPLNCYF-------KQYCSAFNDVDSYFGSRGP-------VL--D--FYPV---S-------------------GSFEANPPFGEE---------LMEAMV-------DHFESLLDKS---------------T---D--PLSFIVFI---PEW-RDP---------P---TP-ALVRMEA---SRFKR----KQCLIPALEHEYRSGTQH-----------TC--S-S-HELYYR--A-VHGTLAFFLQNDA-----GF--EKWGPTP--D--RVKALLDAFIP Emiliania_huxleyi_485621692 
--------------------------------------------R-PRSADESMRADLRRAGM---APAASAAVVQAVRSA----S--ARAAT-----RV-DNFRST------------------AAEGGRLRVRLKREED-----------GA------T-----RLEAAL---------GGAAFSALQRC-LGVN--FECFASPLNCYY-------GAYCSAFPDVDAPFGSRGS-------FR--G--FAPR---R-------------------GSYEVNPPFVDG---------LIARMA-------ERLLSLLAAA---------------HAACE--PLTFVVVL---PGW-LDS---------E------GYRALDG---SSHLR----AKLLVAAADHGFVDGGQH-----------AR--T---RTFRES----PYDTALFFLQSGA-----AA--AV---DE--A--CVESVRTALAR Giardia_lamblia_159115031 SHSLDLK-FK---------------------------------------LSSIHFYKLRELYK----------------RT----SGKRFDPE-----MK-------------------------MFSKLLFILLRRYHTF-FG-TERF---EG------T-----SFHAAA---------PENIFRRLKSF-LEVS--QECFASPLNCFF-------SQFCSAFPEIDVFFGSLGS-------FF--D--YDIA---E-------------------GSFECGPPYTLE---------CMDRTA-------KHIIRTLDKS---------------E---NRRPIMFVVFV---PEW-RVP---------P---AQ-YHLDLEE---SAYTR----FHFCAPGGKHYYVSGEQHEPKCIASKGALTN--E-K-VGRYYL--V-PHGTHVYFVCNDA-----GF--KRYAKGS--EDYLEKAADDILRV Emiliania_huxleyi_485638053 GEEDGVG-LR--E-----AEVGGGVWCVSLPPALLAPLPQELR-K-PLKISDEWLSKLREMHT---ATVASTAPA----SA----P--ASASA-----AS-AAAAEL------------------RFRSDLARLLLRYKAL-----------GG------S-----GFQAAI---------GGGAFAVLRAS-FGAR--LECFASPLNARS-------APFCSAFPDVDAPFGSLGS-------FL--E--FEPE---E-------------------GAYEANPPFVPL---------VLRAMC-------AHMHRLLDRA---------------EASRR--PLLFVVVVGASSAL-KRH---------A------AWEDLQGLAAGRHGR----AQWLLPLHAHGYTEGHAH-----------IA--K---GGARAARRMSSCDTAVFVWASSA-----GA--EQWPVTD--G--AEAALRAAMKA Aureococcus_anophagefferens_323449955 
--------------------------------------------T-ARATTVARRDGGARHFF---CGDATAELPEKV-DA----K--LRELA-----RR-AGTKPT------------------DVDACILAMTMRYDAL-----------GG------S-----GFQAAL---------PGAAFRALRDR-FGVN--FECFASPLNAYY-------ERYCSAHADVDAPFGSLGS-------FY--D--FSPR---R-------------------GAFECNPPFAPA---------PLLRAA-------RRCDALLAAA---------------EARRD--ALAFAFVA---PVW-TDQ---------A------AWAAVDG---SRFKR----GAVRVPREDHAWRD---------------AR--T---ARARRV----PVDTAIFILATSA-----AE--AAHPCDA--A--ALGEVRAALLV Ectocarpus_siliculosus_298709108 SAGQTSG-RR--------EA--------------------------SFAITGAHLHKTWSAYC----------------R-----C--VGGDS-PVW-DR-------------------------NFLRRLFCVLSRYETL-SA--------TS------D-----GYQMAF---------PASGFRLLRHL-VSVD--CECFASPLNCTL-------SRFCSVAYDTDKFFGSEGN-------FF--Q--SEYQ---Q-------------------GSFEANPPFVEE---------VMERMV-------DHMHHLLRRA---------------T---G--PMSFAVIV---PGW-DDD---------G---CV-SYQNMKN---SRFARPHPGFYLTLQKGMHNYRPGMQH---------------R-Q-DVEEKP--S-NCNTFLFILQNDW-----GA--DAWPVST--T--SLGQLQAELES Sphaeroforma_arctica_Sarc1000000137 HSTHQAQ-VE--E-----RG------------ALLAYSLRTKKKP-FFSLSRPHAAKLRSLYA---RT-----------R-----G--GKWVD-----KE-G-SDND------------------RFVDAVFCVLARYDAL-----------GG------A-----GYQAAL---------NEASFDVLKDK-MRVD--CECFASPLNCRY-------GQFCSAFPDTDSPFGSLGS-------FF--D--FYPS---K-------------------GSFEMNPPFVPE---------VLCAAA-------EHANALLSLT---------------KEP-----LSFVVVV---PAW-KEV---------R------MWQVLSN--SAYNKH----EPLILTASNHGYCDGQQH-----------QRRPS---ERYRVS----SYDTAVFFLQNDA-----GA--KKWPVSE--A--IRNELVESMHK Danio_rerio_125819445 
NDIACLR-FK--------GE--------------------------MVKVSRGHFNKLELLYR----------------YS----C--IDDPR-----FE-------------------------KFLSRVWCLIKRYQVM-FG-SGVN---EG------S-----GLQGSL---------PVPVFEALNKQ-FGVT--FECFASPLNCYF-------KQFCSAFPDIDGFFGSRGP-------FL--S--FSPA---S-------------------GSFEANPPFCEE---------LMDAMV-------THFEDLLGRS---------------S---E--PLSFIIFV---PEW-RDP---------P---TP-ALTRMEA---SRFRR----HQMTVPAFEHEYRSGSQH-----------IC--K-R-EEIYYK--A-IHGTAVIFLQNNA-----GF--AKWEPTT--E--RIQELLAAYKV Ciona_intestinalis_198420771 KRHVCLT-YN--------GE--------------------------LVRLNYLYLEKLEALYR----------------IS----C--KDDPK-----ME-------------------------LFLQRVWCLLRRYQTF-FG-PNQY---EG------I-----MLQGAL---------PSTVFECLYSV-FGVT--MECFASPLNSYY-------KNYSSAFADTDCYFGSSGP-------LM--K--LFPV---S-------------------GSFEVNPPFAEE---------LMEAMV-------DHFEKLLAQS---------------N---E--PLSFIVFV---PEW-RDP---------T---PI-AILRMET---SKFKR----KQVLVPAFEHEYRSGLQH-----------VA--P-L-KEVYHK--A-VHGTMVFFLQNES-----GF--QKWGPTQ--D--RLRKLLTAFRP Camponotus_floridanus_307182697 REQTMLR-FH--------GD--------------------------TMYINNTHLTKLEHLYR----------------YN----C--FDDKK-----FE-------------------------MFLPRVWCMLKRYQTY-FG-I--N---EG------Q-----ATQMAL---------PVTVFECLQRS-FGVT--FECFASPLNCYF-------RQYCSAFADTDSYFGSRGP-------FL--D--FRPV---S-------------------GSFQANPPYCEE---------LMEAMV-------NHFERLLADS---------------A---E--PLSFVVFL---PEW-RDP---------A---PN-ALIKLEG---SHFKR----KQVVVPAMEHEYRHGFQH-----------IL--P-K-GEVNIR--A-VHGTLVVWLQNPA-----GA--ARWGPTE--E--RVEALLEAWRP Naegleria_gruberi_284095070 
GTVVD-----------------------------------------SFKLNLVHFNKLRLLYQ----------------KH----N-QEIDPD-----LK-------------------------IFPYRLYALLRRYQTF-FGDSESE---EG------A-----NFHAAL---------PEKGFEFLYKE-FNVC--HECFASPINCYF-------SSFCSAFPDTDVYFGSRGS-------FF--E--FRPT---Q-------------------GFFECGPPYTLE---------VMNKTA-------EYCLQLLKAS---------------D---E--PLSFAVFV---PEW-TDT---------E---YG-RMLHPDS---TPLCT----GHLLAEQGKHEYVIGMQH-----------FK--E-N-EKRYWT--L-PFPTHVYFLQNEK-----GK--EKWPITP--Q--LIERYKKVMEI Ectocarpus_siliculosus_298711525 NTYV----LQ--L-----GK------------------------N-KLRMNSAHYDKMKELFS---RS-----------RV----E--GAARR-----QS-SSTEHPPPAWIG------------DFHDCLFSCLMRYEAL-----------QG------G-----GFQASM---------GGDAFDVLLKR-FGAR--MECFASPFNCRY-------SRYCSAFPDTDGPFGSAGS-------FF--D--FQPT---Q-------------------GAYEANPPFVRD---------VILKMA-------NHMDGLLQAT---------------A---K--ALTFVVII---PCW-EDS---------A------GWKRLRD---SAFLS----KHIKLDQKDHGYCEGKQH-----------LR--R---NRYRLA----SFHTSVFFLQTDV-----AR--RQQSPETLGQ--ACRELERAFAL Aureococcus_anophagefferens_323451821 --------------------------------------------S-ARAVPAATLDDANDEAE---AALAARPGRIPK-RK----R--PQARG-----DG-DDERPP------------------AFLSKVFSMLLRYDDL-----------AG------D---A-GQHGAI---------PAAVFDVLR-R-WGCD--AECCATPFNATL-------GSYCSPFRDTDAPFGSVGS-------FF--A--FEPA---S-------------------GCYEINPPFTLN----------SDVVE-------RHLRTLLDAA---------------ERGGR--PLMFVMVH-A-AAH-ARH---------ARDGATRALPARDG-PCARYLR----RDFLLAAGAHHYREGKFY-----------AR--A-L-PRAYVP----PMPSIVLFLATDA-----GA--RRWPATR--E--LQAGIEDAFAW Micromonas_pusilla_226462261 
SIK-----LT--L-----HD------------------------A-EVEINEQQYEKLRWLYESDIQA-----------TQ----F--GGTCA-----KP-SLANTFCESGI-------------TFHSAVFAMLCRYASA-----------HGGMHCMAG-----GHHNAL---------HGDVFDALNIG-LGVH--AECCASPLNCHW-------RLYCSGHPDTDLTFGSLGS-------VF--S--FDPV---D-------------------GYFECNPPFEES---------VLLDCI-------KHIDSLLDVA---------------EVAGK--SLSCVFIV---PHW-PGR---------R------AWETLFR---SVHKS----HTEVIPLREHGFLEVLTG-----------PF--S---ATF-------SYNFHSCFLSKLK-----TF--VSHLIET--H--LTKFLCRGHRK Emiliania_huxleyi_485623173 AVEAPAW-WS--T---------------------------------EGSPGGVEIEAEVAG------------------AR----L--SAQAS-----EA-TPDPRA------------------AFHQRLFALLLRYKTL-----------RG------H-----GFHAAI---------APAVWRVLTSR-LGVG--FEAFASPLNCYL-------PTFGSGFSDVDGAFGSSGS-------FF--R--LKPAQLAS-------------------GSCACNPPFVHA---------ILDAAA-------ARVEELLAAA---------------AAADA--PLSFAFIM---PGW-KET---------R------AHASLSA---SPFLR----RAVLVAAADHGFCDGASH-----------QR--A---DPLRAS----PYDTVVFVLQTER-----GS--RKWPADG--R--FEAELRAAFAA Micromonas_pusilla_226460900 DADAVVT-VD--D-----AG------------PLLALKVNAQK-P-YMQVSKQHMGKLRALYS---RH-----------SL----G--GAPLP-----PE-GSSEHA------------------AFAASVFALLARYDAC-----------GG------A-----GYQAAL---------GEKAFDVLKKR-VGVG--CEAFASPLNARY-------GRFCSAFPDVDGPFGSLGS-------FF--D--FAPT---R-------------------GSFEMNPPFVPE---------VLLAAA-------ERAEKLLRTA---------------EESDS--RLSFVVVV---PAW-RDV---------P------MWTALEK--SAFKRG----DALIVPASAHGYCDGAQQ-----------IRSPS---ERHRVS----SYDTGVFFLQTTA-----GA--RRWPVTE--E--IRAELLEGMKA Thecamonas_trahens_Ttra1000007497 
RKDR----FV--P-----TK------------------------SLAVAINVEHVTKLAARYT---GP-----------RP----A--EGDVD-----V-----------------------------ELLFLLLFQYNAL-----------EG------G-----GFQAAL---------PDPVFDLLAAAPFHAD--TEAFASPLNVTL-------PRYHSALPAIDAAFGSSGS-------FF--D--AMPD---A-------------------GVVEANPPFTEP---------FITRML-------AHMHKCLAAA---------------S---G--PLTFVVIV---PAW-RQS---------P------AWLALTT---SPHAS----RTAVLPAAEHAYCEGKQH-----------LR--K---TRFRLA----SNDTSIVFLQNEL-----AA--ASLTITD--A--HVEAIAAAFAA Monosiga_brevicollis_167534686 --ELVIY-FA----------------------------------A-SQPVALHRLRRLDWMVR-G-AG-----------RH----I--PDDVP-PAK-HD------------Q------------WQLAQVARCIFRY--------------AA------------HLHTTA--QHWGH--PQEFYNFLARR-LGLL--REAFACPLNSRVLGYNDPAARFCSLYRDTDAPFGSLGS-------FF--E--TDML-A-S---G---------------YGWVVHPPFTED---------ILNRLS-------AQCQSALQQA---AS--------------Q--DRQLIVGIGW-PNW-TDM---------P------SYHQLRD---SPFKR----SEVLQVKYNYHYERLNGT----L------VK--P-------------NFTNIYLILSAQP---------LNEDQQA-----AVREFETIVAH Monosiga_brevicollis_163777248 DDKVVLR-CP--TVGA--DC--------------------------LVSMKSDHYHRLAKLYR----------------TH----C-EAFDPE-----RQ-------------------------HFHRLALCVGLRYQTI-LF-EPYV-----------------DGENSM---------PAAFKAFLTKQ-LHCC--FDSFASPFNAYY-------RNFASAAPDLDSFFGSVGS-------FF--D--FTPS---Q-------------------GSFVAFPPFVEL---------VLDRTA-------DRIEALLNAS---------------S---D--PLSFVVMM---PEW-RLY---------K---LH-ALDVLDQ---SPHRR----ADFLIEGNKQAILSGQSW-----------YG--D-V-GQQWKL--A-RFGYRVYVLQNEA-----GF--AQWGAGV--A--ALEENKRAFGA Proteobacteria_bacterium_655448079 
PAVSMFR-VE--V--L--RS------------RVLGAMR-----S-ACEVSGLKFVNVEELLT----------------KV-IF-S--VRAHN-----LP-------------------------QYDAVLPSIIAGK----------E---DG------D-REV-GGSAVV---------SKVFFNYITGN-VPSE--TKAMEN-LPSTL-------ATICGQAAALVYHHGS----------TL--Q-----------------------------GS--AEPP---------------KNVV-------EVKKRLWPGQ-T--------------------NAHYAALL-------YNG---------R------EVCRLDM---SRYKR------LVVAHNRYDPSSPGQC-----------------I-DRIFTM----AMRYQSLMLSTKC-L---GM--HAALPNN-----VFKLLVDALGC Psychroflexus_gondwanensis_489532476 SSPDTLL-ID------------------------------------DFKLKRNQYIDKAYHFV----------------KS-------KTDDV--------------------------------TATNAILRSALRY----------------------------GSIYAETRHIGP---PQKVYDLFYK--WGIR--NEGFASPFNARL-LGKPK-AQFYSLFEDTDEIFGSGGS-------FF--N--LNHP---E-----NP------------GHWCLDPPFTSE---------LMTKVD-------SILASWLETY-K--------------------DLSFLLII---PE--SHA---------P----------------SNVPD----ESVTLKKDLH-YYEGLEG----V------LK--P-L-----------PVNVCIHRYGNIE-----GF--------------SSDAIEEGYSK Psychroflexus_tropicus_517867866 IDATSFG-IG------------------------------------EFKLHRNQYIDKAFHFV----------------QK-------KSDKT--------------------------------EAFNAILRSALRY----------------------------ASIYAETRHIGP---PQRVYDMFYD--WGIR--NEGFASPFNARV-LGKPQ-AQFYSLFKDTDKIFGSGGS-------FF--N--LEMP---E-----NP------------GHWCLDPPFTTE---------IMTKVD-------HILETWLETY-K--------------------ELSFLLII---PE--SHT---------P----------------ANQPD----ETVTLKKDTH-YYEGLEG----V------LK--P-L-----------PVNVCIHRYGQFE-----GF--------------SAKAILDGYSK Psychroflexus_torquis_504836046 
SSPDTLS-ID------------------------------------DFELKRNQYIDKAYHFV----------------NS-------ETDDA--------------------------------TATNAILRSALRY----------------------------GSIYAETRHIGP---PQKVYDLFYK--WGVR--NEGFASPFNARL-LGKPK-AQFYSLFENTDEIFGSGGS-------FF--N--LNSP---E-----NP------------GHWCLDPPFTSE---------LMIKVD-------SILASWLKTY-T--------------------DLSFLLII---PQ--SHT---------P----------------SNKPD----ETITLKKDLH-YYEGLEG----V------LK--P-L-----------PVNVCIHRYGNIE-----GF--------------SSDAILEGYSK Listeria_monocytogenes_733112108 --------MK------------------------------------TIANEYETLEAIKKAMA-M--------------YE--L-K--KADKD--------------------------------HVATPRYVVEDIYSLI-----------DI------E-----SFKSLWF--------PFNHYDSLFK--LRAE--ELNLKY---------------------KATHIFDNVGN-----D-FF--T--TEPP-I-D---------C---------DLMISNPPFSQQ-----------NEII-------ERSFQLIDEK-------------------K--IKSFALLL---P---LST--L------E------TEKRANI-F-AQYSDK---LAILIFKKRIKFLGHSTS-----------FN--R---GCCWVC-------YNISALEDKR---------IQWV------------------- consensus/100% .........................................................................................................................................................................................b..h..........................................ass..................................................PP................................h.............................hh........................................................................................................................................... 
consensus/95% ..............................................................................................................................h....h....bh................................s..............h..h....h......c.hus.hss...........a.u...p.s..FGS.Gs.......hh..................................h.h.PPa...........................h..hL..........................b.hhhhh...s.............................................h......a........................................h..h.................................... consensus/90% ..................................................p...h.p.....................................................................h...hh.hh.cY............................s.p.uh.........s...a.hL....h.hp...EshAoPhNs...........a.S.h.-.D..FGS.Gs.......hh................................G.h.hsPPa.............h..hs.........h..hL..........................h.ahhhh...P....p...........................u...p........l....a.a..s..................................s..h.hh.s.......s...................h...h.. consensus/85% ........h........................................hs...h.cb....................................................................h...hh.hh.RY...............s............u.p.uh.........s...aphL....h.hp...EsFASPhNs.h.........ahShh.D.D..FGS.Gs.......hh..p.........p...................G.aphsPPa............hh..hs........ph..hL..s......................sl.Fhlhh...P....p.....................h.....u...p......h.l....H.a.ps.p................................ss.hhhh.s.......u...................h..sh.. 
consensus/80% ........hp.......................................hs..ph.cl................................s...................................h...lh.hl.RY..h............u............u.psAh.........s..sFphL.p..hshp...EsFASPhNsbh........pahSha.DsD..FGS.Gs.......hh..p..h.P....p...................GsaphNPPFs...........lh..hs.......p+h..lL..u......................sLsFllhl...P.a..c..................h..hp....u...p......h.ls...H.a.pG.pp.............................s.ss.lhhl.s.......u........p..........l..uh.. consensus/75% ........hp.....................................hphs..ph.cl..hh............................s...................................h...lhshl.RYpsh............u............u.psAl.........s..sFchL.p..hshp..hEsFASPlNsbh........paCSua.DsD..FGS.Gs.......ah..s..h.P....p...................GsaphNPPFs...........lhp.hs.......p+hppLL..u......................sLoFllhl...P.W.pcs.................h..hp....o.b.p......h.ls...H.a.pG.pp...........hp................s.so.lhhLps.......u.....bs.s........p.l..uh.. consensus/70% ........hp.....................................hpls..ch.+Lb.ha............................s...................................a...lashLbRYpsh............G............GbpsAl.........s..sFchL.p..hshp..hEsFASPLNsbh........paCSAa.DsD..FGS.Gs.......Fh..s..h.P....p...................GsaphNPPFs...........lhp.hs.......p+hppLLp.u...............p......sLoFllhl...P.W.pcs................sh..lp....S.a.c.....ph.ls..pH.Y.pG.pa...........hp.......p........s.sT.lhhLpsps.....u....pas.s........p.lb.uh..
GI Domain-architecture Pfam Gene name Len Taxonomy Species Genbank 116059373 N6-MTase PCIF1_WW Ot08g03420 422 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri unnamed protein product, partial [Ostreococcus tauri]. 116058754 N6-MTase PCIF1_WW Ot07g01880 729 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri unnamed protein product [Ostreococcus tauri]. 116057704 N6-MTase PCIF1_WW Ot05g01520 1115 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri unnamed protein product [Ostreococcus tauri]. 226514838 N6-MTase PCIF1_WW MICPUN_70757 203 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein, partial [Micromonas sp. RCC299]. 226523283 N6-MTase PCIF1_WW MICPUN_84425 98 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein, partial [Micromonas sp. RCC299]. 226522980 N6-MTase PCIF1_WW MICPUN_55447 432 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. 226522164 N6-MTase PCIF1_WW MICPUN_64863 669 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. 226463465 N6-MTase PCIF1_WW MICPUCDRAFT_12799 101 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 226462261 N6-MTase PCIF1_WW MICPUCDRAFT_70191 490 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 hypothetical protein MICPUCDRAFT_70191 [Micromonas pusilla CCMP1545]. 303290244 N6-MTase PCIF1_WW MICPUCDRAFT_43506 385 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 226460900 N6-MTase PCIF1_WW MICPUCDRAFT_57528 547 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 303290246 N6-MTase - MICPUCDRAFT_54412 180 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 
226459779 N6-MTase PCIF1_WW MICPUCDRAFT_58337 686 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 220970802 N6-MTase PCIF1_WW THAPSDRAFT_24679 722 eukaryota>stramenopiles Thalassiosira pseudonana CCMP1335 predicted protein [Thalassiosira pseudonana CCMP1335]. 220978161 N6-MTase PCIF1_WW THAPSDRAFT_20873 816 eukaryota>stramenopiles Thalassiosira pseudonana CCMP1335 predicted protein [Thalassiosira pseudonana CCMP1335]. 220970802 N6-MTase PCIF1_WW THAPSDRAFT_24679 722 eukaryota>stramenopiles Thalassiosira pseudonana CCMP1335 predicted protein [Thalassiosira pseudonana CCMP1335]. Psoj1000010133 N6-MTase PCIF1_WW Psoj1000010133 468 eukaryota>stramenopiles Phytophthora sojae 137293 Pram1000005973 N6-MTase SMC_N+AAA_21+APG6+SMC_hinge+PCIF1_WW Pram1000005973 1665 eukaryota>stramenopiles Phytophthora ramorum 77822 262109934 N6-MTase PCIF1_WW PITG_18050 344 eukaryota>stramenopiles Phytophthora infestans T30-4 conserved hypothetical protein [Phytophthora infestans T30-4]. 568046702 N6-MTase PCIF1_WW L914_10859 461 eukaryota>stramenopiles Phytophthora parasitica hypothetical protein L914_10859 [Phytophthora parasitica]. 219110361 N6-MTase PCIF1_WW PHATRDRAFT_43163 549 eukaryota>stramenopiles Phaeodactylum tricornutum CCAP 1055/1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1]. 
Fcyl1000123364 N6-MTase PCIF1_WW Fcyl1000123364 276 eukaryota>stramenopiles Fragilariopsis cylindrus estExt_Genewise1.C_350073 Fcyl1000111862 N6-MTase PCIF1_WW Fcyl1000111862 300 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.6.1306.1 Fcyl1000123395 N6-MTase PCIF1_WW Fcyl1000123395 271 eukaryota>stramenopiles Fragilariopsis cylindrus estExt_Genewise1.C_350074 Fcyl1000020973 N6-MTase PCIF1_WW Fcyl1000020973 514 eukaryota>stramenopiles Fragilariopsis cylindrus e_gw1.6.1362.1 Fcyl1000034107 N6-MTase PCIF1_WW Fcyl1000034107 514 eukaryota>stramenopiles Fragilariopsis cylindrus e_gw1.39.270.1 Fcyl1000046637 N6-MTase PCIF1_WW Fcyl1000046637 871 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_kg.27_#_20_#_0_0_CCUX4586.b1_CCUX_EXTA Fcyl1000123964 N6-MTase PCIF1_WW Fcyl1000123964 300 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.39.264.1 Fcyl1000047105 N6-MTase PCIF1_WW Fcyl1000047105 846 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_kg.35_#_10_#_0_0_CCUX4586.b1_CCUX_EXTA Fcyl1000016119 N6-MTase PCIF1_WW Fcyl1000016119 354 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.2_#_93 Fcyl1000014177 N6-MTase PCIF1_WW Fcyl1000014177 577 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.1.2000.1 Fcyl1000088053 N6-MTase PCIF1_WW Fcyl1000088053 190 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.27.207.1 Fcyl1000099591 N6-MTase PCIF1_WW Fcyl1000099591 185 eukaryota>stramenopiles Fragilariopsis cylindrus estExt_Genewise1.C_270101 Fcyl1000049607 N6-MTase PCIF1_WW Fcyl1000049607 618 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.1.174.1 Fcyl1000104830 N6-MTase PCIF1_WW Fcyl1000104830 269 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.1.1865.1 Fcyl1000014986 N6-MTase PCIF1_WW Fcyl1000014986 593 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.1_#_1050 298711525 N6-MTase PCIF1_WW Esi_0035_0124 771 eukaryota>stramenopiles Ectocarpus siliculosus conserved unknown protein [Ectocarpus siliculosus]. 
298709108 N6-MTase PCIF1_WW Esi_0231_0005 625 eukaryota>stramenopiles Ectocarpus siliculosus conserved unknown protein [Ectocarpus siliculosus]. 323451821 N6-MTase PCIF1_WW AURANDRAFT_71790 749 eukaryota>stramenopiles Aureococcus anophagefferens hypothetical protein AURANDRAFT_71790 [Aureococcus anophagefferens]. 323451439 SPOUT+C2+Tox-ABHYDROLASE3+N6-MTase Aa_trans+SpoU_methylase+C2+Lipase_3+PCIF1_WW AURANDRAFT_71849 3487 eukaryota>stramenopiles Aureococcus anophagefferens hypothetical protein AURANDRAFT_71849 [Aureococcus anophagefferens]. 323449955 N6-MTase PCIF1_WW AURANDRAFT_66058 306 eukaryota>stramenopiles Aureococcus anophagefferens hypothetical protein AURANDRAFT_66058 [Aureococcus anophagefferens]. Bnat1000017417 N6-MTase PCIF1_WW Bnat1000017417 214 eukaryota>rhizaria Bigelowiella natans gw1.33.39.1 Lgig1000003858 WW+N6-MTase WW+PCIF1_WW Lgig1000003858 637 eukaryota>metazoa>mollusca Lottia gigantea e_gw1.21.357.1 24667685 WW+N6-MTase WW+PCIF1_WW Dmel_CG11399 920 eukaryota>metazoa>hexapoda Drosophila melanogaster CG11399 [Drosophila melanogaster]. 156553336 WW+N6-MTase WW+PCIF1_WW LOC100116693 763 eukaryota>metazoa>hexapoda Nasonia vitripennis PREDICTED: phosphorylated CTD-interacting factor 1 [Nasonia vitripennis]. 307208075 WW+N6-MTase WW+PCIF1_WW EAI_15124 733 eukaryota>metazoa>hexapoda Harpegnathos saltator Phosphorylated CTD-interacting factor 1 [Harpegnathos saltator]. 91095163 WW+N6-MTase WW+PCIF1_WW LOC656483 664 eukaryota>metazoa>hexapoda Tribolium castaneum PREDICTED: phosphorylated CTD-interacting factor 1 [Tribolium castaneum]. 66535528 WW+N6-MTase WW+PCIF1_WW LOC551754 729 eukaryota>metazoa>hexapoda Apis mellifera PREDICTED: similar to CG11399-PB [Apis mellifera]. 158300743 WW+N6-MTase WW+PCIF1_WW AgaP_AGAP011933 789 eukaryota>metazoa>hexapoda Anopheles gambiae str. PEST AGAP011933-PA [Anopheles gambiae str. PEST]. 
193654861 WW+N6-MTase WW+PCIF1_WW LOC100159733 693 eukaryota>metazoa>hexapoda Acyrthosiphon pisum PREDICTED: phosphorylated CTD-interacting factor 1-like [Acyrthosiphon pisum]. 307182697 WW+N6-MTase PCIF1_WW EAG_11439 735 eukaryota>metazoa>hexapoda Camponotus floridanus Phosphorylated CTD-interacting factor 1 [Camponotus floridanus]. 291221943 WW+N6-MTase WW+PCIF1_WW PCIF1 713 eukaryota>metazoa>hemichordata Saccoglossus kowalevskii PREDICTED: phosphorylated CTD-interacting factor 1 [Saccoglossus kowalevskii]. 115957714 N6-MTase PCIF1_WW LOC581054 1094 eukaryota>metazoa>echinodermata Strongylocentrotus purpuratus PREDICTED: similar to Chromosome 20 open reading frame 67 [Strongylocentrotus purpuratus]. 321469809 WW+N6-MTase WW+PCIF1_WW DAPPUDRAFT_51102 640 eukaryota>metazoa>crustacea Daphnia pulex hypothetical protein DAPPUDRAFT_51102 [Daphnia pulex]. 321459458 N6-MTase PCIF1_WW DAPPUDRAFT_61197 232 eukaryota>metazoa>crustacea Daphnia pulex hypothetical protein DAPPUDRAFT_61197, partial [Daphnia pulex]. 47212430 N6-MTase PCIF1_WW GSTEN:00009293:G:001 673 eukaryota>metazoa>chordata>vertebrata>actinopterygii Tetraodon nigroviridis unnamed protein product [Tetraodon nigroviridis]. 125819445 WW+N6-MTase WW+PCIF1_WW LOC553360 716 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: phosphorylated CTD-interacting factor 1 [Danio rerio]. 114682331 WW+N6-MTase WW+PCIF1_WW LOC458292 658 eukaryota>metazoa>chordata>vertebrata Pan troglodytes PREDICTED: phosphorylated CTD interacting factor 1 isoform 3 [Pan troglodytes]. 224078117 WW+N6-MTase WW+PCIF1_WW LOC100228688 617 eukaryota>metazoa>chordata>vertebrata Taeniopygia guttata PREDICTED: hypothetical protein, partial [Taeniopygia guttata]. 114682333 WW+N6-MTase WW+PCIF1_WW LOC458292 685 eukaryota>metazoa>chordata>vertebrata Pan troglodytes PREDICTED: phosphorylated CTD interacting factor 1 isoform 4 [Pan troglodytes]. 
62646369 WW+N6-MTase WW+PCIF1_WW RGD1310800_predicted 704 eukaryota>metazoa>chordata>vertebrata Rattus norvegicus PREDICTED: similar to posphorylated CTD interacting factor PCIF1 [Rattus norvegicus]. 326936323 WW+N6-MTase WW+PCIF1_WW PCIF1 829 eukaryota>metazoa>chordata>vertebrata Meleagris gallopavo PREDICTED: phosphorylated CTD-interacting factor 1-like [Meleagris gallopavo]. 22122647 WW+N6-MTase WW+PCIF1_WW Pcif1 706 eukaryota>metazoa>chordata>vertebrata Mus musculus phosphorylated CTD-interacting factor 1 [Mus musculus]. 18034767 WW+N6-MTase WW+PCIF1_WW PCIF1 704 eukaryota>metazoa>chordata>vertebrata Homo sapiens phosphorylated CTD-interacting factor 1 [Homo sapiens]. 114682325 WW+N6-MTase WW+PCIF1_WW LOC458292 704 eukaryota>metazoa>chordata>vertebrata Pan troglodytes PREDICTED: phosphorylated CTD interacting factor 1 isoform 1 [Pan troglodytes]. 118100805 WW+N6-MTase WW+PCIF1_WW LOC771963 707 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: similar to Chromosome 20 open reading frame 67 [Gallus gallus]. 327288387 WW++N6-MTase+RING WW+PCIF1_WW+zf-RING_2 pcif1 860 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: phosphorylated CTD-interacting factor 1-like [Anolis carolinensis]. 210126797 WW++N6-MTase WW+PCIF1_WW BRAFLDRAFT_118827 781 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_118827 [Branchiostoma floridae]. 210099455 N6-MTase PCIF1_WW BRAFLDRAFT_267292 730 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_267292, partial [Branchiostoma floridae]. 198420771 N6-MTase PCIF1_WW LOC100187259 515 eukaryota>metazoa>chordata Ciona intestinalis PREDICTED: phosphorylated CTD-interacting factor 1-like [Ciona intestinalis]. 
Smar1000005290 WW+N6-MTase WW+PCIF1_WW Smar1000005290 715 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR008745-PA pep:novel scaffold:Smar1:JH431850:1046606:1050059:-1 gene:SMAR008745 transcript:SMAR008745-RA Hrob1000018333 N6-MTase PCIF1_WW Hrob1000018333 499 eukaryota>metazoa>annelida Helobdella robusta 102536 Caps1000025183 WW+N6-MTase WW+PCIF1_WW Caps1000025183 650 eukaryota>metazoa>annelida Capitella spI estExt_fgenesh1_pg.C_4020007 Sarc1000002473 N6-MTase PCIF1_WW Sarc1000002473 686 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (686 aa) Sarc1000000137 N6-MTase PCIF1_WW Sarc1000000137 556 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (556 aa) Sarc1000006366 N6-MTase PCIF1_WW Sarc1000006366 1130 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (1129 aa) 284097202 N6-MTase PCIF1_WW NAEGRDRAFT_56544 724 eukaryota>heterolobosea Naegleria gruberi phosphorylated carboxy-terminal domain interacting factor [Naegleria gruberi]. 284095070 N6-MTase PCIF1_WW NAEGRDRAFT_78364 756 eukaryota>heterolobosea Naegleria gruberi phosphorylated CTD interacting factor [Naegleria gruberi]. 485621692 N6-MTase PCIF1_WW EMIHUDRAFT_196229 460 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_196229 [Emiliania huxleyi CCMP1516]. 551547167 N6-MTase PCIF1_WW EMIHUDRAFT_422415 480 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_422415 [Emiliania huxleyi CCMP1516]. 485623173 N6-MTase PCIF1_WW EMIHUDRAFT_242884 541 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_242884 [Emiliania huxleyi CCMP1516]. 551569354 N6-MTase Stc1+PCIF1_WW EMIHUDRAFT_461366 920 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_461366 [Emiliania huxleyi CCMP1516]. 
485634611 N6-MTase - EMIHUDRAFT_434687 237 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_434687, partial [Emiliania huxleyi CCMP1516]. 551536260 N6-MTase PCIF1_WW EMIHUDRAFT_221253 311 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_221253 [Emiliania huxleyi CCMP1516]. 485623605 N6-MTase Stc1+PCIF1_WW EMIHUDRAFT_117856 920 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_117856 [Emiliania huxleyi CCMP1516]. 485612123 N6-MTase PCIF1_WW EMIHUDRAFT_437886 208 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_437886 [Emiliania huxleyi CCMP1516]. 485638541 N6-MTase Dam EMIHUDRAFT_111979 250 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_111979 [Emiliania huxleyi CCMP1516]. 485641476 N6-MTase PCIF1_WW EMIHUDRAFT_440930 483 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_440930 [Emiliania huxleyi CCMP1516]. 551536260 N6-MTase PCIF1_WW EMIHUDRAFT_221253 311 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_221253 [Emiliania huxleyi CCMP1516]. 485638053 N6-MTase PCIF1_WW EMIHUDRAFT_112355 538 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_112355 [Emiliania huxleyi CCMP1516]. 551601616 N6-MTase PCIF1_WW EMIHUDRAFT_99458 538 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_99458 [Emiliania huxleyi CCMP1516]. 485617795 N6-MTase PCIF1_WW EMIHUDRAFT_197672 379 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_197672 [Emiliania huxleyi CCMP1516]. 528270883 N6-MTase PCIF1_WW AGDE_02289 379 eukaryota>euglenozoa>kinetoplastida Angomonas deanei hypothetical protein AGDE_02289 [Angomonas deanei]. 528228842 N6-MTase PCIF1_WW AGDE_11734 502 eukaryota>euglenozoa>kinetoplastida Angomonas deanei hypothetical protein AGDE_11734 [Angomonas deanei]. 
528244288 N6-MTase PCIF1_WW AGDE_08890 400 eukaryota>euglenozoa>kinetoplastida Angomonas deanei hypothetical protein AGDE_08890 [Angomonas deanei]. 528274252 N6-MTase PCIF1_WW AGDE_01076 528 eukaryota>euglenozoa>kinetoplastida Angomonas deanei hypothetical protein AGDE_01076 [Angomonas deanei]. 528260169 N6-MTase PCIF1_WW AGDE_05813 455 eukaryota>euglenozoa>kinetoplastida Angomonas deanei hypothetical protein AGDE_05813 [Angomonas deanei]. 159115031 N6-MTase PCIF1_WW GL50803_24111 599 eukaryota Giardia lamblia ATCC 50803 Phosphorylated CTD interacting factor PCIF1 [Giardia lamblia ATCC 50803]. 146083518 N6-MTase PCIF1_WW LINJ_17_1050 1068 eukaryota>euglenozoa>kinetoplastida Leishmania infantum JPCM5 conserved hypothetical protein [Leishmania infantum JPCM5]. 146088136 N6-MTase PCIF1_WW LINJ_24_2080 589 eukaryota>euglenozoa>kinetoplastida Leishmania infantum JPCM5 conserved hypothetical protein [Leishmania infantum JPCM5]. 146104898 N6-MTase PCIF1_WW LinJ36.6020 657 eukaryota>euglenozoa>kinetoplastida Leishmania infantum JPCM5 hypothetical protein [Leishmania infantum]. 157867596 N6-MTase PCIF1_WW LMJF_17_0940 1068 eukaryota>euglenozoa>kinetoplastida Leishmania major strain Friedlin conserved hypothetical protein [Leishmania major strain Friedlin]. 157870335 N6-MTase PCIF1_WW LMJF_24_2000 592 eukaryota>euglenozoa>kinetoplastida Leishmania major strain Friedlin conserved hypothetical protein [Leishmania major strain Friedlin]. 157877658 N6-MTase PCIF1_WW LMJF_36_5530 656 eukaryota>euglenozoa>kinetoplastida Leishmania major strain Friedlin conserved hypothetical protein [Leishmania major strain Friedlin]. 163777248 N6-MTase PCIF1_WW MONBRDRAFT_6793 638 eukaryota>choanoflagellida Monosiga brevicollis MX1 predicted protein [Monosiga brevicollis MX1]. 167534686 N6-MTase PCIF1_WW MONBRDRAFT_28562 509 eukaryota>choanoflagellida Monosiga brevicollis MX1 hypothetical protein [Monosiga brevicollis MX1]. 
239900371 N6-MTase PCIF1_WW Pmar_PMAR019893 212 eukaryota>alveolata Perkinsus marinus ATCC 50983 conserved hypothetical protein, partial [Perkinsus marinus ATCC 50983]. 594149527 N6-MTase PCIF1_WW GSHART1_T00000036001 565 eukaryota>euglenozoa>kinetoplastida Phytomonas sp. isolate Hart1 unnamed protein product [Phytomonas sp. isolate Hart1]. 124512340 N6-MTase PCIF1_WW MAL8P1.49 1466 eukaryota>alveolata>apicomplexa Plasmodium falciparum 3D7 hypothetical protein [Plasmodium falciparum 3D7]. 514681002 N6-MTase PCIF1_WW PTSG_10553 877 eukaryota>choanoflagellida Salpingoeca rosetta hypothetical protein PTSG_10553 [Salpingoeca rosetta]. 326429537 N6-MTase PCIF1_WW PTSG_06762 352 eukaryota>choanoflagellida Salpingoeca rosetta hypothetical protein PTSG_06762 [Salpingoeca rosetta]. 558598873 N6-MTase PCIF1_WW SS50377_14344 517 eukaryota Spironucleus salmonicida Phosphorylated CTD interacting factor PCIF1 [Spironucleus salmonicida]. 528231724 N6-MTase PCIF1_WW STCU_06168 583 eukaryota>euglenozoa>kinetoplastida Strigomonas culicis hypothetical protein STCU_06168 [Strigomonas culicis]. 89295554 N6-MTase PCIF1_WW TTHERM_00426090 413 eukaryota>alveolata>ciliophora Tetrahymena thermophila SB210 phosphorylated CTD-interacting factor 1 (macronuclear) [Tetrahymena thermophila SB210]. Ttra1000007497 N6-MTase PCIF1_WW Ttra1000007497 512 eukaryota>apusozoa Thecamonas trahens Thecamonas trahens ATCC 50062 hypothetical protein (512 aa) 221486586 N6-MTase PCIF1_WW TGGT1_082560 722 eukaryota>alveolata>apicomplexa Toxoplasma gondii GT1 conserved hypothetical protein [Toxoplasma gondii GT1]. 72389220 N6-MTase PCIF1_WW Tb927.5.2310 711 eukaryota>euglenozoa>kinetoplastida Trypanosoma brucei brucei TREU927 hypothetical protein [Trypanosoma brucei brucei TREU927]. 72393303 N6-MTase PCIF1_WW Tb927.8.6220 572 eukaryota>euglenozoa>kinetoplastida Trypanosoma brucei brucei TREU927 hypothetical protein [Trypanosoma brucei brucei TREU927]. 
71661834 N6-MTase PCIF1_WW Tc00.1047053511903.170 524 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 71403247 N6-MTase PCIF1_WW Tc00.1047053511065.36 524 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 71651022 N6-MTase PCIF1_WW Tc00.1047053509799.80 532 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 71651450 N6-MTase PCIF1_WW Tc00.1047053511755.90 694 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 71650340 N6-MTase PCIF1_WW Tc00.1047053504137.130 532 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 71412328 N6-MTase PCIF1_WW Tc00.1047053511313.20 694 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener].
Str-1 Str-2 Str-4 Str-5 RES HGRSP-SLD-VFADAHTTKVPG------SFFAA-NWCPG----------VKGVDAFAQ----DWGSPGWGRRDGGETVRPLLYINPPF--SA------VARVLRKVAEER--------------------PDCV-LILPVWP-RAWVAILRT-----LPIRAQMTLAHR--ELF------IPGPQVPNAAKRGPMTPRYR-VQAVY ALIGN -------HH-HHHH-------E------EEEE----------------------EEE-------------------HH-------------H------HHHHHHHHH-------------------------EE-EE---------HHHHHH-----HHHH--------------------------------------------- HMM ------HHE-EEHHHHHHHHH-------EEEEE-----------------EEEEEEEE-----------HHHHHHHH---EEEE---H--HH------HHHHHHHHHH------------------------EE-EEE--------HHHHHH-----H--HH-----------E-------------------------------- FREQ -----------E----------------EEE-------------------HHHHHHHH------------HHHHHHHH--HH---------H------HHHHHHHHHH------------------------EE-EEE---------HHHHH-----HHH-------------------------------------HHH-HHHH- PSSM --------------------------------------------------------------------------------E----------H------HHHHHHHHHH-----------------------EEE-EEE--------HHHHHH-----HH--------------------------------------------E-- FINAL ----------------------------EEEE------------------HHHHEEEE------------HHHHHHHH--EEE-------HH------HHHHHHHHHH------------------------EE-EEE--------HHHHHH-----HHH--------------------------------------HH-HHEE- Vcar1000004728_Vcar_Vcar1000004728 ---L--S-KLQDPD-------------DW-----MLKPSIFRRLHDTWG---P-FDIDLFASHA-------TFQLP------VYYS---------RYF-----TRTTSGV-------DAFRS---------SW-----G--RRCW--------ANP----PFH----LLLRVLQHAE-----AC--QS-R---L-----C--LV------APF-------WPTRDWW-PFITA-D-GV-WFKPFASGVRLLGRAAD-V-FLARTSGSPSPKAL--------PA-DLD--ARAEA-LL----------------- AAA33195.1_Dictyostelium_discoideum_167739 
---L--S-RLSEMNHKSSTR--VIKSYNW-----QLKKEVFNRIQLQFG---Q-IQMDLFASHL-------NHQTT------NY---------------------STIRM-------NTLHL---------DW-----SQWKQCL--------AFP----PPI----LLPSILEKMN-----SS--SS-KKVSI-----I--LI------FPI-------WRSATWY-PMIQA-Q-VP-RHHRHMFP-QVLGTFQE-V-L-----TKQSVESI--------PI-QIQ--QRWKL-GI----------------- AAA70202.1_Dictyostelium_discoideum_903714 ---L--S-RLSEMNHKSSTR--VIKSYNW-----QLKKEVFNRIQLQFG---Q-IQMDLFASHL-------NHQTN------NY---------------------STIRM-------NALHL---------DW-----SQWKQCL--------AFP----PPI----LLPSILEKMN-----SS--SS-KKVSI-----I--LI------FPI-------WRSATWY-PMIQA-Q-VP-RHHRHMFP-QVLGTFQE-V-L-----TKQSVESI--------PI-QIQ--QRWKL-GI----------------- AAL35360.1_Tetraodon_nigroviridis_17066696 ---L--S-R----------G--NPLYGEW-----RLHPQVVAQIWQRYG---K-AAVDLFASQE-------NAHCP------LFFS---------LAE-----GSAPLGV-------DALAH---------PW-----PD-VLLY--------AFP----PLS----LISPTLARVR-----EQ--GL-S---L-----I--LV------APR-------WPSKHWI-AEIVQ-L-LM-AEP------WPLPCRRD-L-L------SQARGEI--------FH-PRP--DRLSL-WA----------------- LOC100333442_Danio_rerio_292630533 ---L--S-R----------Q--RLEPGGW-----RLHPKVVAAIWQRFS---K-ADINLFACQK-------TTHCP------LWFS---------LTH------PAPLGL-------DAMVQ---------KW-----PR-LRLY--------AFP----PIA----LLPGILERVR-----QG--GY-N---L-----L--LV------APY-------WPTRVWF-SDLVS-L-LD-GLP------WEIPIQRD-L-L------SQAGGMI--------VH-PRP--DLWKL-WV----------------- LOC100334277_Danio_rerio_292615760 ---L--S-R----------Q--GVRSGEW-----KLHPEVVETIWERFG---K-AQVDLFASQE-------TTHCV------LWFS---------LSH------PAPLGL-------DAMVQ---------TW-----PR-LRLY--------AFP----PVA----LLPGVLERIR-----QD--GV-Q---L-----L--LV------APF-------WPTRIWF-SDLIA-L-LA-GLP------WEIPIRRD-L-L------SQAGGMI--------LH-PRP--DLWKL-WV----------------- LOC558928_Danio_rerio_189516844 
---L--S-R----------Q--GLEPGGW-----RLHPKVLAAIWQRFG---R-ADVDLFACQK-------TTHCP------LWFS---------QTH------PAPLGL-------DAMVQ---------TW-----PR-LRLY--------AFP----PIA----LLPGVLERVR-----QG--GY-N---L-----L--LV------APY-------WPTRVWF-SDLVS-L-LD-GLP------WEIPVQRD-L-L------SQAEGMI--------VY-PRP--DLWKL-WV----------------- LOC100331476_Danio_rerio_292613204 ---L--S-R----------Q--GLEPGGW-----RLHPKVVAAIWQRFG---R-ADVDLFACQK-------TTHCP------LWFS---------QTH------PAPLGL-------DAMVQ---------TR-----PR-LCLY--------VFP----PIA----LLRGVLERVR-----QG--GY-N---L-----L--LV------APY-------WPTRVWF-SDLVS-L-LD-GLP------WKNTVQRD-L-L------SQAEGMI--------VH-PRP--DLWKL-WV----------------- LOC100329396_Danio_rerio_292611038 ---L--S-R----------Q--LLRPGEW-----RLHPKSVQLIWARFG---E-AQIDLFASPE-------NAHCQ------LFFS---------LTE-------GSLGT-------DALAH---------SW-----PRGMRKY--------AFP----PVS----LLAQFLCKVR-----ED--EE-Q---V-----L--LV------APL-------WPNRTWI-SELSL-L-AT-ALP------WRIPLRED-L-L------SQGQGTI--------WH-PRP--DLWNL-HS----------------- LOC561204_Danio_rerio_292626487 ---L--S-R----------Q--LLRPGEW-----RLHPESVQLIWARFG---E-AQIDLFASPE-------NAHCQ------LFFS---------LTE-------GSLGT-------DALAH---------SW-----PRGMRKY--------AFP----PVS----LLTQFLCKVR-----ED--EE-Q---V-----L--LV------APL-------WPNRTWI-SELSL-L-AT-ALP------WRIPLRED-L-L------SQGQGTI--------WH-PRP--DLWNL-HV----------------- LOC100005823_Danio_rerio_125838616 ---L--S-R----------Q--GLEPGGW-----RLHPKVVAAIWQRFG---R-ADVDLFACQK-------TTHCP------LWFS---------QTH------PAPLGL-------DAMVQ---------TW-----PR-LRLY--------VFP----PIA----LLPGVLERVR-----QG--GY-N---L-----L--LV------APY-------WPTRVWF-SDLVS-L-------------------RD-L-L------SQAEGMI--------VH-PRP--DLWKL-WV----------------- LOC100369420_Saccoglossus_kowalevskii_291232955 
---L--S-R-------------IVDTDDW-----SLNPKIFGMLDKIWG---P-HSIDRFASCH-------NAQLP------RFNS---------AAS-----NPGTEAV-------DAFCQ---------DW-----ST-ENNWIQNWKAELSNP----ALDDFCDTLPDVMLASR-----AG--NT-L---S-----K--YT------GSW-------LRWKRWC-QSNLS-A-GA-ACP--AKP-LHIAIYLR-S-L------LDNANTV--------AP-MDS--ALYSIRWA----------------- CBG11637_Caenorhabditis_briggsae_268535232 ---A--S-R-------------NFDFDDW-----GVADRVFRQAQKLWG---E-IKVDWFADAN-------NRKTE------VYFS---------RYP-----EFGTSGV-------NVFDHIERAERMGCAW--------------------WVP----PPM----LIPHLLKIEG-----AK--R------------Q--FV------EELKQKAGEGFEHHIER-LKRIP-F-EA-K--------ATSTAAAY-K-A------ENDKRTLEGRRSFFNYE-PGSDAVILMC-------------------- TcasGA2_TC015886_Tribolium_castaneum_270017202 ---E--S-R----------R--LSPETEF-----ELAPYAFRKICTFFQ---I-PEVDLFASRN-------NTKCR------RFFS---------WFR-----DPEAEVV-------DAFTV---------PW-----TD-LKFY--------AFP----PFS----LVAHCLQKIV-----SD--RA-R---G-----I--LV------VPY-------WPTQPWF-PIFTS-L-LR-KEP------IFFEPDTN-L-L------LTSHRVP---HP---LH---------------QKLSLVAGLLSK---- TcasGA2_TC010690_Tribolium_castaneum_270016221 ---K--S-R----------R--LKSETEW-----QLSDFAYSDIIKRFG---Y-PEIDLFANRH-------NAKCE------KFVA---------WLR-----DPGALAI-------DAFTV---------SW-----ED-YYFY--------AFP----PFS----VVLRTLRKII-----SD--RP-C---G-----I--LV------VPN-------WPIQPWF-PLFIS-L-LT-NKP------FYCKPSKY-L-L------TSPDRRR---HP---IW---------------NQLSLVVGRLSG---- VOLCADRAFT_118754_Volvox_carteri_f_nagariensis_302846025 ---L--A-N-----------------SKW-----HETLEPLVCTYPTLF---H-GLLFKLQDMR-------DEDVR------GFVR---------CLT------PSTLQT-------GGYRG---------------------------RPQ-SVN----PLM--A-MSQRGMTKAH-----AV--DR-V---K-----L--LI--Y---QQL----VE-GLKKVSY-RIIKT-F-FE-ESD---QR-LLNDMAQF-P-I---F--AEFVGLV---AN---MD-TGK--------AI-GESD------------ LOC100493936_Xenopus_(Silurana)_tropicalis_301612402 
---L--S-R----------Q--RLDPGEW-----ALNPGIFQDIVALWG---L-PEVDLMASRQ-------NRKVT------QFMS---------RCR-----DPLALAA-------DALTT---------TW-----DF-DLAY--------AFP----PLP----LLPRVIRKIR-----SE--RC-T---V-----I--LI------APH-------WPKRAWF-TELVA-L-SR-SEP------WPLPQIPD-L-L------AQGPILH--------PN-PAF--LNLTA-WR----------------- LOC100331913_Danio_rerio_292614277 ---------------------------QW-----RLHPESVQLIWARFR---E-AQIDLFASPE-------NAHCQ------LFYS---------LTE-------GSLGT-------DALAH---------SW-----PRGVRKY--------AFP----PVS----LIAQLMCKFR-----ED--EE-Q---V-----L--LV------APL-------WPNRTWI-SELSL-L-AT-ALP------WQIPLRED-L-L------SQGQGTV--------WH-PHP-----TT-KL----------------- LOC561204_Danio_rerio_125850303 ---L--S-R----------Q--LLRPGEW-----RLHPESVQLIWARFG---E-AQIDLFASPE-------NAHCQ------LFFS---------LTE-------GSLGT-------DALAH---------SW-----PRGMRKY--------AFP----PVS----LLTQFLCKVR-----ED--EE-Q---V-----L--LV------APL-------WPNRTWI-SELSL-L-AT-ALP------WRIPLRED-L-L------SQGQGTI--------WH-PRP--DLWNL-HL----------------- IscW_ISCW000383_Ixodes_scapularis_240992967 ---A--S-R----------R--VMDASSW-----KLCPFTFSRVNFLWG---P-VHMDLFADFS-------NHQVR------HYYS---------WKP-----DPQAAAV-------DALSH---------DG-----TG-QGLY--------AFP----PFS----LVSRCIAKLQ-----TS--NS-L---L-----I--LV------APV-------RPSQPWY-ASLLY-H-SY-EEP------RLLPQSHD-L-L------RSHDGQV---HP---ML------------ST-GTLIL----------- LOC100487094_Xenopus_(Silurana)_tropicalis_301609594 ---S--Q-S----------T--RFP--EW-----ELHSDAFQDLTRRWG---T-PQIDLMASRS-------NQNVL------KFFT---------HCR-----DPLTTGI-------VAMTQ---------HW-----RF-DLVY--------VFP----PLP----MLPQVLKKIR-----QS--PT-T---V-----I--VI------APY-------WPRRTWF-SDLQE-L-CD-TQK------GP---------S------SSGPNRP----------------------------------------- LOC100378332_Saccoglossus_kowalevskii_291236647 
---L--S-R------R---L--CLQNTEW-----QILQWVTSRLFWLWD---G-PKIDLFAALR-------NAKLP------TYVT---------LLP-----TPGAWAV-------DALSI---------PW-----SH-MDST-GKSYAV-RLP----GGR----LQHSVLLMLK-----RF--HV-D----------------Y---RSK-------YDVLS---EKLSN-L-LK-TKP-DQMD-VAVNVKCE-M-G------LEYEASA--------FK------KLLKY-AF-GLNFVKMVNVTKKV-- ACB38666.1_Daphnia_pulex_170819724 ---E--S-R----------A--GPDSGDW-----KLDPMVFERIQQLW----P-SDVDVFASPW-------NAHLP------AFIS---------WFP-----QPGAMAT-------NAFSV---------NW-----KG-LSGY--------IFP----PFA----LIFKCIEKIR-----RE--RA-T---A-----V--FV------CPV-------WTGQPWF-PLLLE-L-VC-DVP------RLLTSSPV-L-L------TSALGES---HP---LI------------SS-NALHLAA--------W ACB38665.1_Daphnia_pulex_170819710 ---E--S-R----------A--EADTSDW-----RLDATIFSRISEIW----E-MDVDLFASSW-------NSQLP------RFIA---------WGP-----QPGAFAA-------NAFSI---------RW-----EN-IYGY--------AFP----PFS----LIFRCIEKIR-----RE--KA-S---I-----I--LI------CPV-------WTGQPWF-PVLLE-H-AC-DIP------RLLRPSPE-L-L------TSARGEP---HP---LI------------QS-GALSLAA--------W NEMVEDRAFT_v1g220156_Nematostella_vectensis_156352960 ---P--S-R-------------VLSDLDC-----TLSAQTWRSIDIAFG---P-HSIDLMALPS-------NVMHDHAGRPLRFFS---------QLP-----CVQAEST-------NVFAH---S-----LL-----PE-ENAY--------VFP----PFI----LMGPLLGHLS-----KR--AC-P-F-S-----I--VV------PDI-------TPRKYWW-SVLKR-R-AA-A------S-FKLGSRGS-L-S-SLL--FPAKSGA---AP---WL-NQE--RLRGS-RA----------------- ORF2_Panagrellus_redivivus_10058 ---A--S-R-------------ETDPDDW-----AISKEIFEKLTAKFQ---K-CQCDRFASHK-------TKQLD------KFMS---------RVP-----CPGSAGV-------NAFAY---------QW-----TD-WSSW--------CVP----PPA----LLVRTWKHIE-----SH--AC-E---G-----L--LV------SPD-------WP------ANVVA-T-AA-SRA------VRKGFAKL-V-Y--RI--RAGTRCI-T-PP-A-FS-TGA--------FQ-TPYAQSDLLVYR---F CBG07308_Caenorhabditis_briggsae_268581363 
---A--S-R-------------NFDFDDW-----GIAERVFIQAQRMWG---E-IKVDWFADAN-------NKKTE------LYFS---------RYP-----EFGTSGV-------NVFEHVERAERMGLPW--------------------LVP----PPV----LIPQLIKIMR-----RR--RL-R---G-----V--LV------APL-------WKSHISY-QALVD-Y-SG-R--------FIREVKDY-I-I------YQKNDCI--------FI-PGE--------------------------- CBG06557_Caenorhabditis_briggsae_268566149 ---A--S-R-------------NFDFDDW-----GIAERVFSHAQRMWG---E-IKVDWFADAN-------NKKTE------LYFS---------RYP-----EFGTSGV-------NVFEHVERAERMGLPW--------------------LVP----PPV----LIPQLIKIMR-----RR--RL-R---G-----V--LV------APL-------WKSHISY-QALVD-Y-SG-R--------FIREVKDY-I-I------YEKNDCI--------FI-PGE--------------------------- CBG03513_Caenorhabditis_briggsae_268553337 ---A--S-R-------------NFDFDDW-----GIAERVFSQAQRMWG---E-IKVDWFADAN-------NKKTE------LYFS---------RYP-----EFGTSGV-------NVFEHVERAERMGLPW--------------------LVP----PPV----LIPQLIKIMR-----RR--RL-R---G-----V--LV------APL-------WESHISY-QALVD-Y-SG-R--------FIREVKDY-I-I------YEKNDCI--------FI-PGE--------------------------- CRE_11685_Caenorhabditis_remanei_308493269 ---A--S-R-------------NFDFDDW-----GIADRVFKQAQRLWG---E-IKVDWFADAQ-------NKKTE------RFFS---------RYP-----EFGSSGV-------NVFEHIPRAERMGLAW--------------------WVP----PPV----MIPQLLKIAK-----SR--GL-K---G-----V--LV------APL-------WKSHPSY-QALVD-S-SG-R--------FVRYVRDY-I-I------YEKNDNI--------FI-PGE--------------------------- CBG19494_Caenorhabditis_briggsae_268562950 ---A--S-R-------------NFDFGDW-----GVADKVFRQAQRLWG---E-IKVDWFADAN-------NRKTE------LYLS---------RYP-----EFGTSGV-------NVFDHIERAERMGCAW--------------------WVP----PPV----LLPHLLKIAR-----KR--RL-R---G-----V--LV------APL-------WQSHASY-QSLVD-K-TR-R--------FIREIKDY-I-I------YEKNDSI--------FI-PGD--------------------------- CRE_13303_Caenorhabditis_remanei_308490919 
---A--S-R-------------DFDYDDW-----AVQNWAFEWAQKRWG---E-VKCDWFADEQ-------NTKTE------LFFS---------RLP-----EPGTLGA-------DVFEHVDKAGAIGLAW--------------------WVP----PPA----LIPRLMRVAR-----QK--KL-R---G-----I--LA------TPL-------WKAHPSY-QALVN-E-RG-E--------FIPEIRDS-R-I------FKVNTKI--------IS-PGR--------------------------- CRE_30767_Caenorhabditis_remanei_308500876 ---A--S-R-------------DFDYDDW-----AVQNWAFEWAQKRWG---E-VKCDWFADEQ-------NTKTE------LFFS---------RLP-----EPGTLGA-------DVFEHVDKAGAIGLAW--------------------WVP----PPA----LIPRLMRVAR-----QK--KL-R---G-----I--LA------TPL-------WKAHPSY-QALVN-E-RG-E--------FIPEIRDS-R-I------FKVNTKI--------IS-PGR--------------------------- CRE_30766_Caenorhabditis_remanei_308500992 ---A--S-R-------------DFDYDDW-----AVQNWAFEWAQKRWG---E-VKCDWFADEQ-------NTKTE------LFFS---------RLP-----EPGTLGA-------DVFENVDKAGSIGLAW--------------------WVP----PPV----LIPRLMRVAR-----QK--KL-R---G-----I--LA------TPL-------WKTHPSY-QALVN-E-RG-E--------FIPEIRDS-R-I------FKVNTKI--------IS-PGR--------------------------- CBG17454_Caenorhabditis_briggsae_268571541 ---A--S-R-------------EFDTDDW-----GVQNWAFEWAQKRWG---H-VKCDLFASER-------NAKHS------VYFS---------RYP-----EPTSSGT-------DAFDHF-TCAAKSLTW--------------------WVP----PPV----LVPKLIQVAR-----RN--RC-R---G-----I--LA------TPL-------WKTHPSY-LALVD-Q-NG-N--------FIREIRDL-R-R------RTRENNF--------SM-TRS--------------------------- CRE_21296_Caenorhabditis_remanei_308475765 ---A--S-R-------------DFDFDDW-----GVDQKVFLWAQTRWG---K-FKCDWFADEA-------NAKTQ------LFYS---------RDP-----CKCSQGA-------NVFDHIDVAKELGFAW--------------------WVP----PSN----LVPQLIAECR-----KT--SM-R---G-----V--LA------MPL-------WENHVSF-QAILD-S-RG-N--------WIRQLVDL-R-V------YPAKDRI--------IV-PGT--------------------------- CRE_26772_Caenorhabditis_remanei_308462401 
---A--S-R-------------DFDFDDW-----GVDQKVFLWAQTRWG---E-FKCDWFADEA-------NAKTQ------LFYS---------RDP-----GKCSQGA-------NVFDHIDVAKQLGFAW--------------------WVP----PPN----LVPRLIAECR-----KT--SM-R---G-----V--LA------MPL-------WENHVSF-QAILD-S-RG-N--------WIRQLVDL-R-V------YPAKDRI--------IV-PGA--------------------------- LOC100492777_Xenopus_(Silurana)_tropicalis_301606863 ---L--V-K---FK-----K--DSPHKNYIDNLKNIFNSHIADCMNLWD---D-TNIMDIEDNC-------KATCY-LDPT-GLLE---------KVK-----ELLEIAQ-------NFIKG------------------------K------NTP----SDD-CG-QYFEMCVATE-----KN--ST-G--------------------PPH-------WESTSPV-TPVHP-L-TV-NQK---PQ-SDMPSTNH-T-S------HSSMDIS--------VK-SSD--SYHHM----PTR------------- LOC100489531_Xenopus_(Silurana)_tropicalis_301620562 ---L--S-R----------Y--QIDPTEW-----ELHPEVFDLIVTQWG---E-PDLDLMASRH-------NRKTP------LFIS---------KTR-----DHLANEE-------RKRHVHSNRSKLA-----------SEVV--------VYR----PGE----SLNRSTHNTP-----HS--GR-P---A-----S--PR--SDL-PPQ-SPNVP-FDGVALE-TAILN-Q-KG----------FSPVVAQT-M-I------NARKAVS-S-KA---YH------RIWKI-FI----------------- LOC100487066_Xenopus_(Silurana)_tropicalis_301624526 ---L--S-R----------T--TLDPGEW-----KLKEEIFQQLVAKWG---Q-PCLDVMASRF-------NSQTP------RFLS---------KVH-----DPMVEGL-------DALTS---------PW-----HC-QLAY--------AFP----PIP----LIPRLLHKIR-----RE--RV-P---T-----I--LI------APW-------WPRRAWF-AELIQ-M-SA-EQP------WTIPLSSD-L-L------SQGPATA--------EN-LHK--LNLTA-WM----------------- LOC100490727_Xenopus_(Silurana)_tropicalis_301619133 ---L--S-R----------T--TLDPGEW-----KLKPEIFQQIVKKWG---L-PCLDIMASRF-------NSQIP------RFLS---------KVH-----DPKAEGV-------DALTS---------PW-----HC-QLAY--------AFP----PIP----LIPRLLHKIR-----RE--NI-P---T-----I--LI------APW-------WPRRAWF-AELIQ-M-SA-EQP------WTFPLYAD-L-L------SQGPAKA--------EN-IHN--LNLTA-WM----------------- CHLREDRAFT_180868_Chlamydomonas_reinhardtii_159465941 
---L--S-R------QL--A--QARDQNL-----RLKPAVFRSLVTTDGGQYR-PTVDCCADVL-------GLNAQ-PGCA-EFFS-----------P-----ERSVLGQ-------EQRLA----GKV--LW--------------------AFP----PVS----LTGEVLATIAAAAQLDE--RT-R---A-----T--VV------VPY-------QPSYPWF-QQWAS-Q-RL-AYK------TLQGNISA-L-A--DW--QRSKGRS-G-ED-L---------------------------------- EAI_09447_Harpegnathos_saltator_307201692 ---E--S-R----------I--SDTNTEW-----SLSEQAFRAVEGVFG---P-FDIDLFASII-------NAKLD------LYVS---------WFP-----DPGSWAI-------DAFTL---------SW-----QS-LYFY--------AFP----PFI----IIPRILRKII-----DD--EA-T---G-----V--LI------VPW-------WPSQSWF-PMFTC-L-LQ---------------------------------------------------------------------------- EAI_17025_Harpegnathos_saltator_307196129 ---L--S-R----------L--KNLDTEW-----ELATYAFNKITTSFG---F-PELDLFATSL-------NAKCE------KFCS---------WAT-----DPNAWAI-------DAFSI---------SW-----ST-FFSY--------AFP----PFS----MILRMLNKIV-----QD--KA-R---G-----I--IV------VPN-------WKGQAWY-PMFRN-L-L----------------------------------------------------------------------------- NEMVEDRAFT_v1g211073_Nematostella_vectensis_156375001 ---L--S-R-------------FVDKDDW-----SVNQSVFRLLDAKGG---P-HTIDRFASAY-------NTKLT------CFNSSSLPGLFDLLLG-----AKAVSAV-------RKYHT---------GW-----MR-LRVW-ALSKFD-VKPIPAKPLH-VA-LFLTELTRSA-----EE--KG-V---G-----I--SN--VEG-VAY---VIT-WRAL----PTLLH-G-CT-SWR-DYGR------------------------------------------------------------------- BRAFLDRAFT_131954_Branchiostoma_floridae_260797342 ---T--G-Q-CRCL-----D--DFSGRQC-NMC-QFGYFDFPTCRECTC-N-Q-AGTDPNTCNA-------NDVCA-CADN-GTCS---------CKP-----NVEGKSC-------TLCKE-------G-SF-----NL----------EE-ENP--N-GCT-SC-FCFGITDQCR-----QA--NL-V---T-----E--QV------TPD-------ADNNNFF---LSN-I-RR-TQQ-T---------------------------------------------------------------------- NEMVEDRAFT_v1g208020_Nematostella_vectensis_156382077 
---A--N-K------------------SW--LK-KLSEEQLRDTADKYL-R-P-NNCSHVVVPKVNEEIWLNFKCR------VNIS---------YQP-----DPGAYAV-------NAFHT---------SW-----KN-LCFY--------AFS----PFG----IIQKVLSKIS-----ED--QA-T---G-----I--LV------APH-------WPT-----PTMVA-I-SC-K--------FTYRTTSY-F-T------QKEEHPV---LT---IQ-SRA--ETSTP----QDPTVLGLPLV----- EAI_12430_Harpegnathos_saltator_307198641 ---Q--S-H----------I--VSTETEW-----SLSCDYFHRIESGFD---P-FDIDLFASSI-------YTKCP------CFVS---------WLP-----DPLAHSI-------DAFSL---------DW-----SK-FYFF--------AFP----PFI----LILRVLRKII-----SD--KA-E---R-----V--LV------VP------------------------------------------------------------------------------------------------------ EAI_06111_Harpegnathos_saltator_307212135 ---E--S-R----------C--KDPGTEW-----CLSDEAFQQVNKAFG---P-FDINLFASAI-------NNKCD------VCVS---------WFP-----NPGSFTT-------DAFAV---------AW-----EA-LNFY--------AFP----PFI----LLPRVLRKLI-----DD--EA-T---G-----T--LV------V------------------------------------------------------------------------------------------------------- EAI_10577_Harpegnathos_saltator_307193617 ---E--S-R----------I--SDTDTEW-----SLTDCAFQLIDRHFG---P-FAIDLFASAI-------NTKND------LYVS---------WFP-----NPGSWAT---------FTL---------DW-----HR-FYFY--------AFP----PFI----LFSRGLRKFI-----DD--KA-I---G-----V--LV------VPW-------W--------------------------------------------------------------------------------------------- GIP_L7_0070_Glyptapanteles_indiensis_190702585 ---G--S-R----------I--VNPDTEW-----ELADWAFQRIVKNFG---T-PEIDLFASRT-------NRKCK------KFCS---------WHR-----DPDAYCV-------DAFTM---------VC-----TD-LKFY--------AFP----PFS----LILRTLKKIE-----AD--QA-Q---D-----T--STSTVSKNAAN-------GRDIVWQ-TFLKL-D-FN-EKA------VELLVGSI-T-D------STMKQYN---KP---LQEWKNF-------SSEQKIDMLKPQTNQVINW EAG_00458_Camponotus_floridanus_307183886 
---E--S-R----------K--LQPETEF-----ELDNSAFQKIVKVFG---Q-PEIDLFASRA-------NAKCR------RYVS---------SRK-----DSGSIAI-------DAFIL---------EW-----KR-FLFY--------AFP----PFS----VILKVLRKIE-----YE--GS-S---G-----I--VV-------------------------------------------------------------------------------------------------------------- NEMVEDRAFT_v1g220590_Nematostella_vectensis_156351485 ---L--D-R------V---I--TDEHSTF-REL-ARIAAVFRLLNIKWG---P-YTIDRFATHY-------NAQLS------RFHS---------KFA-----APGSCGV-------DAFTQ---------EW-----S--------------GLP----EKG----YLERGLTFRS-TI--AI--GP-T---Q-----V--SC-EF---RPL----------------------------------------------------------------------------------------------------- Dpul1000019018_Daphnia_pulex_Dpul1000019018 ---G--S-K-------------MTDTDDW-----QVDHETYQRINRRYS-----FTIDLFASDR-------NTKCQ------NFSQ-------------------------------------------------------------------IFT----ART----LLASMRFRTR----------------G-----K--MK----------------WLGSA---HQLEK--------------------------------------------------------------------------------- CBG23377_Caenorhabditis_briggsae_268557352 ---T--S-R-------------EFDTDDW-----GVQDWAFEWAQKRWS---R-VKCDLFASER-------NAKHS------VYFS---------RYP-----EPTSSGT-------DAFDHF-TCAAKSLTW--------------------WVP----PPV----LVPN----------------------------------------------------------------------------------------------------------------------------------------------- CBG19482_Caenorhabditis_briggsae_268561666 ---A--S-R-------------NFFFDDW-----GVAGRVFRQAQRL-------------------------------------------------YP-----EFGTSGV-------NVFDHIERAERMGCAL--------------------WVP----PPV----LIPHLLKMGR-----KR--RL-R---G-----V--LV------APL-------WRSHASY-QALVD-H-SG-R--------FIRAIKDY-I-I------YEKNDNI--------FI-PEG--------------------------- LOC582271_Strongylocentrotus_purpuratus_115621795 
---L--S-R----------G--KCLPSEW-----TLSPTVFRQLVRVFS---T-SISSQLRSTI------------------VFLG-------------------------------------------------------------------FVR----ESG------NQELLKIR-----ED--QA-M---V-----V--LI------APW-------WPARSWF-QDLLT-L-LV-GTL------WSLPCHPD-L-V------SQPLSGI--------LH-QRP--EILHL-TA----------------W LOC100123785_Nasonia_vitripennis_156546508 -------------------------------------------------------------------------------------------------------------M-------NAFTI---------NW-----NN-KFWY--------AFP----PFA----LLTKTLKKIR-----DD--KA-E---G-----I--LI------VPH-------WPGQPWF-PEFKR-L-LE-THA----P-FSVPAFTD-C-R-SIV--REAYRQK-G-LE---EA-PVE--II----------------------- MNEG_14804_Monoraphidium_neglectum_761958716 ---L--S-KIVDKN-------------DW-----MLHEEEFGRLARRFG---P-FEVDLFASHT-------TRQLP------KYFS---------LYH-----TPDTAGI-------DAFAQ---------HW-----G--RGCW--------CNP----PFT----LIGRVLRHAR-----EC--GA-R---M-----C--LL------APA-------WPSAAWWHQLVLP-G-GT-HFRPFVRECVVLPKRRD-L-F------------------------------------------------------ D478_26539_Brevibacillus_agri_BAB-2500_432181416 ------N-K--AMF--------TSEREEW-----ETPQDFFEKLNKEF----G-FQLDVCALPT-------NAKCE------RYFT---------PDE-------------------DGLKQ---------EW-----TG--VCW--------MNP----PYG--R-EIGKWVKKAY-----ES--AK-Q---G-AT--V--VC--L---LPA-------RTDVKWW-HDYCM-K-G--EIR------LVRGRMKF----------VGADNMA-----------PFP--NAVVI-FS----------------- C236_RS0118880_Brevibacillus_laterosporus_517503045 ------N-E--GMF--------TSSTDLW-----ETPQDFFNQLNKEF----G-FQLDVCALPE-------NAKCE------RYFS---------PDE-------------------DGLQQ---------EW-----TG--ICW--------MNP----PYG--R-QIGKWIKKAY-----ES--SL-N---G-AT--V--VC--L---IPA-------RTDASWW-HAHCM-K-G--EIR------LVKGRLKF----------GGSKWNA-----------PFP--NAVVI-FR----------------- ABOUO_79_Paenibacillus_phage_Abouo_525335850 
------N-E--GMF--------TSSTDLW-----ETPQEFFNQLNQEF----G-FQIDVCALPE-------NAKCE------RYFS---------PDE-------------------DGLQQ---------EW-----TG--ICW--------MNP----PYG--R-QIGKWIKKAY-----ES--SL-N---G-AT--V--VC--L---IPA-------RTDARWW-HDYCM-K-G--EIR------LVKGRLKF----------GSSKWSA-----------PFP--NALVI-FK----------------- D478_RS25245_Brevibacillus_agri_748713908 ------------MF--------TSEREEW-----ETPQDFFEKLNKEF----G-FQLDVCALPT-------NAKCE------RYFT---------PDE-------------------DGLKQ---------EW-----TG--VCW--------MNP----PYG--R-EIGKWVKKAY-----ES--AK-Q---G-AT--V--VC--L---LPA-------RTDVKWW-HDYCM-K-G--EIR------LVRGRMKF----------VGADNMA-----------PFP--NAVVI-FS----------------- M655_RS0109725_Bacillus_sp_NSP21_737442515 ------------MF--------KSEREEW-----ETPQEFFDKLNDEF----G-FQLDVCALPT-------NAKCE------RYFT---------PDD-------------------DGLHQ---------EW-----TG--VCW--------MNP----PYG--R-EIGKWVKKAY-----ES--AK-Q---G-AT--V--VC--L---LPA-------RTDVKWW-HDYCM-K-A--EIR------LVRGRMKF----------VGADNMA-----------PFP--NAVVI-FS----------------- ABBL099_02355_Acinetobacter_baumannii_690996743 ---T--K-N--KLFGL-----AEERTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AN-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR----------------- FL80_RS05360_Acinetobacter_baumannii_690988986 ---T--K-N--KLFGL-----AEERTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AN-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR----------------- RQ87_RS18135_Acinetobacter_baumannii_447010248 
---T--K-N--KLFGL-----ADDRTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR----------------- ERIC1_1c08270_Paenibacillus_larvae_subsp_larvae_DSM_25719_567770034 ------N-K--VHY--------SSKTDMW-----ETPQNLFDRLNEEF----K-FDLDVCAIPE-------NAKCK------RYFT---------PSE-------------------DGLKQ---------EW-----KG--ACW--------MNP----PYG--R-QIGKWIAKAY-----ES--SL-E---G-AT--V--VC--L---VPS-------RTDTKWW-HGYCM-K-G--EIR------FIRGRLKF----------GGSPHNA-----------PFP--NAVVI-FR----------------- J517_3010_Acinetobacter_baumannii_691065210 ---T--K-N--KLFGL-----AEERTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR----------------- J523_3197_Acinetobacter_baumannii_691027491 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR----------------- J660_1691_Acinetobacter_baumannii_691157882 ---S--K-N--KLFGL-----AEDRTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR----------------- ERIC1_RS03940_Paenibacillus_larvae_738763505 
------N-K--VHY--------SSKTDMW-----ETPQNLFDRLNEEF----K-FDLDVCAIPE-------NAKCK------RYFT---------PSE-------------------DGLKQ---------EW-----KG--ACW--------MNP----PYG--R-QIGKWIAKAY-----ES--SL-E---G-AT--V--VC--L---VPS-------RTDTKWW-HGYCM-K-G--EIR------FIRGRLKF----------GGSPHNA-----------PFP--NAVVI-FR----------------- K035_3853_Acinetobacter_baumannii_691039522 ---T--K-N--KLFGL-----ADDRTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- J697_3983_Acinetobacter_baumannii_691093639 ---T--K-N--KLFGL-----ADDRTDVW-----ATPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR----------------- B4086_RS03845_Bacillus_cereus_822506548 ------N-K--GMF--------TSKTDLW-----ATPQYFFDELHKEF----N-FELDVCALED-------NAKCE------KYFT---------PEM-------------------DGLKQ---------EW-----NG--TCW--------MNP----PYG--R-GIGKWVQKAY-----ES--SL-T---G-ST--V--VC--L---LPA-------RTDTRWW-HDYCM-N-G--EIR------LVKGRLKF----------GDSKNSA-----------PFP--NAVVI-FG----------------- J689_1368_Acinetobacter_calcoaceticus/baumannii_complex_645913983 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--R-EIVDWIAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- W9I_03525_Acinetobacter_nosocomialis_493629840 
---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--R-EIVDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- K041_RS17240_Acinetobacter_baumannii_690981431 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- J532_4398_Acinetobacter_baumannii_691154760 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- ACIN5021_2863_Acinetobacter_sp_OIFC021_444754682 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--R-EIVDWIAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- J595_RS19805_Acinetobacter_baumannii_691047241 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EIVDWIAKAA-----YT--AE-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- LJ44_RS16470_Acinetobacter_baumannii_447017697 
---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- FL80_RS15355_Acinetobacter_baumannii_690990657 ---A--K-L--GLFGN-----AEGRTDVW-----ATPQTLFDALDQVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EIVDWISKAA-----YT--AE-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CAVVV-FR----------------- J532_4398_Acinetobacter_baumannii_940793_630464595 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- F985_01871_Acinetobacter_sp_NIPH_973_490838153 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPG-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--R-EIVDWIAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- TT45_RS11045_Acinetobacter_baumannii_758882462 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EIVDWISKAA-----YT--AE-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- ABSDF2497_Acinetobacter_baumannii_SDF_169152788 
---T--K-N--KLFGL-----ADDRTDVW-----ATPQDFFEKLDRVF----K-FDLDVCALPD-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----DT--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR----------------- J689_1349_Acinetobacter_baumannii_691068978 ---A--Q-R--KLFGL-----AENRTDVW-----ATPQDFFDKLNAVF----N-FDLDVCALPE-------NAKCE------RFFS---------PEQ-------------------NGLKQ---------EW-----IG--TCW--------MNP----PYG--R-EIVDWIAKAA-----YT--AE-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- BTS2_0497_Bacillus_sp_TS-2_591276954 ------N-Q--AMF--------SSSTDKW-----STPQSFYDKLNQEF----Q-FDIDVCATDS-------DKKCE------RYFS---------PEQ-------------------DGLKQ---------EW-----TG--ICW--------MNP----PYG--R-GIGPWIQKAY-----ES--SQ-Q---G-AT--V--VC--L---LPS-------RTDTKWW-HEYCM-K-G--EIR------FIKGRLKF----------GDSKNSA-----------PFP--SVVVI-FR----------------- J594_4091_Acinetobacter_baumannii_259052_588219826 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EIVDWIAKAA-----YT--AE-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- BTS2_RS02440_Bacillus_sp_TS-2_780117918 ------N-Q--AMF--------SSSTDKW-----STPQSFYDKLNQEF----Q-FDIDVCATDS-------DKKCE------RYFS---------PEQ-------------------DGLKQ---------EW-----TG--ICW--------MNP----PYG--R-GIGPWIQKAY-----ES--SQ-Q---G-AT--V--VC--L---LPS-------RTDTKWW-HEYCM-K-G--EIR------FIKGRLKF----------GDSKNSA-----------PFP--SVVVI-FR----------------- J635_1953_Acinetobacter_baumannii_690997976 
---A--K-L--GLFGN-----AEGRTDVW-----ATPQKLFDALDQVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----AG--TCW--------MNP----PYG--R-EIVDWISKAA-----YT--AE-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR----------------- LILY_61_Bacteriophage_Lily_755258783 ---SNTM-A--VHY--------SSKTDMW-----ETPQDFFDKLHAEF----G-FTLDVCAVPE-------NAKCE------RFFS---------PDD-------------------NGLLQ---------NW-----KG--VCW--------MNP----PYG--R-QIGAWIAKAY-----ES--SL-E---G-AT--V--VC--L---VPS-------RTDTKWW-HDYCL-K-G--EVR------FIKGRLKF----------GGSPHNA-----------PFP--NAIVI-FR----------------- J546_RS10975_Acinetobacter_baumannii_736663998 ---A--N-H--QLFGL-----AENRTDIW-----ATPQDFFDKLNAVF----K-FDLDVCALPN-------NAKCE------RFFS---------PED-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EIIEWVAKAA-----CT--AK-Q---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKSNA-----------PFG--CCVVV-FR----------------- J660_0735_Acinetobacter_calcoaceticus/baumannii_complex_493629922 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--C-EIVDWIAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- J479_2646_Acinetobacter_baumannii_691127129 ---A--K-L--GLFGN-----AEGRTDVW-----ATPQTLFDALDQVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EITLWIDKAV-----QT--AN-Q---G-HT--V--VG--L---LPA-------RTDVTWW-QEHVM-N-R--EIH------YIKGRLKF----------GGCKHNA-----------PFG--CAVVV-FR----------------- ACINWC323_RS01110_Acinetobacter_sp_WC-323_696306260 
---A--K-S--KLFGL-----AEDRTDVW-----ATPQDFFDKLNAIF----D-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLSQ---------EW-----TG--TCW--------MNP----PYG--R-EISLWIEKAV-----ET--AN-A---G-YT--V--VA--L---LPA-------RTDVGWW-QSHCL-N-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CAVVV-FR----------------- J596_3741_Acinetobacter_baumannii_691117543 ---A--K-L--GLYGN-----AEGKTDVW-----ATPQNLFDALDQIF----N-FDLDVCALPE-------NAKCE------RYFT---------PEL-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--R-EISLWIEKAV-----ET--AN-N---G-HT--V--VG--L---LPV-------RTDVVWW-QEHIL-H-R--EIH------YIKGRLKF----------GGSKHNA-----------PFG--CALVV-FR----------------- J660_0735_Acinetobacter_baumannii_88816_593668543 ---A--Q-S--KLFGL-----AENRTDVW-----STPQDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----SG--TCW--------MNP----PYG--C-EIVDWIAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSSSNA-----------PFG--CCVVV-FR----------------- ACINWC323_A0077_Acinetobacter_sp_WC-323_425484490 ---A--K-S--KLFGL-----AEDRTDVW-----ATPQDFFDKLNAIF----D-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLSQ---------EW-----TG--TCW--------MNP----PYG--R-EISLWIEKAV-----ET--AN-A---G-YT--V--VA--L---LPA-------RTDVGWW-QSHCL-N-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CAVVV-FR----------------- X858_RS0107890_Bacillus_subtilis_647261410 ------------HF--------SSKTDLW-----ATPQYFFDELHKEF----D-FELDVCALED-------NAKCE------KYFT---------PEM-------------------DGLKQ---------EW-----NS--TCW--------MNP----PYG--R-GIGEWVQKAY-----ES--SL-K---G-ST--V--VC--L---LPA-------RTDTRWW-HDYCM-K-G--EIR------LVKGRLKF----------GESKDNA-----------PFP--NAVVI-FG----------------- J635_2258_Acinetobacter_baumannii_690998264 
---A--K-L--GLFGN-----AEGRTDVW-----ATPQKLFDALDQVF----N-FDLDVCALPE-------NAKCE------RFFT---------PEI-------------------DGLKQ---------DW-----TG--TCW--------MNP----PYG--R-EISLWIEKAV-----QT--AN-Q---G-HT--V--VG--L---LPT-------RTDVAWW-QEHVM-N-R--EIH------YIKGRLKF----------GGCKHNA-----------PFG--CAVVV-FR----------------- G454_RS0114655_Desulfovirgula_thermocuniculi_654109520 ------N-R--GLF--------SSASSEW-----ETPQKFFETLDVEF----G-FTLDVCARPE-------NAKCP------RYFS---------PEE-------------------DGLRQ---------EW-----AP-EVCW--------MNP----PYG--R-EIGKWIQKAY-----EE--AQ-K---G-AT--V--VC--L---LPS-------RTDTAWW-HEYVM-RAA--EVR------FIRGRLRF----------GGAENGA-----------PFP--SCVVV-FR----------------- F931_01759_Acinetobacter_pittii_507070967 ---A--K-L--GLYGN-----AEGKTDVW-----ATPQNLFDAIDHIF----N-FDLDVCALPE-------NAKCD------RYFT---------PEL-------------------DGLKQ---------EW-----VG--TCW--------MNP----PYG--R-EISLWIEKAV-----ET--AN-N---G-HT--V--VG--L---LPV-------RTDVVWW-QEHIL-H-R--EIH------YIKGRLKF----------GGCKHNA-----------PFG--CALVV-FR----------------- PL75_03330_Neisseria_sp_KH1503_831387832 ------------HF--------SSKTDLW-----ATPQDFFDNLNEEF----G-FELDVCALPE-------NAKCE------KYFT---------PEN-------------------DGLKQ---------DW-----TG--TCW--------MNP----PYG--R-EIGKWMKKAY-----ES--SL-T---GNAT--V--VC--L---VPA-------RTDTKWF-HDFAM-K-G--EVR------FIKGRLKF----------GGSKNSA-----------PFP--SAVVI-FR----------------- K035_3825_Acinetobacter_baumannii_42057_4_629017472 -------------------------------------QDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR----------------- DESKU_RS03925_Desulfotomaculum_kuznetsovii_503587829 
------N-E--SMF--------SSRTGEW-----ETPQTFFDALDAEF----H-FTLDVCARPE-------NAKCA------RFFT---------PEQ-------------------DGLRQ---------SW-----AG-ETCW--------MNP----PYG--R-EIGRWVEKAY-----NE--AR-R---G-AV--V--VA--L---LPA-------RTDTRWW-HRYVM-RAA--EIR------FVEGRLKF----------GGAENSA-----------PFP--SVVVV-FT----------------- RBAU_RS10310_Bacillus_amyloliquefaciens_752856685 ---E--T-K--TNFNQGVFFNPEDRTDVW-----ATPIDFFNKINERY----K-LNLDVCAKPS-------NAKCK------NFFT---------PEI-------------------DGLKQ---------KW-----VG--RVW--------MNP----PYG--R-EIKKWIKKAY-----EE--VE-N---G-NS--EIAVC--L---VPA-------RTCSAWW-HEYCM-K-G--EIL------FIRHRLKF----------GGSKINA-----------PFP--NALVI-FS----------------- G454_RS0102995_Desulfovirgula_thermocuniculi_654100680 ------N-R--VLF--------SSATSEW-----ETPQELFARLHAEF----G-FTLDVCARPW-------NAKCT------RYFS---------PEQ-------------------NGLIQ---------EW-----AP-ETCW--------MNP----PYG--R-EISRWVRKAW-----EE--AQ-K---G-AT--V--VC--L---LPS-------RTDTAWW-HEYVM-RAA--EIR------FIRGRLHF----------EGAKNGA-----------PFP--SCVVV-FR----------------- K035_3825_Acinetobacter_baumannii_691039509 -------------------------------------QDFFEKLDRVF----N-FDLDVCALPE-------NAKCE------RYFT---------PEI-------------------DGLKQ---------EW-----TG--TCW--------MNP----PYG--K-EIIDWVAKAA-----ET--AS-K---G-HT--V--VA--L---VPV-------RTDARWF-QDYCL-G-R--EIH------FIRGRLKF----------GGSKTNA-----------PFG--CCVVV-FR----------------- TS65_RS13365_Aneurinibacillus_migulanus_759006369 ------T-A--VMF--------SSATDEW-----ATPQDFFDQLNQEF----H-FTLDPCATHE-------SAKCA------RYFT---------EED-------------------NGLAQ---------DW-----TG-EIVF--------MNP----PYG--R-VLGQWVKKAF-----EE--SI-K---G-AT--V--VC--L---LPA-------RTDTRWF-HDYIY-HRA--EIR------FVKGRLKF----------GDSKNSA-----------PFP--SMVVI-FN----------------- MMA_RS11485_Janthinobacterium_sp_Marseille_501027971 
------S-K--VHF--------SSATPEW-----YTPQSTFDVLNAEF----G-FTLDPCCTHE-------NAKCD------RHFT---------MAE-------------------NGLSQ---------DW-----SN-EVTF--------MNP----PYG--R-EIKEWMRKAY-----ES--SL-S---G-AT--V--VC--L---VPA-------RTDTAWW-HDYSI-K-G--EIR------FLRGRLKF----------GGAKTNA-----------PFP--SAIVI-FR----------------- Q332_RS01180_Pseudobacteroides_cellulosolvens_739064083 ------T-E--IMF--------SSKSDEW-----ETPQQFFDKLHKEF----N-FQLDVCATAE-------NAKCD------KYYT---------KID-------------------DGLSQ---------SW-H-HWAQ--RCW--------MNP----PYG--R-NIDKWIKKAF-----DE--SQ-E---G-AT--V--VC--L---IPA-------RTDTKYW-HTYCM-K-AH-EIR------FVKGRLKF-S---------NSKDCA-----------PFP--SAIVV-FK----------------- HMPREF0179_03455_Bilophila_wadsworthia_3_1_6_316921487 ------MNP--ALF--------SSAKEDW-----ETPREFFERLDGEF----H-FDLDVCAFPH-------NAKCP------TYFT---------KED-------------------DGLAR---------DW-----GN-RVCW--------MNP----PYG--K-AIKAWMTKAL-----DA--SR-R---G-AT--V--VC--L---VPS-------RTDTAWW-HDTVI-A-GGAEVR------FARGRLRF----------VGAEHPA-----------PFP--SAVVI-FR----------------- HMPREF0179_RS04985_Bilophila_wadsworthia_749811142 ------MNP--ALF--------SSAKEDW-----ETPREFFERLDGEF----H-FDLDVCAFPH-------NAKCP------TYFT---------KED-------------------DGLAR---------DW-----GN-RVCW--------MNP----PYG--K-AIKAWMTKAL-----DA--SR-R---G-AT--V--VC--L---VPS-------RTDTAWW-HDTVI-A-GGAEVR------FARGRLRF----------VGAEHPA-----------PFP--SAVVI-FR----------------- RDMS_RS01750_Deinococcus_sp_RL_736377798 ------M-A--VHY--------SSEKHDW-----TTPRSFFDELNAEF----N-FTLDAAASPH-------NALCS------RYFT---------EAD-------------------DGLSQ---------PW-----TG-TV-W--------CNP----PYG--R-QIGRWIAKAA-----QS--AC-E---G-AT--V--VM--L---IPA-------RTDTAAW-HDHILFN-PQAEVR------FVRGRLRF----------GDATANA-----------PFP--SAVII-FR----------------- NZ45_03810_Clostridium_botulinum_700273311 
------T-A--VMF--------SSETDLW-----ATPQDFFDKLNKEF----D-FDLDPCATHE-------NAKCS------KYFT---------KEI-------------------DGLKQ---------DW-----QG-HKVF--------CNP----PYG--R-GIKDWVEKAY-----KE--SK-K-E-N-TT--V--VM--L---IPA-------RTDTRYF-HEYIY-H-KAKEIR------FVKGRLKF----------GSAKNSA-----------PFP--SMVVV-FR----------------- BZ26_RS0118830_Clostridium_botulinum_489480013 ------T-A--VMF--------SSETDLW-----ATPQDFFDKLNKEF----N-FDLDPCATKE-------NAKCS------KYFT---------KEI-------------------DGLKQ---------DW-----GR-YRVF--------CNP----PYG--R-EIGKWVEKAY-----KE--SK-K-Q-N-TT--V--VM--L---IPA-------RTDTKYF-HSYIY-H-KAKEIR------FIKGRLKF----------GNAKNSA-----------PFP--SMIVV-FR----------------- A11W_RS0107210_Staphylococcus_hominis_515743089 ------M-E--VHY--------SSKSNEW-----ATPQNLFDELNEEF----N-FTLDPCATDE-------NAKCS------KYFT---------IED-------------------DGLSK---------DW-----SK-DVVF--------MNP----PYG--R-EIKKWNKKAY-----EE--SL-N---G-AT--V--VC--L---IPA-------RTDTTYW-HDFIF-D-RADDIR------FLRGRLKF----------GNSKNSA-----------PFP--SAIVV-YR----------------- V006_02512_Staphylococcus_aureus_686297326 ------M-E--VHY--------SSKTNEW-----TTPQNLFDELNGEF----N-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR----------------- HMPREF9988_RS10060_Staphylococcus_epidermidis_488427723 ------M-E--VHY--------SSKSNEW-----ATPQKLFDELDKEF----N-FTLDPCATDE-------NAKCN------KHFT---------IED-------------------DGLSK---------DW-----SK-DVVF--------MNP----PYG--R-EIKKWIKKAY-----EE--SL-N---G-AT--V--VC--L---IPA-------RTDTTYW-HDFIF-D-KADDIR------FLRGRLKF----------GNSKNSA-----------PFP--SAIVV-YL----------------- T259_RS08765_Clostridium_botulinum_748203410 
------T-A--VMF--------SSETDLW-----ATPQDFFDKLNKEF----N-FDLDPCATHE-------NAKCS------KYFT---------KEI-------------------DGLKQ---------DW-----QG-YKVF--------CNP----PYG--R-VLKDWVKKCY-----EE--SL-K-P-N-TT--V--VM--L---IPA-------RTDTKYF-HEYIY-H-KVKEIR------FVKGRLKF----------GDAKNSA-----------PFP--SMVVV-F------------------ CO98_RS04645_Staphylococcus_aureus_739716594 ------M-S--VHF--------SSKSNEW-----TTPQYLFDELNEEF----N-FTLDPCATDE-------NAKCS------KYFT---------IED-------------------DGLSK---------DW-----SN-DVVF--------MNP----PYG--R-EIKKWIKKAY-----EE--SL-N---G-AT--V--VC--L---IPA-------RTDTTYW-HDFIF-D-KADDIR------FLKGRLKF----------GNSKNSA-----------PFP--SSIVI-YE----------------- TH16_RS01985_Staphylococcus_caprae_488372936 ------M-S--VHF--------SSKSNEW-----YTPQYLFDELNEKY----Q-FTLDPCASHE-------NAKCD------KYFT---------IED-------------------DGLTK---------DW-----SK-DIVF--------MNP----PYG--R-NIKHWIKKAY-----EE--SV-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-NAYNIK------FLKGRIKF----------GGAVNSA-----------PFP--SAIVV-FK----------------- PI74_RS05125_Clostridium_botulinum_500994137 ------T-A--VMF--------SSGTDLW-----ATPQDFFDKLNKEF----D-FDLDPCATHK-------NAKCS------KYFT---------KEI-------------------DGLKQ---------DW-----QG-YKVF--------CNP----PYG--R-SIKDWVEKAY-----KE--SK-K-E-N-TT--V--VM--L---IPA-------RTDTRYF-HEYIY-N-KAKEIR------FVKGRLKF----------GDAKNSA-----------PFP--SMVVV-F------------------ RK90_RS13240_Staphylococcus_aureus_446374006 ------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR----------------- SA930_RS14870_Staphylococcus_aureus_446374005 
------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR----------------- AS94_12270_Staphylococcus_aureus_686449191 ------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVEKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR----------------- U183_02276_Staphylococcus_aureus_686300364 ------M-E--VHY--------SSKTNEW-----TTPQNLFDDLNREF----N-FTLDPCSTDE-------NAKCQ------KHYT---------END-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKHWVKKAY-----EE--SI-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GESKNSA-----------PFP--SAIIV-YR----------------- Phi93_04_Lactococcus_phage_phi93_673939868 ------N-E--LMF--------SSKTDLW-----STPNDFFDKLNDEF----H-FTLDPCSTHE-------NAKCY------KHFT---------KEE-------------------NGLLQ---------DW-----GN-EVVF--------CNP----PYG--R-QIKEWIKKSY-----EE--SQ-K-D-N-TT--V--VM--L---IPA-------RTDTIYF-HEYIY-H-KA-EIR------FIKGRLKF----------GNAKNSA-----------PFP--SMVVI-FE----------------- RL05_RS02180_Staphylococcus_aureus_446374007 ------M-E--VHY--------SSKTNEW-----TTPQHLFDDLSEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR----------------- SAZ172_RS05790_Staphylococcus_aureus_554679133 
------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PLP--SAIIV-YR----------------- CLOSCI_00567_[Clostridium]_scindens_ATCC_35704_167664126 --------K--ALF--------SSAKEDW-----ATPQDFFDELNKEF----H-FDLDPCADAE-------NAKCK------EFFT---------KEQ-------------------NGLLQ---------DW-----GG-RCVF--------CNP----PYG--RTSTGEWIKKCY-----EE--AQ-K-P-G-TV--V--VA--L---IPA-------RTDTRFF-HDYIY-H-KA-EIR------FIKGRLHF----------GGCKDAA-----------PFP--SMVVV-FR----------------- ERS140248_02184_Staphylococcus_aureus_678260344 ------M-E--VHY--------SSKTNEW-----ATPQNLFDDLNREF----N-FTLDPCSTDE-------NAKCQ------KHYT---------AKD-------------------NGLIQ---------DW-----SE-DVVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SV-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------SESKNSA-----------PFP--SAIIV-YR----------------- T666_02640_Staphylococcus_aureus_686391504 ------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKCWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR----------------- SD74_RS18965_Clostridium_botulinum_752703286 ------T-A--VMF--------SSETDLW-----ATPQDFFDELNKEF----D-FDLDPCATHE-------NAKCD------KYYT---------IVE-------------------DGLKQ---------DW-----QG-HKVF--------CNP----PYG--R-GIKDWVEKAY-----KE--SK-K-E-N-TT--V--VM--L---IPA-------RTDTKYF-HSYIY-H-KAKEIR------FIKGRLKF----------GDAKNSA-----------PFP--SMVVV-F------------------ W619_00569_Staphylococcus_aureus_686419170 
------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNGEF----N-FTLDPCSTDE-------NAKCQ------KHYT---------AKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKHWVKKAY-----EE--SV-K---G-AT--V--VC--L---IPA-------RTDTTYW-HDYIF-N-KADDIR------FLRGRLKF----------GESKNSA-----------PFP--SAIIV-YR----------------- CLOSCI_RS06430_[Clostridium]_scindens_748651356 --------K--ALF--------SSAKEDW-----ATPQDFFDELNKEF----H-FDLDPCADAE-------NAKCK------EFFT---------KEQ-------------------NGLLQ---------DW-----GG-RCVF--------CNP----PYG--RTSTGEWIKKCY-----EE--AQ-K-P-G-TV--V--VA--L---IPA-------RTDTRFF-HDYIY-H-KA-EIR------FIKGRLHF----------GGCKDAA-----------PFP--SMVVV-FR----------------- SAGV69_RS11740_Staphylococcus_aureus_506511035 ------M-E--VHY--------SSKTNEW-----TTPQHLFDDLNEEF----S-FTLDPCSTDE-------NAKCR------KYYT---------VKD-------------------NGLIQ---------DW-----SE-DIVF--------MNP----PYG--R-SIKRWVKKAY-----EE--SL-K---G-AT--V--VC--L---IPA-------RTDTTYW-RDYIF-N-KADDIR------FLRGRLKF----------GDSKNSA-----------PFP--SAIIV-YR----------------- OR63_RS06485_Clostridium_tetani_737140426 ------T-A--VMF--------SSETDLW-----ATPQEFYNELNKEF----N-FDLDPCATHE-------NAKCP------KYYT---------VVE-------------------DGLKQ---------DW-----QG-HKVF--------CNP----PYG--R-EISKWVEKAY-----KE--SK-K-E-N-TT--V--VM--L---IPA-------RTDTKYF-HSYIY-R-KAKEIR------FIKGRLKF----------GNAKNSA-----------PFP--SMVVV-F------------------ AWRIB429_RS09790_Oenococcus_oeni_768719850 ------N-E--LMF--------SSKTDLW-----STPNDFFDKLNDEF----H-FTLDPCSTHE-------NAKCY------KHFT---------KEE-------------------NGLLQ---------DL-----GN-EVVF--------CNP----PYG--R-QIKDWVKKSY-----EE--SQ-K-D-N-TT--V--VM--L---IPA-------RTDTIYF-HEYIY-H-KA-EIR------FIKGRLKF----------GNAKNSA-----------PFP--SMVVI-FE----------------- BN927_RS09785_Lactococcus_lactis_554763517 
------K-E--LMF--------SSKTDLW-----STPWNFFDKLNDEF----H-FTLDPCSTHE-------NAKCY------KHFT---------IEE-------------------DGLLQ---------DW-----GN-EVVF--------CNP----PYG--R-QIKDWVKKAY-----EE--SQ-K-D-D-TT--V--VM--L---IPA-------RTDTIYF-HEYIY-H-KA-EIR------FIKGRLKF----------GDAKNAA-----------PFP--SMVVI-FR----------------- EMTOL_RS19950_Emticicia_oligotrophica_504839093 ---N--I-K--AIFSC--------KTTNW-----ETPQDLFDELDKQY----N-FTLDVCATSE-------NAKCN------EFFT---------PEI-------------------DGLKQ---------EW-----KG--MCW--------MNP----PYG--R-EIGKWVRKAH-----LE--VI-T---G-RC--RI-IA--L---LPA-------RTDTKWF-HEWVLNK-H--EIK------FIKGRLRF----------SDSKNSA-----------PFP--SMLVI-FE----------------- BN981_00304_Halobacillus_trueperi_635344555 ------M-N--VHY--------SSKSNDW-----ATPQDFFDGLDNEF----N-FTLDPCATSE-------NAKCD------NYFT---------IED-------------------DGLKQ---------SW-----EG-ETVF--------CNP----PYG--R-EIKLWVKKAF-----QE--SK-K-P-N-TK--V--VM--L---IPA-------RTDTKYF-HDYIY-M-QA-RVR------FIKGRLKF----------GNGKGNA-----------PFP--SMVVI-F------------------ BN981_RS01350_Halobacillus_737533832 ------M-N--VHY--------SSKSNDW-----ATPQDFFDGLDNEF----N-FTLDPCATSE-------NAKCD------NYFT---------IED-------------------DGLKQ---------SW-----EG-ETVF--------CNP----PYG--R-EIKLWVKKAF-----QE--SK-K-P-N-TK--V--VM--L---IPA-------RTDTKYF-HDYIY-M-QA-RVR------FIKGRLKF----------GNGKGNA-----------PFP--SMVVI-F------------------ BN981_RS01320_Halobacillus_737532221 ------M-D--VHY--------SSKTNEW-----ATPQDFFDELNTEF----N-FTLDPCATPD-------NAKCD------KYFT---------EKD-------------------DGLEQ---------SW-----EG-ETVF--------CNP----PYG--R-GIKHWVKKAY-----QE--ST-K-P-N-TT--V--VL--L---IPS-------RTDTRYF-HDYVY-H-KS-EIR------FLKGRLKF----------GDGSGNA-----------PFP--SMVAI-YR----------------- QI18_RS10395_Lactococcus_lactis_746045508 
------R-E--LMF--------SSKTDLW-----STPWNFFEKLNDEF----H-FTLDPCSTHE-------NAKCY------KHFT---------IKE-------------------DGLLQ---------DW-----GN-EVVF--------CNP----PYG--R-KIKDWVKKAY-----EE--SQ-K-D-N-TT--V--VM--L---IPA-------RTDTIYF-HEYVY-H-KA-EVR------FIKGRLKF----------GDAKNAA-----------PFP--SMVVI-FR----------------- RM98_RS18265_Chromobacterium_violaceum_759932528 ---S--E-Q--VHF--------SSKTDEW-----PTPQALFDQLHEEF----G-FTLDVCATAE-------NAKCE------RFFT---------REQ-------------------DGLAQ---------DW-----SR-DVVW--------MNP----PFG--H-QIKLWMAKAY-----RS--SI-D---G-AL--V--VC--L---VPA-------RTDTRWF-HRHALKA-A--EIR------ALDKRLRF----------DGAKAKA-----------PFP--AVLVV-Y------------------ RN16_RS04075_Chromobacterium_subtsugae_759887196 ---S--E-Q--IHF--------SSKTDEW-----PTPQALFDQLHAEF----G-FTLDVCATQE-------NAKCE------RFFT---------REQ-------------------DGLAQ---------DW-----SR-EVVW--------MNP----PFG--H-QIKLWMAKAY-----RS--SI-D---G-AL--V--VC--L---VPA-------RTDTRWF-HRHALKA-A--EIR------ALDKRLRF----------DGAKAKA-----------PFP--AVLVV-Y------------------ KU40_RS04850_Clostridium_botulinum_737823765 ------------MF--------SSKTDMW-----STPQDFYNKLNQEF----N-FNLDPCSTNE-------NAKCE------RHYT---------IAE-------------------DGLKQ---------NW-----VG-STVF--------CNP----PYG--R-VLKDWVKKCY-----EE--SK-K-D-N-TT--V--VM--L---IPA-------RTDTTYF-HNYIY-K-KVKEIR------FIRGRLKF----------GDCKNAA-----------PFP--SMVVV-F------------------ DALK_RS23730_Desulfatibacillum_alkenivorans_506429612 -------------------------NCEW-----ATPQDLFDSLNKEF----H-FTLDPCCTIE-------NAKCE------RFYT---------KAE-------------------DGLSQ---------DW-----TG-ETVF--------MNP----PYS--RSEMPKWIQRAY-----ES--SL-A---G-SK--V--VC--L---LPA-------KTDTRWF-HDFCL-K-G--EIR------FIKGRICF----------GSGEGRA-----------PFP--SMVVI-FN----------------- CC61_RS14530_Chromobacterium_sp_C-61_748184431 
---A--E-N--VHF--------STGKDEW-----PTPQALFDQLNAEF----G-FTIDVCATAK-------NAKCT------KFYT---------QVD-------------------DGLAQ---------NW-----AG-EVVW--------MNP----PFG--H-SIKLWMAKAY-----RS--SL-D---G-AL--V--VC--L---VPA-------RTDTRWW-HRVVMKA-S--EVR------VLDKRLRF----------DGGNHKA-----------PFP--AVVVV-F------------------ SAG0375_00225_Streptococcus_agalactiae_GB00984_527786367 ------Q-K--SLL--------SSDKDYW-----ETPQTFFKKLNNEF----D-FDLDVASSHD-------NAKCK------NHFT---------VVE-------------------DGLSQ---------DW-----TG--NVF--------CNP----PYG--R-EIGKWVEKAY-----KE--SL-K-PYN-NV--I--VL--L---IPA-------RTDTKYW-HDYIF-G-KAKDIR------YLKGRLKF-T-I-----NGKENYPA-----------PFP--SAVII-F------------------ SAG0375_RS111635_Streptococcus_agalactiae_487848063 ------Q-K--SLL--------SSDKDYW-----ETPQTFFKKLNNEF----D-FDLDVASSHD-------NAKCK------NHFT---------VVE-------------------DGLSQ---------DW-----TG--NVF--------CNP----PYG--R-EIGKWVEKAY-----KE--SL-K-PYN-NV--I--VL--L---IPA-------RTDTKYW-HDYIF-G-KAKDIR------YLKGRLKF-T-I-----NGKENYPA-----------PFP--SAVII-F------------------ DK41_RS08970_Streptococcus_agalactiae_642982737 ------Q-K--SLL--------SSDKDYW-----ETPQTFFKKLNNEF----D-FDLDVASSHD-------NAKCK------NHFT---------VVE-------------------DGLSQ---------DW-----TG--NVF--------CNP----PYG--R-EIGKWVEKAY-----KE--SL-K-PYN-NV--I--VL--L---IPA-------RTDTKYW-HDYIF-G-KAKDIR------YLKGRLKF-T-I-----NGKENYPA-----------PFP--SAVII-Y------------------ AB660_RS05030_Chromobacterium_subtsugae_828144310 ---D--A-S--IHF--------RSSTDEW-----PTPQLLFDELHAEF----Q-FTVDVCATPG-------NAKCP------RYYT---------RAD-------------------DGLAQ---------DW-----SA-ETVW--------MNP----PFG--H-GIKFWMEKAL-----KS--AR-A---G-AT--V--VC--L---VPS-------RTDTRWW-HRYAMWA-A--EIR------CLDKRLQF----------DGGSAKA-----------PFP--AVVIV-F------------------ ANACOL_RS13845_Anaerotruncus_colihominis_493931641 
------N-K--ALL--------SSKRLDW-----CTPRDFFDALDVEF----H-FTLDAAATEK-------SAKCA------KYYT---------PET-------------------DGLSA---------SW-----AG-ETVF--------CNP----PYG--R-EIKAWIKKGF-----EE--GQ-Q-S-G-TT--V--VL--L---IPS-------RTDTEYF-HKYIL-G-KA-EIR------FLKGRLKFTD-E-----EGLTQDAA-----------PFP--SMLVI-YR----------------- ELEN_RS13090_Eggerthella_lenta_506241510 ------G-G--VAF--------SSERHYW-----ETPQDLFDTLDNEF----H-FTLDPASTDE-------NAKCE------KHYT---------IED-------------------DGLCQ---------SW-----AG-ERVF--------CNP----PYG--R-ELSKWVKKAH-----AE--VALN-P-G-TV--V--VM--L---IPA-------RTDTTYF-HDYIY-H-KA-EVR------FIRGRLRF-C-I-----QGKAKDAA-----------PFP--SMVVV-FR----------------- VE20213_RS09880_Clostridiales_bacterium_VE202-13_639741003 --------------------------MDY-----CTPQDFFDKLNQEF----H-FTLDAAATSK-------SAKCP------QYYT---------PEI-------------------DGIKN---------PW-SIAGGG--AVF--------CNP----PYG--R-KIGKWVRKAY-----EE--SR-N---G-TT--V--VL--L---IPA-------RTDTAYF-HDYIY-G-CA-EIR------FVRGRLHF-TDE-----DGNTYDRA-----------PFP--SMVVI-YN----------------- T370_RS0102475_Bilophila_wadsworthia_736486878 --------N--VHF--------LSKKHDW-----ATPWPLFRELNARF----GPCELDVCATAR-------NAKCG------NFFS---------PEE-------------------DGLRQ---------VW-----HG--VCW--------MNP----PYG--R-ALPHWMAKAV-----NEIEME-R---A-ER--V--IC--L---LPA-------RTDTAWW-HRYVL-P-FAAEIH------YLRGRIRF----------EGAGSSA-----------PFP--SAVVI-F------------------ HMPREF0555_0745_Leuconostoc_mesenteroides_subsp_cremoris_ATCC_19254_227352467 ------D-K--VLF--------SSNSMVW-----ETPKDYFDKLNRKF----K-FDLDACASDT-------NHKVD------TYFT---------EDD-------------------NALEQ---------KW-----GG--NVF--------MNP----PYG--R-HIGKFIKKAY-----EE--HL-R-DPN-RF--I--VM--L---IPS-------RTDTKYW-HEYIQ-D-KAT-VK------FIKGRLKF-E-I-----DGESMDAA-----------PFP--SALVV-YG----------------- HMPREF0555_RS01180_Leuconostoc_mesenteroides_738135700 
------D-K--VLF--------SSNSMVW-----ETPKDYFDKLNRKF----K-FDLDACASDT-------NHKVD------TYFT---------EDD-------------------NALEQ---------KW-----GG--NVF--------MNP----PYG--R-HIGKFIKKAY-----EE--HL-R-DPN-RF--I--VM--L---IPS-------RTDTKYW-HEYIQ-D-KAT-VK------FIKGRLKF-E-I-----DGESMDAA-----------PFP--SALVV-YG----------------- L964_RS00605_Leuconostoc_pseudomesenteroides_491052808 ------S-K--ALF--------SSKSMVW-----ETPKDYFDKLNRKF----K-FDLDACASDT-------NHKVD------TYFT---------EDD-------------------DALEQ---------KW-----GG--NVF--------MNP----PYG--R-HIGEFIKKAY-----EE--HL-R-DPN-RF--I--VM--L---IPS-------RTDTKYW-HEYIQ-D-KAT-VK------FIKGRLKF-E-L-----DGRPMNTA-----------PFP--SALII-YG----------------- OPIT5_22060_Opitutaceae_bacterium_TAV5_573475515 -------------------M-TSSMDMTW-----GTPQVWFDYLHLEF----G-FTLDPCCLHQ-------TAKCK------KHYT---------PAE-------------------DGLAQ---------SW-----AE-ERVF--------MNP----PYG--R-DLPKWMKKAY-----EE--AR-D-N-G-TL--I--VC--F---VPA-------RVDTEWW-HRYAT-K-G--EVR------FPKGRVKF----------ADALDSA-----------PFP--VAVVI-FR----------------- OPIT5_RS20660_Opitutaceae_bacterium_TAV5_763429761 ------------------------MDMTW-----GTPQVWFDYLHLEF----G-FTLDPCCLHQ-------TAKCK------KHYT---------PAE-------------------DGLAQ---------SW-----AE-ERVF--------MNP----PYG--R-DLPKWMKKAY-----EE--AR-D-N-G-TL--I--VC--F---VPA-------RVDTEWW-HRYAT-K-G--EVR------FPKGRVKF----------ADALDSA-----------PFP--VAVVI-FR----------------- BR71_RS03710_Chromobacterium_haemolyticum_759948263 -TDE--A-S--IHF--------RSTRDDW-----ETPQDLFDALHAEF----G-FTVDVCASDK-------TAKCV------RYYT---------KAD-------------------NGLAK---------DW-----SN-EVVW--------MNP----PFG--H-VTKRWMDKAR-----LS--SM-R---G-AT--V--VC--L---VPA-------RVSVLWW-HRNVFLA-S--EVR------CLRPRLQF----------VGAAQKA-----------PFD--AVLVI-FR----------------- RM98_RS09640_Chromobacterium_violaceum_759929100 
---A--E-N--IHF--------RSGRDDW-----ETPHDLFASLNAEF----G-FTVDVCASEK-------TAKCP------RYYT---------PAM-------------------NGLAQ---------DW-----GG-ETVW--------MNP----PFG--H-VTKRWMDKAR-----LS--SL-Q---G-AT--V--VC--L---VPA-------RTSVLWW-HRNVFLA-S--EVR------CIRPRLQF----------VGAAQKA-----------PFD--AVLVV-F------------------ CLOM621_RS14915_Clostridiales_492715347 ------N-D--ALL--------SSKNMCW-----CTPPDFFAELDREF----H-FELDPASTDK-------SAKCA------KHFT---------PDD-------------------DGLKQ---------DW-----GG-YCVF--------CNP----PYG--R-AIADWVRKGY-----EE--SR-K-P-G-TT--V--VM--L---IPS-------RTDTAYF-HDWIF-G-KASEVR------FLRGRLKFTD-E-----DGNGEDAA-----------PFP--SAVIV-WR----------------- HMPREF1020_RS23965_Clostridium_sp_7_3_54FAA_496656604 ------N-D--ALL--------SSKNMCW-----CTPPDFFAELDREF----H-FELDPASTDK-------SAKCA------KHFT---------PDD-------------------DGLKQ---------DW-----GG-YRVF--------CNP----PYG--R-AIADWVRKGY-----EE--SR-K-P-G-TT--V--VM--L---IPS-------RTDTAYF-HDWIF-G-KASEVR------FLRGRLKFTD-E-----DGNGEDAA-----------PFP--SAVIV-WR----------------- N644_0465_Lactobacillus_plantarum_AY01_544589963 ------N-K--ALF--------TSNKEDW-----ETPQDFYDRLNAKY----H-FEWDLAASDG-------NAKCG------DYFT---------SDD-------------------NSLEQ---------DW-E-RLSG--NLF--------LNP----PYG--R-ELKLWVKKAS-----ET--QL-K---H-DQF-L--VM--L---IPS-------RTDTSYW-HDYIF-N-HA-EIE------FLRGRLKF-E-V-----DGVGGDSA-----------PFP--SAVVI-YT----------------- ZJ316_RS06725_Lactobacillus_plantarum_505193070 ------N-K--ALF--------TSNKEDW-----ETPQDFYDRLNAKY----H-FEWDLAASDG-------NAKCG------HYFT---------SDD-------------------NSLEQ---------DW-E-RLSG--NLF--------LNP----PYG--R-ELKLWVKKAS-----ET--QL-K---H-DQF-L--VM--L---IPS-------RTDTSYW-HDYIF-N-HA-EIE------FLRGRLKF-E-V-----DGVGGDSA-----------PFP--SAVVI-YT----------------- HMPREF0178_RS14615_Bilophila_sp_4_1_30_496774991 
------N-R--ALF--------SSVKDDW-----PTPWEFFHNLDLEF----D-FTLDVCAVPW-------SAKVW------RYCV---------PPHALRVWGETTFRRLFPDALVDGLAH---------SW-----AG-ERCY--------MNP----PYG--R-EIGPWVEKAR-----RE--AE-R---G-AL--V--VG--L---LPA-------RTDTAWF-HEHVY-R-AATEIR------FLKGRLKF----------EGAAASA-----------PFP--SMIAV-WG----------------- HMPREF0179_RS15850_Bilophila_wadsworthia_491165768 ------N-R--ALF--------SSVKDDW-----PTPWEFFRNLDLEF----D-FTLDVCAVPW-------SAKVC------RYCV---------PPHALRVWGETTFRRLFPDALVDGLAH---------SW-----AG-ERCY--------MNP----PYG--R-EIGPWVEKAR-----RE--GE-R---G-AL--V--VG--L---LPA-------RTDTAWF-HEHVY-R-AATEIR------FLKGRLKF----------EGAAASA-----------PFP--SMIAV-WG----------------- H627_RS17735_Lactobacillus_harbinensis_737460398 KP----G-G--AAL--------TSNKDDW-----ETPQAFFESLNAKY----H-FAIDLAASKD-------NAKCD------RYFS---------VAD-------------------DSLLQ---------DWSD-DFGG--AMY--------LNP----PYG--R-HIGDWVKKAY-----ET--SL-R---V-NVP-I--VL--L---IPA-------RTDTSYW-HDYIF-G-KA-SIK------FIRGRLKF-E-Q-----NGMAGGPA-----------PFP--SAIIV-YN----------------- A323_gp73_Acinetobacter_bacteriophage_AP22_388570840 ------M-N--VHF--------SSDKQTW-----ETPQDLFDKLNDIF----N-FNLDACAEHD-------TAKVK------KYFT---------IDD-------------------NALIQ---------DW-----IG-S-VW--------CNP----PYN--R-EQIKFIEKAL-----NE--SL-K-H-K-ST--V--VL--L---IPA-------RPETKVW-QNVIF-K-SASQIC------FIKGRLKF----------GNSKYNA-----------PFP--SALIV-FG----------------- TY47_RS06930_Lactobacillus_brevis_754895979 ------N-N--ALL--------SSEKNYW-----ETPHDFFKKLNEKY----Y-FSFDLAASPE-------NTKCE------NFFS---------EED-------------------NSLTK---------AW-H-ELKG--NLF--------LNP----PYG--R-ELRKWVKKAY-----EE--SL-K---K-HDGYI--VL--L---IPA-------RTDTSYW-HDFIF-G-KA-QIN------FLRGRIKF-E-L-----HGESKDAA-----------PFP--SAIVI-YG----------------- N644_RS02335_Lactobacillus_plantarum_727092536 
------N-K--ALF--------TSNKEDW-----ETPQDFYDRLNAKY----H-FEWDLAASDG-------NAKCG------DYFT---------SDD-------------------NSLEQ---------DW-E-RLSG--NLF--------LNP----PYG--R-ELKLWVKKAS-----ET--QL-K---H-DQF-L--VM--L---IPS-------RTDTSYW-HDYIF-N-HA-EIE------FLRGRLKF-E-V-----DGVGGDSA-----------PFP--SAVVI-Y------------------ MCOL2_RS04700_Listeria_fleischmannii_738104299 ------D-R--VIF--------SSERDDW-----ETPTDLFNELDKEF----L-FDLDATANKN-------NAKCP------KFFT---------KEQ-------------------NALVQ---------EW-----RG--SVF--------CNP----PYG--R-EIQKFIEKAY-----IE--SK-K-AYC-ER--V--VL--L---IPA-------RTDTKIW-HDFIF-P-FSKEII------FIKGRLKY-E-L-----NKISNSPA-----------PFP--SAIII-FE----------------- G469_RS0106650_Atopobium_fossor_654811069 ------T-S--GLR--------SSASNEW-----TTPKDLFDELNREF----K-FTVDAASTHE-------NALVD------KHWT---------LAE-------------------DGLAQ---------CW-----DG-ERVW--------CNP----PYG--R-QIAQWVKKAS-----EA--V------G-GV--V--VM--L---IPA-------RTDTSYW-HDYVF-P-NASDIR------FIRGRLHF----------SQSKTAA-----------PFP--SAIVV-FE----------------- B7017_p0034_Bifidobacterium_breve_704484626 ------G-A--AAM--------TSNKDDW-----ETPQALFDQLDKEF----H-FTLDAASNDQ-------NAKCE------HHYT---------AEN-------------------SGLEH---------SW-----GG-ETVF--------CNP----PYG--R-NIGDWIRKAS-----QE--AS-K-P-D-TL--V--VL--L---VPA-------RTDTRWF-QNYIL-H-RA-EVR------FLPGRLKY-E-V-----DGQAGEAA-----------PFP--SMVVI-MR----------------- VPUCM_1151_Vibrio_parahaemolyticus_UCM-V493_584469889 ------L-D--VMFSS-ANS-GDKSKDKW-----QTPPEIFAQLNDRF----G-FTLDAAAEPE-------TALCE------KYFT---------EED-------------------DALKQ---------DW-----SG-HVVF--------CNP----PYS--K--LRVFAKKAY-----EE--SL-K---G-TT--V--VM--L---VPA-------RTDTQAC-HDYLA-N-G--EMY------FIRGRLKF-L-K-----VGELQDAA-----------PFP--SVVCV-LG----------------- Q331_RS21100_Afifella_pfennigii_736470177 
------H-Q--SLY--------SSRTEEW-----ETPPALFERLDRIF----G-FRLDACASPA-------NRKCE------TWFS---------AAD-------------------NALER---------SW---AEHG--RVW--------LNP----PYG--R-RIAGFMRKAF-----EE--SQ-K---G-AL--V--VA--L---VPA-------RTDTLWW-HEWVN-G-KA-DIV------FLKGRLKY-LDE-----NRRERSPA-----------PFP--SALVV-Y------------------ CLOM621_08346_Clostridium_sp_M62/1_291074040 --------------------------MCW-----CTPPDFFAELDREF----H-FELDPASTDK-------SAKCA------KHFT---------PDD-------------------DGLKQ---------DW-----GG-YCVF--------CNP----PYG--R-AIADWVRKGY-----EE--SR-K-P-G-TT--V--VM--L---IPS-------RTDTAYF-HDWIF-G-KASEVR------FLRGRLKFTD-E-----DGNGEDAA-----------PFP--SAVIV-WR----------------- consensus/100% .......................................................................................................................................................................................................................................................................................................................... consensus/95% ............................a...........h..h...a.......phD.hs..........s.p.........ahs...............................ssh............h....................h.P....P.......h...h.............................h........s........b............................................................................................. consensus/90% ............................W..........ha..h...a.......phD.hu..........s.ps........aho...............................ssh............W....................h.P....P.s.....l..hh.bh.......p...............l..lh......hP........b.s...h....................................................................................... consensus/85% ............................W..........hFp.lp..a......hplD.hu..........s.+s........aho...............................suh.p..........W...........a........h.P....Phs.....l..hh.+h......pp.....p...s.....l..lh......hP........b.s..ha.p.hh................h.....p.................h......................................... 
consensus/80% ........p.............p.....W.......s.phFpplp..a......hplD.hA..p.......NsKs.......paao...............................suh.p.........pW..........ha........h.P....Phu.....l.phl.Kh......pp.....p...s.....l..lh......hP........bss..aa.pphh.......p........h.bsbbp.............s...h...........P.s...........................
consensus/75% ........p.............p.pps.W.......spphFpplp.bF....p.hplD.hA..p.......NuKC.......paao...............................suh.p.........pW..........ha........hsP....Pau.....l.phlpKh......pp..s..p...s.....l..lh......hPh.......bss..aa.pphh.......ch.......albsbhch..........s.s.s.h...........Pbs...hh.l.h..................
consensus/70% ......p.p.............pspps.W......sspphFcpLspbF....p.FslDhhA..p.......NAKCp......paao...............................suLpp.........pW.....s....ha........hsP....Pau....bl.chlpKu......cp..u..p...s.....l..lh..h...lPh.......bscs.aa.pchhh......clp......alcsbh+a..........s.upssu...........Phs..shl.l.a..................
      <----N6A DNA methylase-------------------------------------------Str-2---------------------------------------Strand-4---------------------------------------------------------------------->
RES   M-FRRDLFESI---------Q--------------SSLG-V----------------------TFTYDAAC-NDEGTN----ALCARYASPGRSFLASNVAG----E-CVWINPPYSHIRDWQQHYMRCKASDPEH-T-S--AVFCVPA-WPQVHRLMQKAKYSLVARYPA---GTPLFSKPGPDGQR
ALIGN -----HHHHHH-----------------------------------------------------EEEEEE-------------EEEEE--------EE----------EEEE-------HHHHHHHHH-----------E--EEEEE-------HHHHHHHHHHHHEEE-------EEEE--------
HMM   ----HHHHHHH---------H--------------HH--------------------------EEEHHHHH-H----------EE--EE-----EEEE--------E-EEEE------HHHHHHHHH------------E--EEEEE----------------EEEEEEE------EEEEE-------
FREQ  ----HHHHHHH---------H--------------HHHH-H----------------------HHHHHHHH-----HH----HHHHHHHHH-----------------EE----------HHHHHHHHHHHHHH-----E--EEEE---------HHHHH-H---EEEEEE------EEEE-------
PSSM  ------HHHHH---------H--------------HH--------------------------EEEEEEE--------------------------------------EEEE-------HHHHHHHHHHHHH-------E--EEEEEE-----HHHHHHHH--EEEEEEE------EEEEE-------
CONF  7-848847687---------8--------------8715-4----------------------56405651-457723----56532211052111443488----0-5964488878157899989887740299-5-0--8998515-776004566505310699970---86377747888864
FINAL ----HHHHHHH---------H--------------HHH-------------------------EEEHHHHH------H----HHHHHHH-----------------E-EEEE-------HHHHHHHHHHHHHH------E--EEEEE-------HHHHHHH---EEEEEEE-----EEEEE-------
SOL25 B-B---BB--B--------------------------B--B-----------------------BBBBBBB-----------BBB--BBB----BB---B--------BBBBBBBB--B--BB--BB-B--------B-B--BBBBBB--B----BB---B-B-BBB-BB-------BBB--------
SOL5  -------B--B-----------------------------------------------------BBB-B----------------------------------------BBB---------B--------------------BBBBBB-----------------B----------------------
SOL0
------------------------------------------------------------------B-B--------------------------------------------------------------------------BBB------------------------------------------ _Crei_29423677 M-FRRDLFESI---------Q--------------SSLG-V----------------------TFTYDAAC-NDEGTN----ALCARYASPGRSFLASNVAG----E-CVWINPPYSHIRDWQQHYMRCKASDPEH-T-S--AVFCVPA-WPQVHRLMQKAKYSLVARYPA---GTPLFSKPGPDGQR _Crei_29423694 M-FRRDLFESI---------Q--------------SSLG-V----------------------TFTYDAAC-NDEGTN----ALCARYASPGRSFLASNVAG----E-CVWINPPYSHIRDWQQHYMRCKASDPEH-T-S--AVFCVPA-WPQVHRLMQKAKYSLVARYPA---GTPLFSKPGPDGQR Vcar1000014369_Vcar_Vcar1000014369 M-FLPDEFRNV---------E--------------NMLG-R----------------------QFTFDAAC-NNSGDN----SLCTRFASPSNSFLTSDVSG----EFFVWANPPYTKIELWHKHYLRCKRMRPET-T-S--AVFVVPK-WSQIERRMRAAGHQLLKTYAV---GTKLFLEKADDGSR Vcar1000006797_Vcar_Vcar1000006797 M-FLPDEFRNV---------E--------------NMLG-R----------------------QFTFDAAC-NNSGDN----SLCTRFASPSNSFLTSDVSG----E-FVWANPPYTKIELWHKHYLRCKRMRPET-T-S--AVFVVPK-WSQIERRMRAAGHQLLKTYAV---GTKLFLEKADDGSR Vcar1000013547_Vcar_Vcar1000013547 M-YLPDEFRNV---------E--------------NMLG-R----------------------QFTFDAAC-NNSGDN----SLCTRFASPSNSFLTSDVSG----E-FVWANPPYTKIELWHKHYLRCKRMRPET-T-S--AVFVVPK-WSQIERRMRAAGHQLLKTYAV---GTKLFLEKADDGSR Vcar1000003571_Vcar_Vcar1000003571 M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTHVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR Vcar1000013363_Vcar_Vcar1000013363 M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFVVPKDVFTFECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR Vcar1000006689_Vcar_Vcar1000006689 M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAIPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR Vcar1000010818_Vcar_Vcar1000010818 
M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFLDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR Vcar1000011421_Vcar_Vcar1000011421 M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSCFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR Vcar1000012306_Vcar_Vcar1000012306 M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR Vcar1000012957_Vcar_Vcar1000012957 M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFAQPVDVGTR Vcar1000013369_Vcar_Vcar1000013369 M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR Vcar1000013410_Vcar_Vcar1000013410 M-FLRDEFRRV---------E--------------TELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR Vcar1000006131_Vcar_Vcar1000006131 M-FLRDEFRRV---------E--------------NELG-R----------------------QFTFDAAC-NDSGDN----SLCSRFASPSNSFFDTDVSG----E-FVWINPPYTHIKEWQQHYHRCKLKNPKT-T-S--AVFAVPK-WTQVECLMRSAGYQLLKTYAV---GTQLFSQPVDVGTR Vcar1000007918_Vcar_Vcar1000007918 ---------------------------------------------------------------------------GDN----SLCTRFASPSNSFLTSDVSG----E-FVWANPPYTKIELWHKHYLRCKRMRPET-T-S--AVFVVPK-WSQIERRMRAAGHQLLKTYAV---GTKLFLEKADDGSR Vcar1000012324_Vcar_Vcar1000012324 --LSCTVFLDL---------D--------------SQYG------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLSAQVDG----E-CVRMVPRTSTPSTFISQYLESKTTNPRT---S--AIIILPD-RPTAPWAPLIRHMTVVRRFPA---GARIVCRRDPSDAS 
Vcar1000014693_Vcar_Vcar1000014693 --IAQLLFLEY---------D--------------SQYG------------------------PFTVDAYC-DDLGLT----AQLSPFFSPSRPFLSTDIEG----E-CVWMVPPVDNASTNIARYLDAKTANPNT---S--AIIVLPD-RPQAPWGPLIRHMTIVRRFPA---GAQIVCRPLSSDPS Vcar1000005651_Vcar_Vcar1000005651 --IARSLFLEY---------D--------------SQYG------------------------PFTVDAYC-DDLGLT----AQLSPFFSPSRPFLSTDIEG----E-CVWMVPPVDNASTIVARYLDAKTASPNT---S--AIIVLPD-RPQAPWAPLIRHMTIVRRFPA---GAQIVCRPTTSDPS Vcar1000014193_Vcar_Vcar1000014193 ------------------------------------------------------------------------------------LSPFFSPSRPFLSTDIEG----E-CIWMVPPVDNVSTIVARYLDAKTANPKT---S--AIIVLPD-RPQAPWAPLIRHMTIVHRFPA---GAQIVCRPTTSDPS Vcar1000006954_Vcar_Vcar1000006954 ------------------------------------------------------------------------------------LSPFFSPSRPFLSTDIEG----E-CVWMVPPVDNASTIVARYLDAKTANPKT---S--AIIVLPD-RPQASWAPLIRHMTIVRRFPA---GAQIVCRPTSSDPS Vcar1000001334_Vcar_Vcar1000001334 --LTRAIFLDL---------D--------------SQYG------------------------PFTVDACC-D--GIN----AHVVPFFSPSRSFLSAQVDG----E-CVWMVPPTSSPSAFISQYLESKTTNPRT---S--AIIILPD-RPTAPWAPLIRHMTVMRRFPA---GAWIVCRRDPSDAS Vcar1000012130_Vcar_Vcar1000012130 --LARTVFLDL---------D--------------SQYS------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLSAQVDG----E-CVWMVPPTSTPSTFISQYLESKSTNPRT---S--AVIVLPD-RPTAPWTPLIRHMTVVRRFPA---GARIVCHRDPSDAS Vcar1000003269_Vcar_Vcar1000003269 --LSCTVFLDL---------D--------------SQYG------------------------PFTVDACC-DDFGIN----THVVPFFSPSRSFLSAQVDG----E-CVWMVPPTSTLSTFISQYLESKTTNPCTAPIS--TLVII----------------------------------------- Vcar1000012920_Vcar_Vcar1000012920 --LSRTVFLDL---------D--------------SQYG------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLSAHVDG----E-CVWMVPPTSSPSAFISQYLESKTTNPRT---S--AIIVLPD-RPTAPWAPLIRHMTVVRRFPT---GARIVCRRDPSDAS Vcar1000014935_Vcar_Vcar1000014935 
--LSRTVFLDL---------D--------------SQYG------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLSAQVDG----E-CVWMVPPTSSPSAFISQYLESKTTNPRT---S--AIIVLPD-RPTAPWAPLIRHMTVVRRFPA---GARIVCRRDPSDAS Vcar1000010947_Vcar_Vcar1000010947 --L---------------------------------HYG------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLSAQVDG----E-CVWMVPPTSSPSAFISQYLESKTTNPRT---S--AIIVLPD-RPTAPWAPLIRHMTVVRRFPA---GARIVCRRDPSDAS Vcar1000003043_Vcar_Vcar1000003043 --LSCTIFLDL---------D--------------SQYG------------------------PFTIDACC-DDFGIN-------VPFFSPPHSFLSAQVDG----E-CVWMVLPTLNPSAFISQYLESKTTNPCT---S--AIIVLPD-RPTAPWALLIHHMAIVRRFPA---GVQIFCRRDPSDAS Vcar1000010860_Vcar_Vcar1000010860 --VMGKAAEWY---------HDLFTHKGSMLTIQGMCDD-F----------------------VLLFSDEC-ATDANGNNWFAFDTLINFAQDSVTSHDLNG----Q-HIWCSTPADRVIPWLNNYSTFKQRTPDT-T-R--AVILVPK-CIHLEKEFQTRGWTLLKEFTK---NSRIFSEPKPGGGH Vcar1000008009_Vcar_Vcar1000008009 ----RSVFLRL---------Q--------------KASG-R----------------------VFTFDATC-NGGSD-----ALCPKFACSSSPIISHDVSG----Q-HVWCQPPPKCVNDWLDHYSACKQRSPES-T-S--AIFVVPK-CTQFEQTFQKRGWTLLKEFLS---DAHIFSVPKSGGGR Vcar1000014314_Vcar_Vcar1000014314 I-IDYELLRQL---------E--------------RRIG-R----------------------TFSLDAAA-NDDGSN----SVCKMFASPTRSFLNSDCSG----H-TIWMNPPMKLLSDFLRHYHRCKSYDPSI-S-SWTAPWASTA-WPRVNCLDVNCSMCRVLTFPA---KSVLFNGLSLEGKT Vcar1000000459_Vcar_Vcar1000000459 --VLRSVFLRL---------Q--------------KASG-R----------------------VFAIDASC-NVRSDN----SLCPIFAC-PDSFTSHNLSG----Q-HIWCNAPADRAIPWLNNYSTFKQRTPDT-T-S--AVILVPK-CAHLEKEFQTRGWTLLKEFTK---NSSIFSEPKPGGGR Vcar1000000870_Vcar_Vcar1000000870 --------ERY---------Q--------------LRQP-T----------------------AAAAEPAR-VTGPTA----ALQPQRGTRQDSFTSHNLSG----Q-HIWCNAPADRAIPWLNNFSTFKQRAPDT-T-S--AVILVPK-CAHLEKEFQTRGWTLLKEFTK---NSNIFSEPKPGGGR Vcar1000014202_Vcar_Vcar1000014202 M-FLPGEFRNV---------E--------------NMLG-R----------------------QFTFDTAC-NNSGDN----SLCTRFASLSSSFLTSDVSG----E-FPGFKPPYTKIELWHKHYLRCKRMRPET-T-S--AVFVVPK-WSQVERRRQGASGARTAGGKC---VPPVCGKHLPQMPR 
Vcar1000001335_Vcar_Vcar1000001335 --LSCTIFLDL---------Y--------------SQYG------------------------LFTVDACC-DDFGIN----AHIIPFFSPSCSFLSAQVDGLWFAY-SLWMFWAVR----------------------------------------------------------------------- Vcar1000010806_Vcar_Vcar1000010806 --VAPPRFVRR---PDGNCLW--------------SKYW---QGLGRSVHESEVASIATTMGREFTLDACA-SDCGLS----AVCNAFSCTARPFLDTNIAG----H-TVWMAPNAADLPAYVTHYRACKPLAPQS---T--AACILVP-SGTEP--SLLKGMKLVRRYPV---GTSLFYVPDVQGSR Vcar1000015077_Vcar_Vcar1000015077 --VAPPRFVRR---PDGNCLW--------------SKYW---QGLGRSVHESEVASIATTMGREFTLDACA-SDCGLS----AVCNAFSCTARPFLDTNIAG----H-TVWMAPNAADLPAYVTHYRACKPLAPQS---T--AACILVP-SGTEP--SLLKGMKLVRRYPV---GTSLFYVPDVQGSR Vcar1000003045_Vcar_Vcar1000003045 A-LANPVVLSKLSLPQPGA-W--------------STFLLSPPASGRYL-------VVRGPQRGATMTLCELQAYGES----AFEMELVPAPRP-------------------PQPPPMP-----FPSPQPPSPP-------------P-PPSPP--PSSKSSSRSREWPV---SKVSTNPTAVTAVT Vcar1000007748_Vcar_Vcar1000007748 --LARTIFLDL---------D--------------SQYG------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLGPSR-----------------------RRYLESKSTNPRT---S--AVIVLPD-RPTAPWTPLIRHMTVVRRFPA---GARIVCHRDPSDAS Vcar1000007858_Vcar_Vcar1000007858 --LARTVFLDL---------D--------------SQYG------------------------PFTVDACC-DDFGIN----AHVVPFFSPSRSFLSAQ---------------------------VDALKQHNRL---H--QPPLPPV-DENTP-TSRSTTLDMVKQWATDVGGGDCNSHDDSKGHL Vcar1000005961_Vcar_Vcar1000005961 AGIAPPRFVRR---PDGNCLW--------------SKYW---QGLGRSVHESEVASIATIMDREFTLDACA-SDCGLS----AVCNAFSCTARPFLDTNVAG----H-TVWMAPNAADLPAYVTHYRACKPLAPQS---T--AACILVP-SGTEP--SLLKGMKLVRRYPV---GTSLFYVPDVQGSR consensus/100% .......................................................................................b....ps.............................................................................................. consensus/95% ...........................................................................s...........h.s.spsh.s.p..........................a...b...Pp....p.....h......p................................... 
consensus/90% ................................................................hshss...ss.u.s....u.hs.a.ssspsFhs.pl.G....p..lbh.ss.........pY..sK...Ppp...o..Ahhhls....p.....b....p.h.pas....ss.lh....s.s.p consensus/85% ..h....h.p.........................p.b..........................FohDsss.ss.G.s....u.hs.FhssupsFhsspl.G....p..lWhsss.sp...h..pY.psK..pPpo...o..Ahhhls...sp.....b....pll+pass...ss.lh....s.ssp consensus/80% ..h..s.Fbph........................p.hs.........................FThDAss.ss.G.s....u.hs.FhSsupsFhssplsG....c..lWhsPP.sp...a.ppY.psK..sPpo...S..AlhhlP...sph....b...hpll+pass...usplh.p.sssssp consensus/75% ..h..s.Fbph.........p..............sphG.........................FThDAsC.ss.G.s....u.hs.FhSPopSFhssplsG....c..VWhsPP.sphp.a.ppY.csK..sPcT...S..AlhhlPc..sph....b...hpll+pass...Gsplhsp.sssssp consensus/70% ..h..s.Fbpl.........p..............sphG.........................FThDAsC.ss.G.N....u.hs.FhSPSpSFhssslsG....E.hVWhsPPhsphp.a.ppY.csK..sPcT...S..AlhhlPc.bsph....b...hpll+pass...Gsplhsp.sssssp
GI Domain architecture Pfam Gene_name len Taxonomy Species Genbank/other annotation # 1; Adig1000018270 - LMF1 Adig1000018270 178 eukaryota>cnidaria Acropora digitifera adi_v1.14096 Adig1000000319 - RVT_1 Adig1000000319 1180 eukaryota>cnidaria Acropora digitifera adi_v1.18602 Adig1000005510 RnaseH RNase_H+Dam+Phage_integrase Adig1000005510 720 eukaryota>cnidaria Acropora digitifera adi_v1.22623 Adig1000019867 - - Adig1000019867 132 eukaryota>cnidaria Acropora digitifera adi_v1.15446 Adig1000014879 - - Adig1000014879 599 eukaryota>cnidaria Acropora digitifera adi_v1.11174 Adig1000005157 CASPASE RVT_1 Adig1000005157 1007 eukaryota>cnidaria Acropora digitifera adi_v1.00366 Adig1000023250 - - Adig1000023250 374 eukaryota>cnidaria Acropora digitifera adi_v1.02813 Adig1000019364 - - Adig1000019364 374 eukaryota>cnidaria Acropora digitifera adi_v1.15024 Adig1000005444 CASPASE Dam Adig1000005444 970 eukaryota>cnidaria Acropora digitifera adi_v1.22572 Adig1000006995 NACHT+RnaseH RVT_1+RNase_H Adig1000006995 2418 eukaryota>cnidaria Acropora digitifera adi_v1.04684 Adig1000000407 - RVT_1 Adig1000000407 1291 eukaryota>cnidaria Acropora digitifera adi_v1.18677 Adig1000012708 RnaseH+ZNKNUCK RVT_1 Adig1000012708 747 eukaryota>cnidaria Acropora digitifera adi_v1.00852 Adig1000012208 - RVT_1 Adig1000012208 895 eukaryota>cnidaria Acropora digitifera adi_v1.00785 Adig1000005671 MYB RVT_1+Phage_integrase Adig1000005671 1221 eukaryota>cnidaria Acropora digitifera adi_v1.22768 Adig1000014194 SbcC - Adig1000014194 412 eukaryota>cnidaria Acropora digitifera adi_v1.10547 Adig1000010273 SWC3 YkyA Adig1000010273 349 eukaryota>cnidaria Acropora digitifera adi_v1.06943 384498610 - RVT_1+Dam RO3G_13812 370 eukaryota>fungi Rhizopus delemar RA 99-880 hypothetical protein RO3G_13812 [Rhizopus delemar RA 99-880]. 384486240 - Dam RO3G_03124 172 eukaryota>fungi Rhizopus delemar RA 99-880 hypothetical protein RO3G_03124 [Rhizopus delemar RA 99-880]. 
384495516 RnaseH RNase_H RO3G_10717 264 eukaryota>fungi Rhizopus delemar RA 99-880 hypothetical protein RO3G_10717 [Rhizopus delemar RA 99-880]. 384497823 - RVT_1+Dam RO3G_13025 1062 eukaryota>fungi Rhizopus delemar RA 99-880 hypothetical protein RO3G_13025 [Rhizopus delemar RA 99-880]. Mcir1000003087 - - Mcir1000003087 229 eukaryota>fungi>basal Mucor circinelloides Genemark1.3173_g Pbla1000001570 RnaseH RVT_1+DNA_pol_viral_C+Dam Pbla1000001570 570 eukaryota>fungi>basal Phycomyces blakesleeanus e_gw1.22.32.1 Amac1000015656 - - Amac1000015656 424 eukaryota>fungi>blastocladiomycota Allomyces macrogynus Allomyces macrogynus ATCC 38327 hypothetical protein (424 aa) Rall1000003656 - Dam Rall1000003656 306 eukaryota>fungi>cryptomycota Rozella allomycis O9G_006133m.01 Rall1000004614 - - Rall1000004614 193 eukaryota>fungi>cryptomycota Rozella allomycis O9G_005572m.01 Bcir1000007834 - - Bcir1000007834 192 eukaryota>fungi>mucoromycotina Backusella circina fgenesh1_pg.3_#_153 Bcir1000015312 - - Bcir1000015312 227 eukaryota>fungi>mucoromycotina Backusella circina fgenesh1_pg.351_#_5 Mver1000012212 - Dam Mver1000012212 214 eukaryota>fungi>zygomycete Mortierella verticillata Mortierella verticillata NRRL 6337 hypothetical protein (214 aa) Mver1000007812 RnaseH Dam Mver1000007812 408 eukaryota>fungi>zygomycete Mortierella verticillata Mortierella verticillata NRRL 6337 hypothetical protein (408 aa) Mver1000006934 - Dam Mver1000006934 281 eukaryota>fungi>zygomycete Mortierella verticillata Mortierella verticillata NRRL 6337 hypothetical protein (281 aa) 485639708 - Dam EMIHUDRAFT_252968 232 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_252968 [Emiliania huxleyi CCMP1516]. 
Sarc1000007323 - - Sarc1000007323 147 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (147 aa) Sarc1000003931 HPC2 - Sarc1000003931 111 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (111 aa) Sarc1000000340 - Dam Sarc1000000340 291 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (291 aa) Sarc1000008354 - Dam Sarc1000008354 611 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (611 aa) Sarc1000010122 - Dam Sarc1000010122 129 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (129 aa) Sarc1000005744 - - Sarc1000005744 206 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (206 aa) Sarc1000001093 - Dam Sarc1000001093 320 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (320 aa) Sarc1000009775 - Dam Sarc1000009775 395 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (395 aa) Sarc1000012860 - - Sarc1000012860 81 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (80 aa) Sarc1000000559 - - Sarc1000000559 181 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (181 aa) Sarc1000002310 - - Sarc1000002310 167 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (167 aa) Sarc1000000227 - - Sarc1000000227 158 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (158 aa) Sarc1000007502 - rve Sarc1000007502 346 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (346 aa) Sarc1000008574 - - Sarc1000008574 91 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (91 aa) Sarc1000004600 - - 
Sarc1000004600 341 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (341 aa) Sarc1000009974 - Spc7_N Sarc1000009974 333 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (333 aa) Sarc1000013043 - - Sarc1000013043 244 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (243 aa) Sarc1000011591 - - Sarc1000011591 77 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (77 aa) Smar1000013635 RnaseH RNase_H+Dam Smar1000013635 229 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR001291-PA pep:novel scaffold:Smar1:AFFK01015044:37482:38168:-1 gene:SMAR001291 transcript:SMAR001291-RA Smar1000013446 RnaseH - Smar1000013446 296 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR001465-PA pep:novel scaffold:Smar1:JH430561:19939:21493:-1 gene:SMAR001465 transcript:SMAR001465-RA Smar1000007923 - - Smar1000007923 77 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR006470-PA pep:novel scaffold:Smar1:JH431701:276342:276938:1 gene:SMAR006470 transcript:SMAR006470-RA Smar1000009144 - - Smar1000009144 159 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR005333-PA pep:novel scaffold:Smar1:JH431599:48277:48814:-1 gene:SMAR005333 transcript:SMAR005333-RA Smar1000001056 - - Smar1000001056 193 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR012323-PA pep:novel scaffold:Smar1:JH432129:50962:52819:1 gene:SMAR012323 transcript:SMAR012323-RA Smar1000003623 RnaseH RNase_H+Dam Smar1000003623 408 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR010103-PA pep:novel scaffold:Smar1:JH431960:112870:114093:-1 gene:SMAR010103 transcript:SMAR010103-RA Smar1000006848 - - Smar1000006848 89 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR015629-PA pep:novel scaffold:Smar1:JH431783:5867:6133:1 gene:SMAR015629 transcript:SMAR015629-RA Smar1000011252 - - 
Smar1000011252 183 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR003465-PA pep:novel scaffold:Smar1:AFFK01018421:2875:3482:-1 gene:SMAR003465 transcript:SMAR003465-RA Smar1000004446 RnaseH RVT_1 Smar1000004446 425 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR009389-PA pep:novel scaffold:Smar1:JH431896:51121:52792:1 gene:SMAR009389 transcript:SMAR009389-RA Smar1000007917 - RNase_H Smar1000007917 429 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR006464-PA pep:novel scaffold:Smar1:JH431701:250462:251748:1 gene:SMAR006464 transcript:SMAR006464-RA Smar1000004632 SUN F5_F8_type_C Smar1000004632 655 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR009255-PA pep:novel scaffold:Smar1:JH431878:386467:389171:-1 gene:SMAR009255 transcript:SMAR009255-RA 210086106 PIN+SET+CHASE3+MA+WXG+RNASE-EG+MA+LAMG+LamG+LEVANB+LamG+LAMG Laminin_B+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_B+Laminin_EGF+Laminin_EGF+Laminin_I+Flagellar_rod+MAD+Myosin_tail_1+Troponin+Laminin_II+Laminin_G_1+Laminin_G_2+Laminin_G_1+Laminin_G_2 BRAFLDRAFT_131954 2475 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_131954 [Branchiostoma floridae]. 313236395 CDC27 RVT_1+GRP+FoP_duplication+Phage_integrase+DUF3807 GSOID_T00016890001 1568 eukaryota>metazoa>chordata Oikopleura dioica unnamed protein product [Oikopleura dioica]. 313244116 - RVT_1 GSOID_T00010067001 725 eukaryota>metazoa>chordata Oikopleura dioica unnamed protein product [Oikopleura dioica]. 
210086104 DISCOIDIN+PIN+SET+MA+MODE-HTH+NIT+XpaC+sigma+CHASE3+MODE-HTH+MA+WXG+BZIP+LamG+LamG+LamG+LamG+LamG Acyl-CoA_ox_N+Laminin_N+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_B+Laminin_B+Laminin_EGF+VSP+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_EGF+Laminin_B+Laminin_EGF+Laminin_EGF+Laminin_I+Myosin_tail_1+AAA_27+AAA_27+MAD+Myosin_tail_1+PspA_IM30+Cast+Lebercilin+FadA+Troponin+Laminin_II+Laminin_G_1+Laminin_G_2+Laminin_G_2+Herpes_BLLF1+Herpes_BLLF1+Pneumo_att_G+Laminin_G_1+Laminin_G_2 BRAFLDRAFT_131952 3505 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_131952 [Branchiostoma floridae]. 313235579 - - GSOID_T00014774001 357 eukaryota>metazoa>chordata Oikopleura dioica unnamed protein product [Oikopleura dioica]. 327268405 TPR+TPR+TPR+TPR+TPR+JOR TPR_11+TPR_11+TPR_12+TPR_11+TPR_17+TPR_11+TPR_11+TPR_1+TPR_1+Herpes_BLLF1+JmjC LOC100564779 1580 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: lysine-specific demethylase 6A-like [Anolis carolinensis]. 327286446 RnaseH+TBC RVT_1+RabGAP-TBC LOC100566709 1049 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: hypothetical protein LOC100566709 [Anolis carolinensis]. 327274991 NLPC LRAT LOC100563610 382 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: hypothetical protein LOC100563610 [Anolis carolinensis]. 125838616 RnaseH RVT_1 LOC100005823 560 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: reverse transcriptase/ribonuclease H/putative methyltransferase-like [Danio rerio]. 189519778 RnaseH RVT_1 LOC100003059 790 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase [Danio rerio]. 189546720 - RVT_1 LOC100149602 762 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase [Danio rerio]. 
189516844 RnaseH RVT_1+RNase_H LOC558928 684 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: reverse transcriptase/ribonuclease H/putative methyltransferase-like [Danio rerio]. 17066696 RnaseH RVT_1+Dam - 785 eukaryota>metazoa>chordata>vertebrata>actinopterygii Tetraodon nigroviridis reverse transcriptase/ribonuclease H/putative methyltransferase, partial [Tetraodon nigroviridis]. 125850303 RnaseH RVT_1+RNase_H LOC561204 892 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase [Danio rerio]. 125835610 RnaseH RVT_1 LOC100008043 684 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase [Danio rerio]. 156209094 - Dam NEMVEDRAFT_v1g220590 131 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein [Nematostella vectensis]. 156219436 - - NEMVEDRAFT_v1g208020 360 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein [Nematostella vectensis]. 156216881 STYKIN Pkinase_Tyr+Dam NEMVEDRAFT_v1g211073 426 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein [Nematostella vectensis]. 156209473 - DUF640 NEMVEDRAFT_v1g220156 672 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein [Nematostella vectensis]. 321452244 - - DAPPUDRAFT_267974 433 eukaryota>metazoa>crustacea Daphnia pulex hypothetical protein DAPPUDRAFT_267974 [Daphnia pulex]. 170819710 SRDOMAIN Herpes_BLLF1+RVT_1 - 1291 eukaryota>metazoa>crustacea Daphnia pulex reverse transcriptase [Daphnia pulex]. 321459175 - Dam DAPPUDRAFT_257338 330 eukaryota>metazoa>crustacea Daphnia pulex hypothetical protein DAPPUDRAFT_257338 [Daphnia pulex]. 170819724 RnaseH RVT_1 - 757 eukaryota>metazoa>crustacea Daphnia pulex reverse transcriptase [Daphnia pulex]. 
115628917 - - LOC582271 229 eukaryota>metazoa>echinodermata Strongylocentrotus purpuratus PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase [Strongylocentrotus purpuratus]. 291220884 - DUF829 LOC100368707 403 eukaryota>metazoa>hemichordata Saccoglossus kowalevskii PREDICTED: hypothetical protein [Saccoglossus kowalevskii]. 291236647 ZZ-like - LOC100378332 667 eukaryota>metazoa>hemichordata Saccoglossus kowalevskii PREDICTED: hypothetical protein [Saccoglossus kowalevskii]. 291232955 - Dam+Phage_integrase LOC100369420 685 eukaryota>metazoa>hemichordata Saccoglossus kowalevskii PREDICTED: predicted protein-like [Saccoglossus kowalevskii]. 156542171 RnaseH RVT_1 LOC100116731 585 eukaryota>metazoa>hexapoda Nasonia vitripennis PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase [Nasonia vitripennis]. 307196129 RnaseH RVT_3+PCIF1_WW EAI_17025 251 eukaryota>metazoa>hexapoda Harpegnathos saltator hypothetical protein EAI_17025, partial [Harpegnathos saltator]. 156538873 - RVT_1 LOC100115061 405 eukaryota>metazoa>hexapoda Nasonia vitripennis PREDICTED: hypothetical protein, partial [Nasonia vitripennis]. 156542658 - Phage_integrase LOC100121360 1054 eukaryota>metazoa>hexapoda Nasonia vitripennis PREDICTED: similar to tyrosine recombinase [Nasonia vitripennis]. 156539065 - RVT_1+Phage_int_SAM_1 LOC100117226 787 eukaryota>metazoa>hexapoda Nasonia vitripennis PREDICTED: similar to reverse transcriptase/ribonuclease H/putative methyltransferase, partial [Nasonia vitripennis]. 156546508 - Phage_integrase LOC100123785 389 eukaryota>metazoa>hexapoda Nasonia vitripennis PREDICTED: similar to tyrosine recombinase [Nasonia vitripennis]. 307212135 - - EAI_06111 213 eukaryota>metazoa>hexapoda Harpegnathos saltator hypothetical protein EAI_06111, partial [Harpegnathos saltator]. 
307193617 - Dam EAI_10577 149 eukaryota>metazoa>hexapoda Harpegnathos saltator hypothetical protein EAI_10577, partial [Harpegnathos saltator]. 307201692 - DNA_pol_viral_C+Dam EAI_09447 251 eukaryota>metazoa>hexapoda Harpegnathos saltator hypothetical protein EAI_09447, partial [Harpegnathos saltator]. 307198641 - Dam EAI_12430 155 eukaryota>metazoa>hexapoda Harpegnathos saltator hypothetical protein EAI_12430, partial [Harpegnathos saltator]. 307183886 - DNA_pol_viral_C EAG_00458 240 eukaryota>metazoa>hexapoda Camponotus floridanus hypothetical protein EAG_00458, partial [Camponotus floridanus]. 156542106 - Phage_integrase LOC100115034 858 eukaryota>metazoa>hexapoda Nasonia vitripennis PREDICTED: similar to tyrosine recombinase [Nasonia vitripennis]. Aque1000012689 - Dam Aque1000012689 284 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.212957 Aque1000016213 - Phage_int_SAM_1 Aque1000016213 334 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.216481 Aque1000017746 - - Aque1000017746 158 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.218014 Aque1000024217 - - Aque1000024217 222 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.224485 Aque1000022247 - Dam Aque1000022247 129 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.222515 Aque1000026010 - - Aque1000026010 147 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.226278 Aque1000015330 - - Aque1000015330 199 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.215598 Aque1000019121 - - Aque1000019121 220 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.219389 Aque1000008102 - - Aque1000008102 232 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.208349 Aque1000012578 - - Aque1000012578 221 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.212846 Aque1000018252 RnaseH RVT_1+Dam Aque1000018252 708 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.218520 Aque1000010245 - Dam Aque1000010245 256 eukaryota>metazoa>porifera Amphimedon 
queenslandica Aqu1.210513 Aque1000019993 - - Aque1000019993 556 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.220261 Aque1000019816 - - Aque1000019816 230 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.220084 Aque1000001701 - Phage_int_SAM_1 Aque1000001701 312 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.201769 Aque1000022602 - RVT_1+Dam Aque1000022602 350 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.222870 Aque1000008788 - RVT_1 Aque1000008788 577 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.209041 Aque1000011876 - - Aque1000011876 79 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.212144 159465941 Phd-YefM+RnaseH RVT_1+RNase_H CHLREDRAFT_180868 1199 eukaryota>viridiplantae>chlorophyta Chlamydomonas reinhardtii hypothetical protein CHLREDRAFT_180868 [Chlamydomonas reinhardtii]. 22415757 - RVT_1 ORF-B 829 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis reverse transcriptase [Volvox carteri f. nagariensis]. 
CrRemI subclade
GI | Domain architecture | Gene_name | len | Species | Class | Genbank annotation
Vcar1000015077 | N6-MTase | Vcar1000015077 | 707 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_231000001
Vcar1000014693 | N6-MTase | Vcar1000014693 | 671 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | estExt_fgenesh4_pg.C_1190017
Vcar1000010806 | N6-MTase | Vcar1000010806 | 511 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_83000041
Vcar1000005651 | N6-MTase | Vcar1000005651 | 483 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_107000001
Vcar1000014193 | N6-MTase | Vcar1000014193 | 466 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_80000013
Vcar1000007748 | N6-MTase | Vcar1000007748 | 460 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_102000028
Vcar1000001334 | N6-MTase | Vcar1000001334 | 443 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | estExt_fgenesh4_pg.C_120014
Vcar1000007858 | N6-MTase | Vcar1000007858 | 397 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | estExt_fgenesh4_pg.C_300114
Vcar1000012920 | N6-MTase | Vcar1000012920 | 372 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_95000030
Vcar1000014935 | N6-MTase | Vcar1000014935 | 321 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_273000001
Vcar1000003269 | N6-MTase | Vcar1000003269 | 313 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_25000154
Vcar1000013363 | N6-MTase | Vcar1000013363 | 311 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | e_gw1.101.32.1
Vcar1000003043 | N6-MTase | Vcar1000003043 | 302 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_23000205
Vcar1000012324 | N6-MTase | Vcar1000012324 | 287 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_22000129
Vcar1000000870 | N6-MTase+pepsin | Vcar1000000870 | 267 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | estExt_fgenesh4_pg.C_10481
Vcar1000014369 | N6-MTase | Vcar1000014369 | 252 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | e_gw1.112.21.1
Vcar1000006797 | N6-MTase | Vcar1000006797 | 251 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | estExt_Genewise1Plus.C_190101
Vcar1000013547 | N6-MTase | Vcar1000013547 | 251 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | gw1.120.3.1
Vcar1000003571 | N6-MTase | Vcar1000003571 | 250 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | gw1.10.241.1
Vcar1000006131 | N6-MTase | Vcar1000006131 | 250 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | e_gw1.31.78.1
Vcar1000006689 | N6-MTase | Vcar1000006689 | 250 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | gw1.75.44.1
Vcar1000010818 | N6-MTase | Vcar1000010818 | 250 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | estExt_Genewise1.C_830117
Vcar1000011421 | N6-MTase | Vcar1000011421 | 250 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | gw1.116.24.1
Vcar1000012306 | N6-MTase | Vcar1000012306 | 250 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | gw1.61.90.1
Vcar1000012957 | N6-MTase | Vcar1000012957 | 250 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | gw1.105.40.1
Vcar1000013369 | N6-MTase | Vcar1000013369 | 250 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | gw1.101.33.1
Vcar1000013410 | N6-MTase | Vcar1000013410 | 250 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | gw1.111.24.1
Vcar1000010947 | N6-MTase | Vcar1000010947 | 225 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | estExt_fgenesh4_pg.C_410059
Vcar1000007918 | N6-MTase | Vcar1000007918 | 224 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | e_gw1.11.237.1
Vcar1000014202 | N6-MTase | Vcar1000014202 | 213 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | estExt_Genewise1Plus.C_800030
Vcar1000012130 | N6-MTase | Vcar1000012130 | 160 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_57000068
Vcar1000000459 | N6-MTase | Vcar1000000459 | 159 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | estExt_Genewise1Plus.C_10079
Vcar1000001335 | N6-MTase | Vcar1000001335 | 151 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_12000015
Vcar1000014314 | N6-MTase | Vcar1000014314 | 150 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | estExt_Genewise1.C_1320001
Vcar1000008009 | N6-MTase | Vcar1000008009 | 139 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | gw1.11.234.1
Vcar1000006954 | N6-MTase | Vcar1000006954 | 346 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_9000001
Vcar1000010860 | N6-MTase+RT | Vcar1000010860 | 569 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_89000033
Vcar1000003045 | N6-MTase | Vcar1000003045 | 2927 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | fgenesh4_pg.C_scaffold_23000206
Vcar1000005961 | N6-MTase | Vcar1000005961 | 1336 | Volvox carteri | eukaryota>viridiplantae>chlorophyta | estExt_fgenesh5_synt.C_620028
29423694 | N6-MTase | - | 258 | Chlamydomonas reinhardtii | eukaryota>viridiplantae>chlorophyta | pol protein [Chlamydomonas reinhardtii].
159468287 | N6-MTase | - | 171 | Chlamydomonas reinhardtii | eukaryota>viridiplantae>chlorophyta | hypothetical protein CHLREDRAFT_171026 [Chlamydomonas reinhardtii].
29423677 | N6-MTase+pepsin+RT | - | 776 | Chlamydomonas reinhardtii | eukaryota>viridiplantae>chlorophyta | reverse-transcriptase [Chlamydomonas reinhardtii].
consensus/100% .......................................................................D...s.................................s.h.....h.......................................................................................................h....s....................................................................................................................................................................................... consensus/95% .........................................b.pP..b...h..................lDss.s.............s...hs..............suL.....W...........hhhNPPa...........................................ah.+h...................................lhl....s.sp...ha.........................h.....ch.b...............................................s................hhhh........................................................................ consensus/90% .................................p.......a.TP..hhp.l.....h...........pLDss.u.............s.p.as..............suL...p.W...........hahNPPa..........................s................ah.+h..p.........s......................lhLh...s.os.s.hap..h...................h.h.h.p.Rl.F...............................................sshs.............lhha........................................................................ consensus/85% ...............b.................p.......a.TP..hhp.l.....h..........hsLDss.u.............s.p.as..............sGL...p.W...........hahNPPY..........................sp......h........ah.+h.pp.........s.....................hlhLl...spT-.s.aapphh...................l.a.l.c.RlpF........................................s......sshss............lhha........................................................................ 
consensus/80% ...............h.................pp......W.TP..hhc.lp....h..........hsLDss.u..p......ps..sppaao.......p......sGL...p.W...........hahNPPY..........................up......l........ah.+hhpp.........s.....................hlhLl...scT-.s.aapphh...................l.a.l.+uRlpF........s.......................p.......s......ushss...........hllha........................................................................ consensus/75% ...............a.................ps......W.TP..hhc.Ls....a..........hsLDss.u.sp......ss.psppaaT.......pp.....DGL...ppW...........hahNPPY..........................up......l.......pWlpKhhpp.........s.....................hlhLl..ssRTD.spaapchh...................lpF.l.+GRl+F........s..................p....s.......s......ushss...........hllla........................................................................ consensus/70% ...............a.................ss......W.TPpphh-bLsp...a..........hsLDsC.ussp......ss.psp+aaT.......cp.....DGLp..ppW..........plahNPPY..........................uc.p....l.......cWlpKuhpp.......p.u.....................lVhLl..PuRTD.opaapchh...........b.......lpF.l.+GRL+F........u.............s....p....s.......s......APhss...........hllla........................................................................ 
Annot Str(-1) Str-1 Str-2 Str-4 Str-5 Str-6 Str-7 FINAL ---------------------------------------------HHHHHHHHH-H------------------------------H-HHHH----------------------------------E-EEEE--------------------------------------H-----HHHHHHHHHHH--H-H-------E-----------------EEEEEE--E-----HHHHHHHH-----------H-HH----HHH-H-H---EE----------------------------------------------------EEEEE----EEEEEE--------------------------------------------HHHH-HH-HH-------- ALIGN -----------------------------------------------HHHH--H-H-H----------------------------------EEE-------------------------------E-EEE--------------------------------------HH-----HHHHHHHHHHH--H---------------------------EEEEEE--E------HHHHHHH-----------H-HH----HHH-H-H---EE-----------------------------------------------------------EEEEEEEE-E-----------------------------------------------EE-E--------- HMM --------------EEE---------------------------HHHHHHHHHH-H-------------EEE-------------------EEEEE--------------------------------EEEEEE------------------------------------------HHHHHHHHHHH--H-H-H-----E-----------------EEEEEE--E---H-HHHHHHHH-----------H-H-----HHH-H-HH-HEEE---------------------------------------EE-----------------EEEEEEE--------------------------------------------HHHHH-HH-HHH--H---- FREQ ---------------------------------------------HHHHHHHHH---------------------------H---HH-HHHH-----------------------EEE--------E-EEE---------------------------------------H-----HHHHHHHHHHH--H-H-------E-----------------EEEEEE----------HHHHHH-----------H-HH----HHH-H-H---EE---------------------------------------------------EEEEEE----EEEEEE-----------------------------------------------E-EE-EE-------- PSSM 
---------------------------------------------HHHHHHHHH-H-H--------------------------------------------------------------------E-EEEE--------------------------------------H-----HHHHHHHHHHH--H-H-------------------------EEEEEE--------HHHHHHHH-----------H-----------E----EE--------------------------------------------------------------EEEEE---------------------------------------------HHHH-HH-HH-------- VOLCADRAFT_104970_Volvox_carteri_f_nagariensis_302839284 GG-N-V-------RLFLSS-------------ESP----E-WFTPLSIIELVHE-V-FTPGG------INLDSC-SSAAANT---RV-GATAYYDM------ES-----DGLLECNAW-M------G-NVFVNPPF--------------------------GV-HGGASY-----QSLFFQRCATE--Y-M-AG-R-IH-----------------QAVLLL--KAAVG-YAWFDAIL-----------Q-WP----VCF-L-RQRLAFV-------------------R----R-Q--SGSQQQEGGPLTWGVRVANPHG---------SVVVYMGP-----------------------------------------DVQRF-VS-VFG--C--MG-----------\ParB-HTH fused VOLCADRAFT_100579_Volvox_carteri_f_nagariensis_302855367 GG-N-V-------RLFLSS-------------ESP----E-WFTPLSIIELVRE-V-FTPGR------IDLDPC-SSAAANT---RV-GATVYYDM------ES-----DGLLECNAW-M------G-NVFVNPPF--------------------------GV-RGGASY-----QSLFFQRCATE--Y-T-AG-R-IH-----------------QAVLLL--KSAVG-YAWFDAIL-----------Q-WP----VCF-L-RQRLAFV-------------------R----G-Q--SGSQQQEGGPLTWGARVANPHG---------SVVVYMGP-----------------------------------------DVQRF-VS-VFG--C--MG-----------| VOLCADRAFT_104840_Volvox_carteri_f_nagariensis_302838546 YG-G-L-------RVFMQS-------------DTC----E-WYTPDFILDLVRE-L-FTPGC------IDLDPC-SCAAANT---RV-RATSFYDE------AT-----DGLAEGSAW-R------G-N----PAF--------------------------GV-RRGQSL-----QGLFFGRCMRE--Y-Q-AG-N-VR-----------------QAVVLLILKAGIG-YSWFNDVL-----------N-WP----VCF-L-REHLSFV-------------------R----Q-V--GTS-----DELQWGARAQNPHG---------SVIVYMGP-----------------------------------------AVERF-AT-LFS--R--IG-----------| VOLCADRAFT_106473_Volvox_carteri_f_nagariensis_302846292 
AG-S-R-------PIFLQS-------------ASV----E-WYTPQCILDKVAE-M-FGPGG------IDLDPC-SSEAANT---RV-KAGRFFDV------AL-----DGLSEACRW-E------G-NVFVNPPF--------------------------GS-RGVLSM-----QNLFFERCVKE--Y-R-QG-A-VK-----------------QAVVLL--KAAVG-YKWFRAVL-----------E-WP----VCF-L-WERLAFV-------------------Q----P-Q--HTSVGEE-SELKWGSRVQNPHG---------SVVVYLGT-----------------------------------------NVDKF-VR-IFG--D--IG-----------| VOLCADRAFT_108225_Volvox_carteri_f_nagariensis_302854263 AG-S-R-------PIFLQS-------------ASV----E-WYTPQCILDKVAE-M-FGPGG------IDLDPC-SSEAANT---RV-KAGRFFDV------AL-----DGLSEACRW-E------G-NVFVNPPF--------------------------GS-RGVLSM-----QNLFFERCVKE--Y-R-QG-A-VK-----------------QAVVLL--KAAVG-YKWFRAVL-----------E-WP----VCF-L-WERLAFV-------------------Q----P-Q--HTSVGEE-SELKWGSRVQNPHG---------SVVVYLGT-----------------------------------------NVDKF-VR-IFG--D--IG-----------| VOLCADRAFT_91459_Volvox_carteri_f_nagariensis_302838997 YG-G-L-------PVITRS-------------DTC----E-WYTPDFILDLVRE-L-FTPGC------IDLDPC-SCAAANT---RV-RATSFYDE------AT-----DGLAEGNAW-R------G-NVFLNPAF--------------------------GV-RRGQSL-----QELFFGRCKRE--Y-Q-AG-N-VR-----------------QAVVLL--KAGIG-CSWFNDVL-----------N-WP----VCF-L-RERLSFV-------------------R----Q-V--GTS-----DELQWGARALNPHG---------SVIAYMGP-----------------------------------------AVERF-AT-LFS--R--IG-----------| VOLCADRAFT_104908_Volvox_carteri_f_nagariensis_302838722 AG-S-R-------PIFLQS-------------ASV----E-WYTPQCILDKVAE-M-FGPGG------IDLDPC-SSEAANT---RV-KAGRFFDV------AL-----DGLSEACRW-E------G-NVFVNPPF--------------------------GS-RGVLSM-----QNLFFERCVKE--Y-R-QG-A-VK-----------------QAVVLL--KAAVG-YKWFRAVL-----------E-WP----VCF-L-WERLAFV-------------------Q----P-Q--HTSVGEE-SELKWGSRVQNPHG---------SVVVYLGT-----------------------------------------NVDKF-VR-IFG--D--IG-----------| VOLCADRAFT_118198_Volvox_carteri_f_nagariensis_302842945 
AR-D-W-------QDYVSP-------------DSE----Y-YATPPYILTAVRK-L-YG-GA------IDLDPA-SDEKANE---AV-QAAKFYTA------EE-----DGLSPELPW-S------G-KIFINPPS--------------------------GI-VGSEPL-----QGLFFNRAIRE--A-AVAP-T-IT-----------------ECVILL--KAAVG-QRWFGPVF-----------D-HP----HCW-L-AERTVKK-------------------GA---A-A--AAAAAGGGGNGGDGGGGKGPRG---------MVVVYVGR-----------------------------------------RVQDF-CN-AFG--E--LG-----------| VOLCADRAFT_106408_Volvox_carteri_f_nagariensis_302845993 GG-N-V-------RLFLSS-------------ESL----E-WFTPLSIIELVRE-V-FTPGR------INLDPC-SSAAANT---RV-GATVYYDM------ES-----DGLLECNTW-M------G-NVFVNLPF--------------------------GV-HGGASY-----QSLFFQRCATE--Y-T-AG-R-IH-----------------QAVLLL--KAALCLFSGDASAL-----------T-NF----ICV-L-PCSGDTS----------------------------------------T-----FTNFI---------CSLAWSGDS---------------------------------------SALTTF-IC-SLA--C--SG-----------| BATDEDRAFT_85509_Batrachochytrium_dendrobatidis_JAM81_575474002 TG-V-S-------FTELNN--------------------E-LYTPKAIIMAAKK-V-IEKKQ------FDLDPA-SCAFANTLHGDT-IANTIYTE------AE-----DGLQ--KIW-N------G-HVWLSPPS--------------------------GI-DEAGLI-R---MKKWFLAAESK--Y-L-AG-E-IV-----------------SCHILL--RVDMQ-NDWFLRAL-----------Y-YP----HCF-F-HERIQFS---------------------------------------T---------PT---------GREKLLTD-----------------S----------HMLVYMG-----TNTERF-CI-QFA--Q--LG-----------/ Ot07g02900_Ostreococcus_tauri_308806169 VG-L-T-------PDWIVH-------------ATC-KVFE-MDLPTIEAPLIK---------------GLLDPC-TNSHLRP---NI-PAEKCYDK------KD-----DGLKMENSW-E------GYHVLVNPPY--------------------------EA-Q----V-----QWRFINRAINE--V-E-WE-R-CP-----------------GVILVC--RNSTD-TSYFQRLL-----------P-FP----RIH-L-RRTAVQF---------------------K----D-Y--S-------H------CPVGF---------GICVFCIV-----------------SP--T--NPK-QA----------EMYHRF-YD-EFH--Q--SG-----------\Fused to multiple chromatinic domains OT_ostta07g03040_Ostreococcus_tauri_693499233 
VG-L-T-------PDWIVH-------------ATC-KVFE-MDLPTIEAPLIK---------------GLLDPC-TNSHLRP---NI-PAEKCYDK------KD-----DGLKMENSW-E------GYHVLVNPPY--------------------------EA-Q----V-----QWRFINRAINE--V-E-WE-R-CP-----------------GVILVC--RNSTD-TSYFQRLL-----------P-FP----RIH-L-RRTAVQF---------------------K----D-Y--S-------H------CPVGF---------GICVFCIV-----------------SP--T--NPK-QA----------EMYHRF-YD-EFH--Q--SG-----------| F751_3154_Auxenochlorella_protothecoides_760440511 MG-L-T-------PDWIIE-------------TVCFGVFG-LQRPTAEVPFIK---------------GLLDPC-SNSRVAP---NI-PAEVLYDKHVGAGVED-----NGLALKNEW-K------GFYILLNPPF--------------------------HS-Q----M-----QWRFVNRAIDA--V-E-RG-E-VP-----------------GVLLVC--RNSTD-ANYFQRLR-----------P-YP----RVL-L-GRKCALF---------------------K----D-Y--D-------K------SPNGF---------GIAAVMLA-----------------K---R--E---RT----------DLYLRF-YD-AFE--R--FG-----------|| MICPUCDRAFT_57353_Micromonas_pusilla_CCMP1545_303277723 DG-L-T-------PDWIVD-------------AAC-RVFC-LNVPTVDEPIIR---------------GLLDPC-TNNKRRP---NI-PAEKTFDK------KQ-----DGLKQENEW-K------GYHVVLNPSY--------------------------ES-Q----V-----QWRFINRAINE--V-E-WG-F-CP-----------------GILLVC--RNSTD-TSYFQRLH-----------P-FP----RIF-L-RRDAIRF---------------------K----D-Y--D-------N------TPIGF---------GIAVFCMV-----------------APTVT--KSE-KL----------ETYRRF-YD-EFS--H--AG-----------| OSTLU_87805_Ostreococcus_lucimarinus_CCE9901_145349057 VG-L-T-------PDWIVH-------------GAC-KVFG-LDLPTIEAPLIK---------------GLLDPC-TNSHLRP---NI-PAEKCYDK------KD-----DGLKMSNPW-A------GYHVLVNPPY--------------------------EA-Q----V-----QWRFINRAINE--V-E-WE-H-CP-----------------GIILVC--RNSTD-TSYFQRLL-----------P-FP----RIH-L-RRKAVQF---------------------K----D-Y--S-------N------CPIGF---------GICVFCIV-----------------SP--T--HPQ-QA----------DIYSRF-HD-EFH--A--AG-----------| VOLCADRAFT_89771_Volvox_carteri_f_nagariensis_302835622 
MG-L-T-------PDWIIM-------------AAAFKVFQ-LPRPTASAPYIR---------------GLLDPC-TNSKANP---NI-PAEKLYDK------SD-----DGLKLSNSW-S------GYHVILNPEY--------------------------TS-Q----T-----QWRFVNRAIDE--V-E-NG-C-VP-----------------AVLLLC--RNSTD-TAYFQRLR-----------P-YP----RVM-L-KRTSARF---------------------K----D-Y--E-------K------TPIGF---------GIAVFCIA-----------------P---R--GPG-RT----------TLYRRF-ID-AFG--D--WG-----------| Bathy04g03050_Bathycoccus_prasinos_612396523 EG-L-T-------PDWIIE-------------GCY-KVFG-LEKPTVEVPFVK---------------DLLDPC-TNSLSNP---NI-PAEVLYDK------SI-----NGLLSKNSW-A------NKFALLNPPY--------------------------ET-Q----T-----QWRFIHRAINE--V-E-WG-F-SK-----------------GILLVC--RNSTD-TNYFQKLL-----------P-FP----RVM-L-RRNAVQF---------------------K----D-Y--T-------S------SPIGF---------GIAVFCMV-----------------AP--G--NPEVQR----------ETYARF-YD-EFA--S--AG-----------| MICPUN_55980_Micromonas_sp_RCC299_255071987 DG-L-T-------PDWIID-------------AGC-RIFG-LNVPTVQEPIVK---------------GLLDPC-TNDKRDP---NI-PAEKTYDK------RQ-----DGLKQENPW-K------GYHVILNPSY--------------------------ES-Q----V-----QWRFINRAINE--V-E-WG-F-CP-----------------GILLVC--RNSTD-TSYFQRLL-----------P-FP----RIF-L-RRDAVRF---------------------K----D-Y--T-------H------TPIGF---------GIAVFCLV-----------------SPIVT--PEE-KM----------ATYSRF-YN-EFR--H--AG-----------| H632_c3034p0_Helicosporidium_sp_ATCC_50920_633905054 FG-L-T-------PDWIIS-------------AACFDVLQ-LARPTPERPFIR---------------GLLDPC-SNSLLAP---NI-PAERLYDR------AA-----DGLSAANPW-R------GFHVLLNPPF--------------------------SA-Q----M-----QWRFVNRAIDA--V-E-ND-E-VP-----------------AVVLLC--RNSTD-AGFFQRLR-----------P-YP----RVL-L-RRKSAHF---------------------K----D-Y--E-------K------TPIGF---------GIAVFMLA-----------------K---E--S---RI----------HLYERF-LK-TFE--R--AG-----------| COCSUDRAFT_83615_Coccomyxa_subellipsoidea_C-169_545372676 
MG-L-T-------PDWIIQ-------------AASFVVFR-LPRPTPEQPFIA---------------GLLDPC-TNSMVAP---NI-PAQVLYDK------KM-----NGLLMSNSW-A------GFHVLLNPDY--------------------------SA-A----T-----QWRFVNRAIDE-----------VP-----------------AVLLVC--RNSTD-TAYFQRLR-----------P-YP----RVM-L-RRGNARF---------------------K----D-Y--D-------K------TPIGF---------GVAVFCIA-----------------K---A--P---AT----------ELYERF-FD-GFA--A--MG-----------| CHLNCDRAFT_138470_Chlorella_variabilis_552817679 MG-L-T-------PDWIVE-------------AAAFRVFG-LERPTAERPYIA---------------GLLDPC-TNSKLAP---NI-PAESLYDK------QPRAAQDNGLKLSNSW-Q------GRYVLLNPDY--------------------------RA-Q----V-----QWRFVNRAIDE--V-E-NG-G-VP-----------------AVVLVC--RNSTD-TGYFQRLR-----------P-YP----RVL-L-RRLSARF---------------------K----D-Y--E-------K------TPIGF---------GIAVFCIA-----------------K---S--NVR-RV----------ELYSRF-YD-AFE--G--MG-----------/ Npun_F2574_Nostoc_punctiforme_PCC_73102_186465327 SH-S-E-PC----PPPPK--------------ESD----K-WYTPPNIQDLLTQ-V-L---GA-----VDLDPC-ADDG------KHIKAANHYTA------SD-----DGLA--QEW-Y------G-RVFMNPPY--------------------------SC------------PGKWMAKLQAE--I-E-AG-R-VT-----------------EAIALV--PAATD-TNWLHPLL-----------D-TQP---ICF-W-KGRIKF--------LD------------T----N----Y-------Q------PKLSARQ-------SHCLLYW------G----------------------------------TNAQKF-KQ-VFD--E--VG-----------\Prokaryotic homologs VR70_RS06925_Rhodospirillaceae_bacterium_BRH_c57_783127364 MD-R-G-KH----GLFVN-LEP-G------Q-RNT----E-WYTPDWILNPLYE-A-MGNQP------FDLDPC-SPIK-GPDA-PV-WAKKHFTR------KD-----DGLS--QDW-H------G-RVWLNPPY--------------------------AS------------LADWIRKAADA--T-W-CR-N--MRNPPTEESAQREHPLCESVVALI--PARTH-TVYWQDYI-----------T-NHA-R-VLF-L-HGKIGF----RMP-TP-------E----G-LV-Q----A-------K------TQFPE---------GLAFVIW------------------------------------------GNHR-----PFT--Q------AL-------| VR70_RS05265_Rhodospirillaceae_bacterium_BRH_c57_783126461 
MD-R-G-KH----GLFVN-LEP-G------Q-RNT----E-WYTPKWILEPLYK-A-MGNQP------FDLDPC-SPIK-GPNS-PV-WAKKHFTK------DD-----DGLN--QNW-H------G-RVWLNPPY--------------------------AS------------LADWIRKAADA--T-W-CS-S--MPNPPTEEAGLREQPLCESVVALI--PARTH-TVYWQDYI-----------T-NHA-R-VIF-I-HGKIGF----LMP-TP-------E----G-LV-Q----A-------K------TQFPE---------GLAFVIW------------------------------------------GNHR-----PFT--D------AL-------| Riv7116_4895_Rivularia_sp_PCC_7116_427373349 RG-Q-G-SL----SNKSL--------------SSD----E-WYTPPHISDLVTQ-V-L---GQ-----ITLDPC-ADEG------KHIRAAQHYTV------LD-----DGLI--QEW-N------G-RIFMNPPY--------------------------SA------------PSVWIKKLQAE--F-E-SG-R-VT-----------------EAIALV--PAATD-TRWLSPLL-----------K-SQP---VCF-W-TGRIKF--------LD------------M----S----Y-------K------PRLSARQ-------SHCLVYW------G-G--NWE-------------------------------RF-KE-VFD--P--YG-----------| OSG_eHP4_00155_Environmental_Halophage_eHP-4_383396772 EH-T-V-VD----AATKQ--------------ETD----E-WASPRELVEPLNT-A-V---NG-----FDLDPC-SGAE------VSPFADKTYTE------SD-----NGLS--QPW-S------G-IVWVNPPY--------------------------SA------------MDTWTEKAIAE--I----E-N-TG-----------------TICYLC--KGDSS-TEWWQTAA-----------Q-EAT-V-ICA-I-DHRLQF--------GD------------G----D----N-------S------APF-----------ASHIVVF------G-R--ASD-------------------------------SL-IL-ELQ--N--HG-----------| ACAty_RS09645_Acidithiobacillus_caldus_491011364 -----A-------VADPA--------------WSD----E-WYTPDYILDAARA-V-L-G-D------IDLDPA-SCAA-AN-E-AV-QAKRFFAK------EQ-----DGLQ--QAW-R------G-KVWLNPPY--------------------------SY-P--Q-------ILDFCEALVQR--Y---AD-G--S--------------VT-EAIVLV--NSGTE-TQWGQMLL-----------S-HGS-A-ACF-P-ASRLKF--------RR----P--E----G----K----S-------G------LPS-----------QGQMLVY--F---G-P--HVD-------------------------------RF-KT-VFL--S--IG-----------| ATC_RS06425_Acidithiobacillus_caldus_503768726 
-----A-------VADPA--------------WSD----E-WYTPDYILDAARA-V-L-G-D------IDLDPA-SCAA-AN-E-VV-RARQFFDK------TQ-----DGLQ--QDW-Q------G-TVWLNPPY--------------------------SY-P--A-------ILDFCEALVQR--Y---AD-G--S--------------VT-EAIVLV--NSGTE-TQWGQMLL-----------S-HGS-A-ACF-P-ASRLKF--------RR----P--E----G----K----S-------G------LPS-----------QGQMLVY--F---G-P--RAD-------------------------------RF-KT-VFL--S--IG-----------| HMPREF0731_4170_Roseomonas_cervicalis_ATCC_49957_296263068 -----T-------AAFSS--------------AYE----A-WATPPDLLERLYA-A-V-G-S------IDLDPC-SPGK-LR-S-RV-KAPRHFTE------RD-----DGLA--QEW-S------G-KVYMNPPY--------------------------GR-T----------IGAWTTKARVE--V---TA-G--R--------------AE-CVVGLV--PARTD-TRWWHADV-----------A-GHA-H-VWL-L-KGRLAF--------GD------------G----S----T-------P------APF-----------PSALLLW------G-G--NAP-----------TIA-----------------EM-SA-SFP------------------| IL54_RS15525_Sphingobium_sp_ba1_739620247 ---T-P------SLWLPS---V----------AAD-RN-R-RFTPIEFLRAIEQ-V-W-G-M------IDLDPC-GHPD----S-PV-NARRRISL------EE-GG--DGLR--DDW-S---G--D-VVYLNPPF--------------------------SE------------MVTWLKRADQM----W-IE--------------MR---VQ-KIIALV--PARTD-IGYFHDRI-----------A-QVC-D-VGL-M-RGRLQF--------GQ----P-----I-G----K-K--D-------D--RSR-ATF-----------ALMVCLW------G-A--TAE-------------------------------EI--A-AFD-----AI-----------| G407_RS29165_Salinarimonas_rosea_759845022 ------------VANWGS---G----------SRD----E-RFTPEDVVRRIEA-V-L-D-G------IELDPC-GHPK----S-PV-RARRFIHR------EE-----DGLK--QPW-N---A--R-TVFVNPPY--------------------------SE------------TGEFVRKAAHE----W-RS-K--R--------------AQ-TVLMLL--PVKTH-FAWHQDHI-----------Q-GIA-D-VFF-L-RGRITF----ERL-GM------------P----A----T-------P------APF-----------PTMLVLY------G----GTD---------------VMIA------------RI-MA-LFE-----CG-----------| AFERRI_560020_Acidithiobacillus_ferrivorans_669953369 
IN-P-R------AALWTA--------------KDN----T-WMTPPSLLEQLYP-L-LPSKT------FDIDPC-SPCV-GP-AAPV-RAYVHYTE------RH-----DGLR--QSW-G-K-G--T-YCYVNPPF--------------------------SH------------LRKWIHKALAE--T---DN-G-------V------------VSILLC--PARVD-SIWWHALV-----------A-NRI-P-VVM-L-RGRLHF--------GG------------G----D----N-------R--QQK-APF-----------ASALLII------G----GSA-------------------------------QL-PK-RVA--D--AT-----------| SYN7509_RS0224085_Synechocystis_sp_PCC_7509_497315962 LQ-----------DKFSK-SSK-T------PTRKK-AP-Q-LYTPPEIIDLVRV-V-M-G-E------IDLDPA-SDDI-AQ-Q-WV-QARNYYTL------AL-----DGLF--HPW---F----G-RVWLHPPA--------------------------DG-K----------TAKWTSKLLNE----Y-SS-G--R--------------VT-EAVLLV--RPSAG-SKWFQKLT-----------R-LFP---VCF-P-DERLKF----L---DD------------Q----E-IP-Q-------T------QPK-----------NGNAIFY--L---G-Q--NRQ-------------------------------QF-GQ-VFG--T--IG-----------| GGI1_15033_Acidithiobacillus_sp_GGI-221_339835072 IK-K-P-RC-A--PCLSS--------------GKD----D-WTTPSHLLNLILN-V-LGRKG------FDLDPC-SPSL-KG---PV-PASRYYTR------RE-----DGLK--QAW-E------G-LVFVNPPY--------------------------SQ------------MRHWSSKLVDA--A---AC-G-------V------------QIIALV--PSRTG-TQWWHQVL-----------D-GGA-R-PIY-L-RGRLRF--------GE------------G------I--G-------Q------APF-----------DSTILLF-----------NFS-------------------------------DF----LAE--Q--MA-----------| TREAZ_0592_Treponema_azotonutricium_ZAS-9_333734957 -----H-------VAHNS--------------GNN----E-WYTPAEYIEAARK-A-M-G-G------IDLDPA-SCEA-AN-R-TV-KAKKIHTI------DD-----DGLG--HPW-E------G-RVWLNPPY--------------------------AR-E--L-------IGKFIEKLKTH--V---CR-G--E--------------VT-EAIVLV--NNATE-TAWFGALV-----------S-FSN-A-IVF-P-ASRVKF--------NG----P--D----G----K----M-------G------SPL-----------QGQAVLY--A---G-P--NSE-------------------------------KF-LD-AYK--S--FG-----------| Metlim_0419_Methanoplanus_limicola_490177569 
-----K-------GIRGV--------------PKN----E-WTTPPEIVEASLE-V-L-G-V------IDIDPC-AESK-DC-P-NI-PARAHYTI------WD-----NGMS--VHW-E------G-RVFLNPPY--------------------------GN-S----------LARWIAYLRDE--Y---RL-G--Y--------------VR-EAIVLV--PARTD-SRWFH--Y-----------M-GSN-F-IWCGV-KGRLRF--------SE------------I----D----G-------P------APF-----------PSAIFYI------G-K--NRK-----------RFV-----------------EI-FS-RFG------------------| Riv7116_1753_Rivularia_sp_PCC_7116_427370342 KT-S-S-KE----TERTN--------------KTD----C-WYSPPHIVELVIQ-V-L---GE-----INLDPC-ADDG------RHIRATKHYTF------DD-----NGLE--QSW-C------G-KVYMNPPY--------------------------SH------------PGAWMKKLELE--F-E-TG-N-VD-----------------EAIALV--PAATD-TNWLSPVL-----------K-TQP---VCF-W-KGRIKF--------LG------------Q----D----Y-------Q------PKLSARQ-------SHVLVYW------G-N--NWQ-------------------------------KF-RE-VFE--D--YG-----------| SPUTW3181_RS15120_Shewanella_sp_W3-18-1_500114172 IN-Q-S------NAEQGF---------------------E-YYTPAPWPQLASQ-L-M-G-G------IDLDPA-SNEI-AN-A-SI-KAKSIFTK------EV-----DGLS--KTW-H---G----TVWMNHPFHRGEQPC---SSKCKKKACIKRGHHIDK-P----I-PG--NGDWINKVISE--Y---ES-G--N--------------IK-EAVIIT--FCNSS-EGWFLPLL-----------K-YA----QCF-P-NGRVHY----IKE-DG------------S---------K-------A------DSCT----------KGSVITY--I---G-K--NVA-------------------------------EF-AR-LYG--E--HG-----------| SHEWPOL2_RS06540_Shewanella_sp_POL2_739569226 IN-Q-S------NAEQGF---------------------E-YYTPEPWPQLASQ-L-M-G-G------IDLDPA-SNDI-AN-A-SI-NAKTIFTK------EI-----DGLS--QRW-Y---G----TVWMNHPFHRGEKPC---KAKCNKKACIKRGHHIDK-P----I-PG--NGDWINKIIKE--Y---ES-G--H--------------IK-EAVIIT--FCNSS-ETWFLPLL-----------K-FP----QCF-P-HGRVHY----KKA-DG------------S---------K-------A------DSCT----------KGSVITY--L---G-K--NVA-------------------------------EF-SR-LFG--A--HG-----------| VII_RS00060_Vibrio_mimicus_446980525 
IN-Q-T--------------------------SGD--V-E-YYTPLEWVEPARQ-V-M-G-S------IELDPA-SSDI-AN-Q-TV-KAQRIFTI------DD-----DGLS--RPW-T---AQ---TLWMNHPFHRGEKACPADHSKCKKITCLKRGFHIDK-D----I-PS--NNDWINKFIAE--Y---EA-G--H--------------FK-EAICIT--FGNTS-EAWFRKLL-----------P-HL----QCF-P-NGRVHY----RKP-DG------------T---------I-------N------RNVT----------KGSVLTY--L---G-D--RPK-------------------------------AF-KK-VFS--R--LG-----------| C475_14433_Halosimplex_carlsbadense_493940376 TS-E-Y-QQ----VHWSS--------------ESD----E-WATPPSLLRPLDD-A-V---DG-----FDLDPC-SGAE------ERSIAAETYTE------AD-----DGLA--QRW-H------G-VVWCNPPY--------------------------SD-V----A-------DWIEKARFE--G-A-RD-A-VE-----------------LVIVLV--PARTS-TQWFHKFA-----------S-HAA-A-VCF-I-EGRLSF--------GD------------A----D----N-------S------APF-----------PSMLLAF------G-E--PTD-------------------------------AV-ID-AFD--D--RG-----------| VPUCM_1151_Vibrio_parahaemolyticus_UCM-V493_584469889 ---L-D-------VMFSS-ANS-G------DKSKD----K-WQTPPEIFAQLND-R-F---G------FTLDAA-AEPE------TA-LCEKYFTE------ED-----DALK--QDW-S---G--H-VVFCNPPY--------------------------SK------L-----R-VFAKKAYEE--S---LK-G--T-----------------TVVMLV--PARTD-TQACHDYL-----------A-N--GE-MYF-I-RGRLKF-----LKVGE------------L----Q----D-------A------APF-----------PSVVCVL------G-P--GVE--R-------------------------------------------------------| C478_07432_Haloterrigena_thermotolerans_493699302 --------------VYEE--------------GDD----K-HDTPVEFVAPLIE-A-V---GG-----FDLDP--SASQ------SSDLAERNVTK------DE-----DGLS--TPW-H------G-DVWLNPPY--------------------------SG-V----S-------DWLEYGRDE--Y-Y-RG-A-VD-----------------SIIALV--FARTS-TQWFHNHA-----------T-TAD-L-ACF-V-EGRLSF--------AG------------S----D----H-------S------APA-----------PSVVLVW------G-D--AAG---------------------------NP--DV-VE-YLD--S--QG-----------| HLRTI_001342_Halorhabdus_tiamatea_495800631 
--------------VYEE--------------GDD----D-HDTPGEFVEPLID-A-V---SG-----FDLDP--SAST------SSNLAERNVTK------DE-----DGLS--IPW-H------G-DVWLNPPY--------------------------ST-V----S-------DWLEYARNE--Y-H-RG-A-VD-----------------SIISLV--FARTS-TQWFHNHA-----------T-TAD-L-ACF-V-EGRLSF--------GE------------A----A----N-------S------APA-----------PSLVLVW------G-D--AAE---------------------------NT--DV-VE-YLS--S--QG-----------| Metfor_2481_Methanoregula_formicica_505099336 -----A-------ALLSH--------------AST----E-HYTPQYILDAVIA-C-M-E-A------IDLDPC-SNSR-KI-P-NV-PAARHYTV------QD-----NGLL--RPW-V------G-RVFLNPPF--------------------------GY-E----V-----E-DWFSKLFLE--T---LE-G--R--------------TT-EAIILW--KSATE-TSAWKTLT-----------R-LSC-R-VCF-P-SARVRF--------GG----P--G----S--D-E-R--K-------S------PTF-----------SPALFYV------G-P--RPE-------------------------------RF-EE-AFR--H--IG-----------| HMPREF0731_RS06220_Roseomonas_cervicalis_750330482 -----T-------AAFSS--------------AYE----A-WATPPDLLERLYA-A-V-G-S------IDLDPC-SPGK-LR-S-RV-KAPRHFTE------RD-----DGLA--QEW-S------G-KVYMNPPY--------------------------GR-T----I-----G-AWTTKARVE--V---TA-G--R--------------AE-CVVGLV--PARTD-TRWWHADV-----------A-GHA-H-VWL-L-KGRLAF--------GD------------G----S----T-------P------APF-----------PSALLLW------G-G--NAP----------------------------------------------------------| LEPBO_RS38120_Leptolyngbya_boryana_738087862 -------------ERMTS---A----------KTD----E-HYTPPELLELVYE-C-FSPLG------IELDPC-SNAH-GEEA-NV-KASQYFTI------ED-----DGLA--QEW-N---A--K-TVYINPPY--------------------------SD------V-----A-AWVDKVVTE----Q-DR-N--N--------------IG-DVLLLV--KADTS-TQWFAQIW-----------E-SAT-A-VCF-L-KKRVRF-----IN-AE------------S----E-G--N-------A------APF-----------ASAIAYF------G-S--EID-------------------------------RF-YY-AFE--S--AG-----------| AGC34828.1_Escherichia_phage_PBECO_4_441462540 
---H-S-------VHFST--------------GKD----N-WTTPKDFFEDLDE-L-W---E------FTLDAA-CVKE------TA-LCDNFFTP------ED-----DSLS--QDW-G---N--N-IVWLNPPY--------------------------SD------L-----K-TWLSKAVDA--Y---NN-G--A-----------------TVVILV--PSRTD-TIAFQDYA-----------A-KICDC-ICF-I-KGRLRF--GIPEEPDK------------K----T----D-------S------APF-----------PSCLIVL------D-K--YLT--T-------------------------------------------------------| PBI_121Q_417_Escherichia_phage_121Q_712914615 ---H-S-------VHFST--------------GKD----N-WTTPKDFFEDLDE-L-W---G------FTLDAA-CVDE------TA-LCDNFFTP------ED-----DSLS--QDW-G---N--N-IVWLNPPY--------------------------SD------L-----K-TWLSKAVDA--Y---NN-G--A-----------------TVVILV--PSRTD-TIAFQDYA-----------A-KICDC-ICF-I-KGRLRF--GIPEEPDK------------K----T----D-------S------APF-----------PSCLIVL------D-K--YLT--T-------------------------------------------------------| F403_gp088_Enterobacteria_phage_vB_KleM-RaK2_422937337 ---H-A-------VHFST--------------RKN----DLWTTPKPLFDKLNA-L-W---N------FTVDVA-CSNE------TA-LCLKHYTP------ED-----DGLS--QDW-S---N--E-TFWLNPPY--------------------------SD------L-----S-PWLSKSVED--Y---NR-G--A-----------------TGLILV--PARTD-TRAFQNFA-----------S-PFCDA-MCF-I-KGRLKF--GNPLKPND------------K----L----T-------S------APF-----------PSCIIVL------D-K--NLT--Q-------------------------------------------------------| DX12_RS0110285_Vibrio_parahaemolyticus_646896396 ---N-K-------LFFSS-ARN-G------SSKQD----K-WQTPPAVFEKLNE-E-F---N------FTLDAT-AEPE------TA-LCDHYFTI------DD-----DALT--QDW-G---N--Q-TVYCNPPY--------------------------SQ------L-----K-DFAKKAQEE--A---KK-G--A-----------------TVVMLV--PARTD-TKAFHDYL-----------S-H--GE-VRL-I-KGRLKF-----LMEGK------------E----Q----D-------A------APF-----------PSMVCVM------G-K--DRE--Q-------------------------------------------------------| ABSDF2497_Acinetobacter_baumannii_SDF_169152788 
MT-K-N-------KLFGL-----A------DDRTD----V-WATPQDFFEKLDR-V-F---K------FDLDVC-ALPD------NA-KCERFFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAADT--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLV-------------------------------DV----NWE--K--SA-----------| A148_RS0111015_Vibrio_splendidus_695353200 ---N-K-------LFFSS-ART-G------NPKRD----K-WQTPPAVFKKLNE-E-F---H------FTLDAT-AEPE------TA-LCDHYFTM------DD-----DALT--QDW-S---N--Q-TVYCNPPY--------------------------SQ------L-----K-DFAKKAQEE--A---KK-G--A-----------------TVVMLV--PARTD-TKAFHDHL-----------S-H--GE-VRL-I-KGRLKF-----LQDGE------------E----Q----D-------A------APF-----------PSMVCV-------------------------------------------------------------------------| OAC_RS0107480_Vibrio_cyclitrophicus_515155813 ---N-K-------LFFSS-ART-G------NPKRD----K-WQTPPAVFKKLNE-E-F---H------FTLDAT-AEPE------TA-LCDHYFTM------DD-----DALT--QDW-S---N--Q-TVYCNPPY--------------------------SQ------L-----K-DFAKKAQEE--A---NK-G--A-----------------TVVMLV--PARTD-TKAFHDHL-----------S-H--GE-VRL-I-KGRLKF-----LQDGE------------E----Q----D-------A------APF-----------PSMVCVM------G-N--DVE--Q-------------------------------------------------------| I59_RS11655_Curtobacterium_sp_B8_551283918 AG-R-G------GWTHEA-P-S----------ATI----D-WYTPDYIFQALAV-T------------FDLDPC-SPGS-SR-S-NV-PAGAVYTL------AD-----DGLS--SPW-R------G-LCWVNPPY--------------------------D--D----T-----R-TWLQRLADH------G-----------------------EGIALV--FARTD-TKWFHEAA-----------K-SAD-L-VCF-T-SGRIKF-----ID-GR-------T----M----Q----P-------G------GSPGA-GS--------VFLAW------G----ATA-------------------------------AA----ALG------------------| Q354_RS0120435_Marinobacterium_jannaschii_654380487 
FG-E---------GANNA-NGR----------KSV----E-WYTPKWIFDELNV-V------------FDLDPS-SPHD-HE-S-FV-PADEKYTI------FD-----DGLS--KPW-H------G-RVWLNPPY--------------------------GR-D----T-----P-FWMNRMIDH------G-----------------------NGIALV--FSRTD-AKWFQDAM-----------K-AAT-A-VLF-V-AGRIEF-----VP-GN-------E----N----K----H-------K----K-SRSGA-GT--------ALFAF------G----EDN-------------------------------AR----VLR------------------| D478_26539_Brevibacillus_agri_BAB-2500_432181416 IN-K---------AMF--------------TSERE----E-WETPQDFFEKLNK-E-F---G------FQLDVC-ALPT------NA-KCERYFTP------DE-----DGLK--QEW-T------G-VCWMNPPY--------------------------GR-E----I-----G-KWVKKAYES--A---KQ-G--A-----------------TVVCLL--PARTD-VKWWHDYC-----------M-KG--E-IRL-V-RGRMKF--------VG------------A----D----N-------M------APF-----------PNAVVIF------S-P--ASA-------------------------------GC----SYK--A--ID-----------| J479_2646_Acinetobacter_baumannii_691127129 MA-K-L-------GLFGN-----A------EGRTD----V-WATPQTLFDALDQ-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----T-LWIDKAVQT--A---NQ-G--H-----------------TVVGLL--PARTD-VTWWQEHV-----------M-NR--E-IHY-I-KGRLKF--------GG------------C----K----H-------N------APF-----------GCAVVVF------R-P--SLK-------------------------------DV----QWG--A--Q------------| FL80_RS15355_Acinetobacter_baumannii_690990657 MA-K-L-------GLFGN-----A------EGRTD----V-WATPQTLFDALDQ-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----V-DWISKAAYT--A---EQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCAVVVF------R-P--SLK-------------------------------DV----QWG--A--Q------------| C462_04300_Halorubrum_arcis_495269178 
MS-L-F-SH----EFHED--------------SSD----E-FGTPAEFHRPLAD-A-V---GG-----FDLDPA-SGAE------SQPLASTRFTK------ED-----DGLS--KEW-F------G-TVWLNPPF--------------------------SE-K----T-------RWVRKARAE--V-A-EG-N-VE-----------------TAVVLL--PVDTS-TKLFHDHV-----------T-DAT-A-ICF-V-EGRLSF--------DG------------G----D----R-------N------PNF-----------GTLLAVF------G-E--ASD-------------------------------DL-LD-ALD--R--KG-----------| AEQU_RS00240_Coriobacteriaceae_496663041 AG---G-------AAFSS--------------ARD----D-WETPAWLFSALDS-E-F---H------FTLDAA-SSDA------NA-KCERHLTK------RD-----DGLA--ADW-----G--GERVWVNPPY--------------------------GR-G----V-----G-AWARKAAIE--G-A-KP-R--T-----------------TVALLV--AARTD-TEWFLRYI-----------L-GHA-E-IRL-V-RGRIRF----ELA-GV------------A----Q----G-------P------APF-----------PSMVAVF------G-E--GAA----------------------------------PG-KVS--SIANA--AARGGKGAS| J689_1368_Acinetobacter_calcoaceticus/baumannii_complex_645913983 MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GR-E----I-----V-DWIAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------| J635_1953_Acinetobacter_baumannii_690997976 MA-K-L-------GLFGN-----A------EGRTD----V-WATPQKLFDALDQ-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QEW-A------G-TCWMNPPY--------------------------GR-E----I-----V-DWISKAAYT--A---EQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----KWG--D--Q------------| LJ44_RS16470_Acinetobacter_baumannii_447017697 
MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------| J635_2258_Acinetobacter_baumannii_690998264 TA-K-L-------GLFGN-----A------EGRTD----V-WATPQKLFDALDQ-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QDW-T------G-TCWMNPPY--------------------------GR-E----I-----S-LWIEKAVQT--A---NQ-G--H-----------------TVVGLL--PTRTD-VAWWQEHV-----------M-NR--E-IHY-I-KGRLKF--------GG------------C----K----H-------N------APF-----------GCAVVVF------R-P--SLK-------------------------------DV----QWG--T--Q------------| TT45_RS11045_Acinetobacter_baumannii_758882462 MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----V-DWISKAAYT--A---EQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWA--M--AV-----------| J660_0735_Acinetobacter_calcoaceticus/baumannii_complex_493629922 MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GC-E----I-----V-DWIAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------| J689_1349_Acinetobacter_baumannii_691068978 
MA-Q-R-------KLFGL-----A------ENRTD----V-WATPQDFFDKLNA-V-F---N------FDLDVC-ALPE------NA-KCERFFSP------EQ-----NGLK--QEW-I------G-TCWMNPPY--------------------------GR-E----I-----V-DWIAKAAYT--A---EQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QW-------------------| W9I_03525_Acinetobacter_nosocomialis_493629840 MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GR-E----I-----V-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------| F985_01871_Acinetobacter_sp_NIPH_973_490838153 MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPG------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GR-E----I-----V-DWIAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------| J523_3197_Acinetobacter_baumannii_691027491 MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----SWE--K--SA-----------| K041_RS17240_Acinetobacter_baumannii_690981431 
MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWD--K--GA-----------| J517_3010_Acinetobacter_baumannii_691065210 MT-K-N-------KLFGL-----A------EERTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----NWE--K--SA-----------| J595_RS19805_Acinetobacter_baumannii_691047241 MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----V-DWIAKAAYT--A---EQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------| J697_3983_Acinetobacter_baumannii_691093639 MT-K-N-------KLFGL-----A------DDRTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-Q--SLI-------------------------------DV----SWE--K--SA-----------| BN776_01939_Clostridium_sp_CAG:768_548223533 
MN-S-T-EF-KK-YNFMQ---E----------RSD------YLTPPEMIQEIFQ-E-LNLLGIYSGDKFDLDTC-CSQK----N-IP-ACNHYIEG------EN-----DGLS--LDW-H------N-LNYCNPPY--------------------------KT------C-----D-KWVKKAFAE----F-QN-G--K-----------------ISVLLI--PARTE-TKYWQEYILKNGFAIRENVY-------VRF-L-RKGLCF-----LN-PE------------T----N----E-------K----M-GVFKN---------ALAIVIF------D----GSK--N-K--------------------------EV-------------------------| J660_1691_Acinetobacter_baumannii_691157882 MS-K-N-------KLFGL-----A------EDRTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----NWE--K--SA-----------| RQ87_RS18135_Acinetobacter_baumannii_447010248 MT-K-N-------KLFGL-----A------DDRTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----SWE--K--SA-----------| J596_3741_Acinetobacter_baumannii_691117543 MA-K-L-------GLYGN-----A------EGKTD----V-WATPQNLFDALDQ-I-F---N------FDLDVC-ALPE------NA-KCERYFTP------EL-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----S-LWIEKAVET--A---NN-G--H-----------------TVVGLL--PVRTD-VVWWQEHI-----------L-HR--E-IHY-I-KGRLKF--------GG------------S----K----H-------N------APF-----------GCALVVF------R-P--SLK-------------------------------DV----QSD--K--SI-----------| ACINWC323_RS01110_Acinetobacter_sp_WC-323_696306260 
MA-K-S-------KLFGL-----A------EDRTD----V-WATPQDFFDKLNA-I-F---D------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLS--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----S-LWIEKAVET--A---NA-G--Y-----------------TVVALL--PARTD-VGWWQSHC-----------L-NR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCAVVVF------R-P--SLN-------------------------------DV----RWE--Q--SQ-----------| K035_3853_Acinetobacter_baumannii_691039522 MT-K-N-------KLFGL-----A------DDRTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWD--K--GA-----------| F931_01759_Acinetobacter_pittii_507070967 MA-K-L-------GLYGN-----A------EGKTD----V-WATPQNLFDAIDH-I-F---N------FDLDVC-ALPE------NA-KCDRYFTP------EL-----DGLK--QEW-V------G-TCWMNPPY--------------------------GR-E----I-----S-LWIEKAVET--A---NN-G--H-----------------TVVGLL--PVRTD-VVWWQEHI-----------L-HR--E-IHY-I-KGRLKF--------GG------------C----K----H-------N------APF-----------GCALVVF------R-P--SLK-------------------------------DV----RWE--S--SI-----------| FL80_RS05360_Acinetobacter_baumannii_690988986 MT-K-N-------KLFGL-----A------EERTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---NK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----NWE--K--SA-----------| ABBL099_02355_Acinetobacter_baumannii_690996743 
MT-K-N-------KLFGL-----A------EERTD----V-WATPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---NK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----SWE--K--SA-----------| SALWKB2_RS02465_Snodgrassella_alvi_644547413 MN-K-G----FTHE-KNA-S-N----------NSD----E-WYTPEWMFRILNL-D------------FDLDPA-APKG-GL-P-WI-PAQQFYCK------ED-----DGLS--KPW-H------G-LVWLNPPY--------------------------GK-E----T-----G-KWLQRMHEH------R-----------------------QGIALV--FSRTD-SRWFHDYA-----------V-KAD-A-ILY-L-KGRVRF-----V--NA-------D----G----D----P-------G----K-SSLGC---------GSVLIGW------G----EVA-------------------------------VS----ALQ------------------| MZ39_RS07370_Pseudomonas_fluorescens_734904253 MG-A-R-------E-APP-K-H----------KSV----E-WYTPAWIFERLGL-Q------------FDLDPS-SPHD-YV-T-AV-PAKTKYTI------FD-----DGLS--KEW-S------G-RVWMNPPY--------------------------GP-E----T-----S-FWMRRLIAH------G-----------------------DGIALV--FSRTD-AEWFQDAM-----------A-NAS-A-TLL-V-KGRIAF-----VP-GH-------E----N----S----H-------K----K-GRSGA---------GSALFAF------G----DEC-------------------------------AI----ALQ------------------| H621_RS26760_Pseudomonas_vranovensis_739119776 MG-A-R-------P-EQP-K-H----------KSV----E-WYTPAWIFERLGV-E------------FDLDPS-SPHD-YV-T-PV-PAKRKYTV------FD-----DGLS--KDW-A------G-RVWMNPPY--------------------------GP-D----T-----G-FWMRRLIAH------G-----------------------NGIALV--FSRTD-AEWFQEAM-----------S-SAS-A-TLL-I-KGRIAF-----IP-GH-------E----N----S----H-------K----K-GRSGA---------GSAMFAF------G----DEC-------------------------------AI----ALQ------------------| J532_4398_Acinetobacter_baumannii_691154760 
MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWD--K--GA-----------| BTS2_0497_Bacillus_sp_TS-2_591276954 IN-Q---------AMF--------------SSSTD----K-WSTPQSFYDKLNQ-E-F---Q------FDIDVC-ATDS------DK-KCERYFSP------EQ-----DGLK--QEW-T------G-ICWMNPPY--------------------------GR-G----I-----G-PWIQKAYES--S---QQ-G--A-----------------TVVCLL--PSRTD-TKWWHEYC-----------M-KG--E-IRF-I-KGRLKF--------GD------------S----K----N-------S------APF-----------PSVVVIF------R-P--KVV-------------------------------SM-------------------------| HMPREF1315_RS07015_Bifidobacterium_longum_494116860 AG---A-------AAMTS--------------NKD----D-WETPQSLFDQLDE-E-F---H------FILDAA-SSDQ------NA-KCEHHYTA------EN-----SGLE--HSW-----E--GETVFCNPPY--------------------------GR-N----I-----G-DWIRKASQE--A-S-KP-D--T-----------------LVVLLV--PARTD-TRWFQNHI-----------L-HRA-E-VRF-L-PGRLKY----EVN-GQ------------A----G---------------------------------EAAPSFW------R-E--GTP----------------------------------SF----------------------| B7017_p0034_Bifidobacterium_breve_704484626 AG---A-------AAMTS--------------NKD----D-WETPQALFDQLDK-E-F---H------FTLDAA-SNDQ------NA-KCEHHYTA------EN-----SGLE--HSW-----G--GETVFCNPPY--------------------------GR-N----I-----G-DWIRKASQE--A-S-KP-D--T-----------------LVVLLV--PARTD-TRWFQNYI-----------L-HRA-E-VRF-L-PGRLKY----EVD-GQ------------A----G----E-------A------APF-----------PSMVVIM------R-T--GER----------------------------------------------------------| BBRE_RS02915_Bifidobacterium_breve_518557238 
AG---G-------AAYMS--------------NRM----N-WETPQELFDQLDA-E-F---H------FTLDAA-SSAT------NH-KCQKYYTA------ED-----SAFD--HEW-----G--GETVFCNPPY--------------------------GK-A----I-----A-EWVRKCSAE--A-S-RK-D--T-----------------LVVMLL--PARTD-TRWFQQFI-----------L-NRA-E-VRF-L-KGRLRF----ETN-GI------------P----G----G-------P------APF-----------PSMIVVM------R-T--GER----------------------------------------------------------| ELEN_RS13090_Eggerthella_lenta_506241510 ---G-G-------VAFSS--------------ERH----Y-WETPQDLFDTLDN-E-F---H------FTLDPA-STDE------NA-KCEKHYTI------ED-----DGLC--QSW-A---G--E-RVFCNPPY--------------------------GR-E----L-----S-KWVKKAHAEVAL---NP-G--T-----------------VVVMLI--PARTD-TTYFHDYI-----------Y-HKA-E-VRF-I-RGRLRFCIQ-----GK------------A----K----D-------A------APF-----------PSMVVVFR-----------------------------------------------------------------------| CLOSCI_00567_[Clostridium]_scindens_ATCC_35704_167664126 LN-K-A--------LFSS--------------AKE----D-WATPQDFFDELNK-E-F---H------FDLDPC-ADAE------NA-KCKEFFTK------EQ-----NGLL--QDW-G---G--R-CVFCNPPY--------------------------GRTS----T-----G-EWIKKCYEE-AQ---KP-G--T-----------------VVVALI--PARTD-TRFFHDYI-----------Y-HKA-E-IRF-I-KGRLHF--------GG------------C----K----D-------A------APF-----------PSMVVVF---RKGK----ENEEEK-KTGCTAAGHT-EEKAAEKDDGSENGVDGI-------------------------| SAG0375_00225_Streptococcus_agalactiae_GB00984_527786367 VQ---K-------SLLSS--------------DKD----Y-WETPQTFFKKLNN-E-F---D------FDLDVA-SSHD------NA-KCKNHFTV------VE-----DGLS--QDW-----T--G-NVFCNPPY--------------------------GR-E----I-----G-KWVEKAYKE--SLK-PY-N--N-----------------VIVLLI--PARTD-TKYWHDYI-----------F-GKA-KDIRY-L-KGRLKF----TIN-GK------------E----N----Y-------P------APF-----------PSAVIIF------------------------------------------------------------------------| GSM_RS11735_Lactobacillus_mali_497764146 
LN---K-------SMFTS--------------DKQ----Y-WETPRDFFNKINK-V-F---H------FNWDLA-STDD------NA-LCTNHLTE------KD-----DSLS--IDW-G-GLS--G-NLFLNPPY--------------------------GR-E----L-----K-LWVKKAAET--K-L-KH-N--Q-----------------YLVLLI--PSRTD-TSYWHDYI-----------F-GKA-E-IKF-I-RGRLKF----AID-GE------------Q----K----D-------A------APF-----------PSALIIYK-----G-E---------------------------------------------------------------| HMPREF0555_0745_Leuconostoc_mesenteroides_subsp_cremoris_ATCC_19254_227352467 VD---K-------VLFSS--------------NSM----V-WETPKDYFDKLNR-K-F---K------FDLDAC-ASDT------NH-KVDTYFTE------DD-----NALE--QKW-----G--G-NVFMNPPY--------------------------GR-H----I-----G-KFIKKAYEE--HLR-DP-N--R-----------------FIVMLI--PSRTD-TKYWHEYI-----------Q-DKA-T-VKF-I-KGRLKF----EID-GE------------S----M----D-------A------APF-----------PSALVVY-GF---------------------------------------------------------------------| HMPREF0555_RS01180_Leuconostoc_mesenteroides_738135700 VD---K-------VLFSS--------------NSM----V-WETPKDYFDKLNR-K-F---K------FDLDAC-ASDT------NH-KVDTYFTE------DD-----NALE--QKW-----G--G-NVFMNPPY--------------------------GR-H----I-----G-KFIKKAYEE--HLR-DP-N--R-----------------FIVMLI--PSRTD-TKYWHEYI-----------Q-DKA-T-VKF-I-KGRLKF----EID-GE------------S----M----D-------A------APF-----------PSALVVY-GF---------------------------------------------------------------------| N644_0465_Lactobacillus_plantarum_AY01_544589963 IN---K-------ALFTS--------------NKE----D-WETPQDFYDRLNA-K-Y---H------FEWDLA-ASDG------NA-KCGDYFTS------DD-----NSLE--QDW-E-RLS--G-NLFLNPPY--------------------------GR-E----L-----K-LWVKKASET--Q-L-KH-D--Q-----------------FLVMLI--PSRTD-TSYWHDYI-----------F-NHA-E-IEF-L-RGRLKF----EVD-GV------------G----G----D-------S------APF-----------PSAVVIYT-----G-E--GNV----------------------------------HE-NPE--L------------LEE| DK41_RS08970_Streptococcus_agalactiae_642982737 
VQ---K-------SLLSS--------------DKD----Y-WETPQTFFKKLNN-E-F---D------FDLDVA-SSHD------NA-KCKNHFTV------VE-----DGLS--QDW-----T--G-NVFCNPPY--------------------------GR-E----I-----G-KWVEKAYKE--SLK-PY-N--N-----------------VIVLLI--PARTD-TKYWHDYI-----------F-GKA-KDIRY-L-KGRLKF----TIN-GK------------E----N----Y-------P------APF-----------PSAVIIY------------------------------------------------------------------------| SAG0375_RS111635_Streptococcus_agalactiae_487848063 VQ---K-------SLLSS--------------DKD----Y-WETPQTFFKKLNN-E-F---D------FDLDVA-SSHD------NA-KCKNHFTV------VE-----DGLS--QDW-----T--G-NVFCNPPY--------------------------GR-E----I-----G-KWVEKAYKE--SLK-PY-N--N-----------------VIVLLI--PARTD-TKYWHDYI-----------F-GKA-KDIRY-L-KGRLKF----TIN-GK------------E----N----Y-------P------APF-----------PSAVIIF------------------------------------------------------------------------| L964_RS00605_Leuconostoc_pseudomesenteroides_491052808 NS---K-------ALFSS--------------KSM----V-WETPKDYFDKLNR-K-F---K------FDLDAC-ASDT------NH-KVDTYFTE------DD-----DALE--QKW-----G--G-NVFMNPPY--------------------------GR-H----I-----G-EFIKKAYEE--HLR-DP-N--R-----------------FIVMLI--PSRTD-TKYWHEYI-----------Q-DKA-T-VKF-I-KGRLKF----ELD-GR------------P----M----N-------T------APF-----------PSALIIY-GL---------------------------------------------------------------------| ZJ316_RS06725_Lactobacillus_plantarum_505193070 VN---K-------ALFTS--------------NKE----D-WETPQDFYDRLNA-K-Y---H------FEWDLA-ASDG------NA-KCGHYFTS------DD-----NSLE--QDW-E-RLS--G-NLFLNPPY--------------------------GR-E----L-----K-LWVKKASET--Q-L-KH-D--Q-----------------FLVMLI--PSRTD-TSYWHDYI-----------F-NHA-E-IEF-L-RGRLKF----EVD-GV------------G----G----D-------S------APF-----------PSAVVIYT-----G-E--GNV----------------------------------HE-NPE--L------------LEE| T370_RS0102475_Bilophila_wadsworthia_736486878 
MN-----------VHFLS--------------KKH----D-WATPWPLFRELNA-R-F---GP-----CELDVC-ATAR------NA-KCGNFFSP------EE-----DGLR--QVW-H------G-VCWMNPPY--------------------------GR-A----L-----P-HWMAKAVNEIEM---ER-A--E-----------------RVICLL--PARTD-TAWWHRYV-----------L-PFAAE-IHY-L-RGRIRF--------EG------------A----G----S-------S------APF-----------PSAVVIF------------------------------------------------------------------------| RBAU_RS10310_Bacillus_amyloliquefaciens_752856685 ME-T-K-------TNFNQGVFFNP------EDRTD----V-WATPIDFFNKINE-R-Y---K------LNLDVC-AKPS------NA-KCKNFFTP------EI-----DGLK--QKW-V------G-RVWMNPPY--------------------------GR-E----I-----K-KWIKKAYEE--V---EN-G--N---------------SEIAVCLV--PARTC-SAWWHEYC-----------M-KG--E-ILF-I-RHRLKF--------GG------------S----K----I-------N------APF-----------PNALVIF------S-N--EHV-------------------------------NT-YK-AID--R--EGNLVI-------| JCM13658_RS00605_Bacteroidales_490421344 MD-----------VTFEG---K-S------STGKN----E-WLTPPCLLRRLGP---F-----------DLDPC-SPVN-RP---WD-TARHHYTI------ED-----DGLQ--QPW-F------G-RVFCNPPY--------------------------DT-A--L-I-----V-RFIRRCVEH------R-----------------------NAVALT--FARTD-TRLFHELI----FP-----N-ADS---ILF-I-KGRLSF-Y--HVT-GE------------Q----G----G-------T------AGA-----------PSCLIAF------N----KEN----------------------------------TA-VLE--T--CG-----------| ACINWC323_A0077_Acinetobacter_sp_WC-323_425484490 MA-K-S-------KLFGL-----A------EDRTD----V-WATPQDFFDKLNA-I-F---D------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLS--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----S-LWIEKAVET--A---NA-G--Y-----------------TVVALL--PARTD-VGWWQSHC-----------L-NR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCAVVVF------R-P--SLN-------------------------------DV----RWE--Q--SQ-----------| HMPREF1069_RS24300_Bacteroides_ovatus_490451898 
MD-----------VTFEG---K-S------STGKD----E-WLTPPCLLRRLGP---F-----------DLDPC-SPVN-RP---WD-TARHHYTI------ED-----DGLQ--QPW-F------G-RVFCNPPY--------------------------DT-A--L-I-----V-RFIRRCVEH------R-----------------------NAVALT--FARTD-TRLFHELI----FP-----N-ADS---ILF-I-KGRLSF-Y--HVT-GE------------Q----G----G-------T------AGA-----------PSCLIAF------N----KEN----------------------------------TA-VLE--T--CG-----------| N007_RS30575_Alicyclobacillus_acidoterrestris_750137118 QG-Q-D-------VLFSS--------------ASI----E-WGTPQHIFDALDA-E-F---H------FTLDAA-ANVH------NH-KCDKWYGT-QS---DG-TFI-DGLA--QDW-S---G--E-TIWLNPPY--------------------------QR-N--V-I-----D-KWAHKAYTS--A-R-DN-G--T-----------------TVVLLL--PARLD-VKWWNKYC-----------V-YAP-E-IRF-V-EGRIRF----EQE-GK-------------------Y--N-------S------ATF-----------PSAIVIF------------------------------------------------------------------------| N007_05570_Alicyclobacillus_acidoterrestris_ATCC_49025_529047023 QG-Q-D-------VLFSS--------------ASI----E-WGTPQHIFDALDA-E-F---H------FTLDAA-ANVH------NH-KCDKWYGT-QS---DG-TFI-DGLA--QDW-S---G--E-TIWLNPPY--------------------------QR-N--V-I-----D-KWAHKAYTS--A-R-DN-G--T-----------------TVVLLL--PARLD-VKWWNKYC-----------V-YAP-E-IRF-V-EGRIRF----EQE-GK-------------------Y--N-------S------ATF-----------PSAIVIF----R-G----GVS-------------------------------DV----PYQ------------------| HMPREF0179_RS04985_Bilophila_wadsworthia_749811142 ---MNP-------ALFSS--------------AKE----D-WETPREFFERLDG-E-F---H------FDLDVC-AFPH------NA-KCPTYFTK------ED-----DGLA--RDW-G---N--R-VCWMNPPY--------------------------GK-A----I-----K-AWMTKALDA--S---RR-G--A-----------------TVVCLV--PSRTD-TAWWHDTV-----------I-AGGAE-VRF-A-RGRLRF--------VG------------A----E----H-------P------APF-----------PSAVVIF------R-P--PPS--P-------------------------------------------------------| BZ26_RS0118830_Clostridium_botulinum_489480013 
MN-T-A-------VMFSS--------------ETD----L-WATPQDFFDKLNK-E-F---N------FDLDPC-ATKE------NA-KCSKYFTK------EI-----DGLK--QDW-G---R--Y-RVFCNPPY--------------------------GR-E----I-----G-KWVEKAYKE-SK---KQ-N--T-----------------TVVMLI--PARTD-TKYFHSYI-----------Y-HKAKE-IRF-I-KGRLKF--------GN------------A----K----N-------S------APF-----------PSMIVVF-RG---------------------------------------------------------------------| H627_RS17735_Lactobacillus_harbinensis_737460398 MS---D-FLKPGGAALTS--------------NKD----D-WETPQAFFESLNA-K-Y---H------FAIDLA-ASKD------NA-KCDRYFSV------AD-----DSLL--QDWSD-DFG--G-AMYLNPPY--------------------------GR-H----I-----G-DWVKKAYET--S-L-RV-N--V-----------------PIVLLI--PARTD-TSYWHDYI-----------F-GKA-S-IKF-I-RGRLKF----EQN-GM------------A----G----G-------P------APF-----------PSAIIVY--N---G-D--GAE----------------------------------K-----------------------| K288_RS0104020_Bradyrhizobium_sp_Ai1a-2_653543986 MT-T-A-------PLFAG---I-GAHQTPRRTRTD----E-WLTPPAVLKALGP--------------FDLDPC-APIV-RP---WP-TAAHHYTI------RD-----NGLL--LPW-F------G-RVFLNPPY--------------------------HR-S--V-I-----G-KWLARMSGH------G-----------------------RGIALI--FARTE-TEAFFRYV----WE-----Q-ASA---LLF-L-RGRLDF-H--TVD-GG------------T----A----QRQSGRAAN------AGA-----------PSVLCAY------G----PRD----------------------------------AE-MLA--F--CG-----------| EX05_RS06230_Agrobacterium_rhizogenes_736484955 MT-L---------NLFAG---M-GTHQSA-RSKTD----V-WFTPPAIIEALGGPDS-----------FDLDPC-SSVE-RP---WP-TARRHFT----P--ED-----NGLM--RPW-Q------G-RVWMNPPY--------------------------ST-Q--L-L-----R-KFMARMAEH------D-----------------------HGVALV--FARTE-TDPFHRYV----WG-----A-ASG---LLF-V-RGRLNF-H--RID-GE------------P----A----R------KN------GGA-----------PSVLIAY------G----DED----------------------------------RD-ILA--A--AP-----------| I569_RS06865_Enterococcus_dispar_510798824 
MS---L-SY--K-AIMTS--------------DNQ----D-WETPQELFDNLNN-E-F---D------FELDAF-ASDK------NA-KCKHFFTE------RD-----DAFQ--QDW-T-KYK----SIFINPPY--------------------------TS-K----V-----Q-DEVLKKIND--T-I-SS-NWMG-----------------VIVLLI--PARTD-TKRWHDYI-----------F-NKA-DDIRF-I-KGRLRF----EVD-GI------------P----R----G-------S------STF-----------PSAVIVY--D---L-R--NKE----------------------------------E--------------------VAE| YWU_RS14235_Corynebacterium_sputi_736638732 MT-A-G------RKSVSD--------------TKH------WCTPPGILDSVRS-V-F---G---GK-IDLDPC-SNEH----S-LV-NASVEYKL-P----EN-----DGLA--ESW-D-F----E-RIFVNPPY--GSDPV-R-----------------KT-R----I-----A-HWFAKIAES--V---RN-G--S-----------------EVIALV--PVATN-TRHWKNHV----FP-----L-AAA---VCF-LYEPRVKF----YID-GR------------E----D-P--K-------G------APM-----------SCAIIYY------G-R--HLE-------------------------------SF-AE-NFR--H--HG-----------| Q333_RS16720_Brevundimonas_bacteroides_737311698 MS---A------SHRFDN-A-K-R-RRSD-DHPRQ----A-LATPAYVLEPVRR-L-L-G-G------IGLDPC-TDPD----N-PT-GADRFYCL------PQ-----DGAS--LPW-D---A--P-SIFVNPPY--------------------------GE-A----R-----K-RWVERCVEA---------G--T--------------RT-RVVLLI--PAHTE-SKVFQLAL--R--------S-CDS---VLF-I-DARLRF--------GV--M---------R----D----N-GRQE--A------ASH-----------GSALLSW------N-V--DLS-------------------------------RI----VED--V--CG-----------| BSTEL_RS07490_Bifidobacterium_stellenboschense_736512951 MT-A-G------RQPVSV--------------TKH------WCTPQKYVDAVTE-V-F---G---GT-IDLDPC-SNEY----S-TV-NARVEYRL-P----EH-----DGLR--DSW-D-Y----P-RIYVNPPY--GRDKE-H-----------------GT-T----I-----A-DWFVRIAEA--A---RN-G--S-----------------EVMALV--PVATN-TAHWKEYV----YP-----V-ASA---VCF-LYDTRLHF----VIN-GN------------E----D-T--K-------G------APM-----------SCAMIYY------G-N--HPQ-------------------------------EF-GR-VFS--R--YG-----------| BBIA_RS00390_Bifidobacterium_biavatii_705399968 
MT-A-G------RHPVSQ--------------TKH------WCTPQKYVDAVTE-V-F---D---GQ-IDLDPC-SNEY----S-TV-NARVEYIL-P----EN-----DGLR--DSW-D-Y----D-RIYVNPPY--GRDVE-H-----------------GT-T----I-----A-DWFVRIADA--V---GR-G--S-----------------EVMALV--PVATN-TAHWKDFV----YP-----V-ASA---ICF-LYDTRLHF----VIN-GN------------E----D-T--K-------G------APM-----------SCCMIYY------G-D--NPR-------------------------------KF-GR-VFS--R--YG-----------| HMPREF0179_03455_Bilophila_wadsworthia_3_1_6_316921487 ---MNP-------ALFSS--------------AKE----D-WETPREFFERLDG-E-F---H------FDLDVC-AFPH------NA-KCPTYFTK------ED-----DGLA--RDW-G---N--R-VCWMNPPY--------------------------GK-A----I-----K-AWMTKALDA--S---RR-G--A-----------------TVVCLV--PSRTD-TAWWHDTV-----------I-AGGAE-VRF-A-RGRLRF--------VG------------A----E----H-------P------APF-----------PSAVVIF------R-P--PPS--P-------------------------------------------------------| BN981_RS01320_Halobacillus_737532221 MNKM-D-------VHYSS--------------KTN----E-WATPQDFFDELNT-E-F---N------FTLDPC-ATPD------NA-KCDKYFTE------KD-----DGLE--QSW-E---G--E-TVFCNPPY--------------------------GR-G----I-----K-HWVKKAYQE-ST---KP-N--T-----------------TVVLLI--PSRTD-TRYFHDYV-----------Y-HKS-E-IRF-L-KGRLKF--------GD------------G----S----G-------N------APF-----------PSMVAIY-R----------------------------------------------------------------------| V006_02512_Staphylococcus_aureus_686297326 ---M-E-------VHYSS--------------KTN----E-WTTPQNLFDELNG-E-F---N------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------| RDMS_RS01750_Deinococcus_sp_RL_736377798 
---M-A-------VHYSS--------------EKH----D-WTTPRSFFDELNA-E-F---N------FTLDAA-ASPH------NA-LCSRYFTE------AD-----DGLS--QPW-T---G--T-V-WCNPPY--------------------------GR-Q----I-----G-RWIAKAAQS--A---CE-G--A-----------------TVVMLI--PARTD-TAAWHDHI-----------LFNPQAE-VRF-V-RGRLRF--------GD------------A----T----A-------N------APF-----------PSAVIIF------R-P--GGQ--G-------------------------------------------------------| T666_02640_Staphylococcus_aureus_686391504 ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-CWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------| A11W_RS0107210_Staphylococcus_hominis_515743089 ---M-E-------VHYSS--------------KSN----E-WATPQNLFDELNE-E-F---N------FTLDPC-ATDE------NA-KCSKYFTI------ED-----DGLS--KDW-S---K--D-VVFMNPPY--------------------------GR-E----I-----K-KWNKKAYEE--S---LN-G--A-----------------TVVCLI--PARTD-TTYWHDFI-----------F-DRADD-IRF-L-RGRLKF--------GN------------S----K----N-------S------APF-----------PSAIVVY------R----GVTT---------------------------------------------------------| SAGV69_RS11740_Staphylococcus_aureus_506511035 ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWRDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------| X998_RS01715_Staphylococcus_aureus_446374007 
---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLSE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------| U183_02276_Staphylococcus_aureus_686300364 ---M-E-------VHYSS--------------KTN----E-WTTPQNLFDDLNR-E-F---N------FTLDPC-STDE------NA-KCQKHYTE------ND-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-HWVKKAYEE--S---IK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GE------------S----K----N-------S------APF-----------PSAIIVY------R----GVR----------------------------------------------------------| QZ29_RS14215_Staphylococcus_aureus_446374006 ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------| ERS140248_02184_Staphylococcus_aureus_678260344 ---M-E-------VHYSS--------------KTN----E-WATPQNLFDDLNR-E-F---N------FTLDPC-STDE------NA-KCQKHYTA------KD-----NGLI--QDW-S---E--D-VVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---VK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------SE------------S----K----N-------S------APF-----------PSAIIVY------R----GGR----------------------------------------------------------| HMPREF9988_RS10060_Staphylococcus_epidermidis_488427723 
---M-E-------VHYSS--------------KSN----E-WATPQKLFDELDK-E-F---N------FTLDPC-ATDE------NA-KCNKHFTI------ED-----DGLS--KDW-S---K--D-VVFMNPPY--------------------------GR-E----I-----K-KWIKKAYEE--S---LN-G--A-----------------TVVCLI--PARTD-TTYWHDFI-----------F-DKADD-IRF-L-RGRLKF--------GN------------S----K----N-------S------APF-----------PSAIVVY------L----GVTT---------------------------------------------------------| SAZ172_RS05790_Staphylococcus_aureus_554679133 ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APL-----------PSAIIVY------R----GAQ----------------------------------------------------------| SA930_RS14870_Staphylococcus_aureus_446374005 ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVKKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----G------------------------------------------------------------| AS94_12270_Staphylococcus_aureus_686449191 ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNE-E-F---S------FTLDPC-STDE------NA-KCRKYYTV------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-RWVEKAYEE--S---LK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GD------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------| BN981_RS01350_Halobacillus_737533832 
---M-N-------VHYSS--------------KSN----D-WATPQDFFDGLDN-E-F---N------FTLDPC-ATSE------NA-KCDNYFTI------ED-----DGLK--QSW-E---G--E-TVFCNPPY--------------------------GR-E----I-----K-LWVKKAFQE-SK---KP-N--T-----------------KVVMLI--PARTD-TKYFHDYI-----------Y-MQA-R-VRF-I-KGRLKF--------GN------------G----K----G-------N------APF-----------PSMVVIF------------------------------------------------------------------------| W619_00569_Staphylococcus_aureus_686419170 ---M-E-------VHYSS--------------KTN----E-WTTPQHLFDDLNG-E-F---N------FTLDPC-STDE------NA-KCQKHYTA------KD-----NGLI--QDW-S---E--D-IVFMNPPY--------------------------GR-S----I-----K-HWVKKAYEE--S---VK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NKADD-IRF-L-RGRLKF--------GE------------S----K----N-------S------APF-----------PSAIIVY------R----GAQ----------------------------------------------------------| BN981_00304_Halobacillus_trueperi_635344555 MGKM-N-------VHYSS--------------KSN----D-WATPQDFFDGLDN-E-F---N------FTLDPC-ATSE------NA-KCDNYFTI------ED-----DGLK--QSW-E---G--E-TVFCNPPY--------------------------GR-E----I-----K-LWVKKAFQE-SK---KP-N--T-----------------KVVMLI--PARTD-TKYFHDYI-----------Y-MQA-R-VRF-I-KGRLKF--------GN------------G----K----G-------N------APF-----------PSMVVIF------------------------------------------------------------------------| LILY_61_Bacteriophage_Lily_755258783 MS-NTMA------VHYSS--------------KTD----M-WETPQDFFDKLHA-E-F---G------FTLDVC-AVPE------NA-KCERFFSP------DD-----NGLL--QNW-K------G-VCWMNPPY--------------------------GR-Q----I-----G-AWIAKAYES--S---LE-G--A-----------------TVVCLV--PSRTD-TKWWHDYC-----------L-KG--E-VRF-I-KGRLKF--------GG------------S----P----H-------N------APF-----------PNAIVIF------R-G--KGQ----------------------------------------------------------| ERIC1_RS03940_Paenibacillus_larvae_738763505 
MN-K---------VHYSS--------------KTD----M-WETPQNLFDRLNE-E-F---K------FDLDVC-AIPE------NA-KCKRYFTP------SE-----DGLK--QEW-K------G-ACWMNPPY--------------------------GR-Q----I-----G-KWIAKAYES--S---LE-G--A-----------------TVVCLV--PSRTD-TKWWHGYC-----------M-KG--E-IRF-I-RGRLKF--------GG------------S----P----H-------N------APF-----------PNAVVIF------R-----------------------------------------------------------------| ERIC1_1c08270_Paenibacillus_larvae_subsp_larvae_DSM_25719_567770034 MN-K---------VHYSS--------------KTD----M-WETPQNLFDRLNE-E-F---K------FDLDVC-AIPE------NA-KCKRYFTP------SE-----DGLK--QEW-K------G-ACWMNPPY--------------------------GR-Q----I-----G-KWIAKAYES--S---LE-G--A-----------------TVVCLV--PSRTD-TKWWHGYC-----------M-KG--E-IRF-I-RGRLKF--------GG------------S----P----H-------N------APF-----------PNAVVIF------R-G--RKE-------------------------------SLHGQKRNE--T--KD-----------| Q397_RS23350_Terasakiella_pusilla_740171937 MN--WTH------VQRTNGG------------TSD----Q-WFTPFELLNTLYD-A-C---G-V-MQ-FDLDPC-SPPE-YL-A-HT-KAKKRFCV------ES-GD--DGLA--EDW-R---G--K-TIFMNPPY--------------------------GR-G----I-----D-KWVEKACTE--V---AN-G--N--------------AK-TVIGLL--PVKAD-TDWWHNHV--A-MK--AD-M-------FVF---NGRLKF--------GN------------A----K----G-------S------GRF-----------ASALAIW------------------------------------------------------------------------| OPIT5_22060_Opitutaceae_bacterium_TAV5_573475515 --------------MTSS--------------MDM----T-WGTPQVWFDYLHL-E-F---G------FTLDPC-CLHQ------TA-KCKKHYTP------AE-----DGLA--QSW-A---E--E-RVFMNPPY--------------------------GR-D----L-----P-KWMKKAYEE--A---RDNG--T-----------------LIVCFV--PARVD-TEWWHRYA-----------T-K-G-E-VRF-P-KGRVKF--------AD------------A----L----D-------S------APF-----------PVAVVIF------R-S--RL-----------------------------------------------------------| GL4_RS02905_Methyloceanibacter_caenitepidi_779886230 
MT-L-G-----SHQRCVG--------------KSQ----Q-HLTPRWILDPLGE--------------FELDPC-AASP-RP---WS-CADVNYTE------ED-----DGLS--QVW-S------G-RVWLNPPF--------------------------NR-Y--V-V-----G-DWMDRFFDH------G-----------------------RGIALL--HARTE-TNWF-RLV----WK-----C-ASA---LLF-L-DKRVKF-C--RSD-GS------------M----Q----E--A----N------SGA-----------PVVLVAA------D----DLN----------------------------------AA-CLR--R--CG-----------| TY47_RS06930_Lactobacillus_brevis_754895979 MN---N-------ALLSS--------------EKN----Y-WETPHDFFKKLNE-K-Y---Y------FSFDLA-ASPE------NT-KCENFFSE------ED-----NSLT--KAW-H-ELK--G-NLFLNPPY--------------------------GR-E----L-----R-KWVKKAYEE--S-LKKH-D--G-----------------YIVLLI--PARTD-TSYWHDFI-----------F-GKA-Q-INF-L-RGRIKF----ELH-GE------------S----K----D-------A------APF-----------PSAIVIY-G----G-S--Q------------------------------------------------------------| HMPREF1020_RS23965_Clostridium_sp_7_3_54FAA_496656604 MN---D-------ALLSS--------------KNM----C-WCTPPDFFAELDR-E-F---H------FELDPA-STDK------SA-KCAKHFTP------DD-----DGLK--QDW-----G--GYRVFCNPPY--------------------------GR-A----I-----A-DWVRKGYEE--S-R-KP-G--T-----------------TVVMLI--PSRTD-TAYFHDWI-----------F-GKA-SEVRF-L-RGRLKF-T--DED-GN------------G----E----D-------A------APF-----------PSAVIVW-RSPE-S-T--GRE----------------------------------FA-TWH--I---------------| CLOM621_RS14915_Clostridiales_492715347 MN---D-------ALLSS--------------KNM----C-WCTPPDFFAELDR-E-F---H------FELDPA-STDK------SA-KCAKHFTP------DD-----DGLK--QDW-----G--GYCVFCNPPY--------------------------GR-A----I-----A-DWVRKGYEE--S-R-KP-G--T-----------------TVVMLI--PSRTD-TAYFHDWI-----------F-GKA-SEVRF-L-RGRLKF-T--DED-GN------------G----E----D-------A------APF-----------PSAVIVW-RSPE-S-T--GRE----------------------------------FA-TWH--I---------------| ANACOL_RS13845_Anaerotruncus_colihominis_493931641 
---N-K-------ALLSS--------------KRL----D-WCTPRDFFDALDV-E-F---H------FTLDAA-ATEK------SA-KCAKYYTP------ET-----DGLS--ASW-A---G--E-TVFCNPPY--------------------------GR-E----I-----K-AWIKKGFEE-GQ---QS-G--T-----------------TVVLLI--PSRTD-TEYFHKYI-----------L-GKA-E-IRF-L-KGRLKF--------TD------------E----EGLTQD-------A------APF-----------PSMLVIY------R----GQGKEQ-NDG---------------------------------------------------| ND2E_3441_Colwellia_psychrerythraea_694338559 VK-K-L-------AYIGS---K-P-GDIT-SRDSD----S-WYTPNIYTDMTRK-V-L-G-T------IDLDPF-SSSL-AN-E-YV-KAERYFDA------DS-----NAFK--QIW-F-K-EQ-G-TVFMNPPY--SRKLI-------------------DK-A----V-----E-IFLQNISDS--S-I-S-----------------------QAVVLV--NNATE-TKWFQSLT--R--------K-SDA---LCL-V-DKRIPF-E--SFD-GK-----------------H----S-------S------GNT-----------RGQVFLY--Y---G-V--NKK-------------------------------AF-KK-VFK--E--IG-----------| JCM19241_5986_Vibrio_sp_JCM_19241_749448467 MT-Q-H-------AKIAN--------------MNN----E-WHTPHQYIDSARK-V-M-G-S------IDTDPA-SNDI-AQ-E-YI-QADTYYTI------DN-----SSLD--KEW-S------G-NVWMNPPY--------------------------GR-T----I-----K-DFCNKLVDE----F-ES-G--R--------------VK-QAIVLT--NNGTD-TQWFDALS--G--------I-SSA---ICH-H-KKRIAF-L--RPT-GE-----------------R-V--N--------------NNT-----------KGQIFMY--I---G-D--NSQ-------------------------------AF-RD-EFN--Q--YG-----------| G469_RS0106650_Atopobium_fossor_654811069 -----M-------STFTS-GLR-S------S-ASN----E-WTTPKDLFDELNR-E-F---K------FTVDAA-STHE------NA-LVDKHWTL------AE-----DGLA--QCW-D---G--E-RVWCNPPY--------------------------GR-Q----I-----A-QWVKKASEA--V------G--G-----------------VVVMLI--PARTD-TSYWHDYV-----------F-PNASD-IRF-I-RGRLHF--------SQ------------S----K----T-------A------APF-----------PSAIVVF------E-R--WA-----------------------------------------------------------| HMPREF1247_RS02895_Atopobium_488626325 
-----M-------TAFTS-GLR-S------S-TSN----E-WTTPKYLFDELNR-E-F---K------FTVDAA-STHE------NA-LVDKHWTI------EE-----DGLS--QCW-D---N--E-RVWCNPPY--------------------------GR-Q----I-----A-KWVKKASEA--V------G--G-----------------VVVMLI--PARTD-TAYWHDYI-----------F-SNASD-IRF-I-CGRLHF--------SN------------S----K----N-------A------APF-----------PSAIVVF------E-R--WQ-----------------------------------------------------------| N644_RS02335_Lactobacillus_plantarum_727092536 MN---K-------ALFTS--------------NKE----D-WETPQDFYDRLNA-K-Y---H------FEWDLA-ASDG------NA-KCGDYFTS------DD-----NSLE--QDW-E-RLS--G-NLFLNPPY--------------------------GR-E----L-----K-LWVKKASET--Q-L-KH-D--Q-----------------FLVMLI--PSRTD-TSYWHDYI-----------F-NHA-E-IEF-L-RGRLKF----EVD-GV------------G----G----D-------S------APF-----------PSAVVIY------------------------------------------------------------------------| CC61_RS14530_Chromobacterium_sp_C-61_748184431 MA-E-N-------VHFST--------------GKD----E-WPTPQALFDQLNA-E-F---G------FTIDVC-ATAK------NA-KCTKFYTQ------VD-----DGLA--QNW-A---G--E-VVWMNPPF--------------------------GH-S----I-----K-LWMAKAYRS--S---LD-G--A-----------------LVVCLV--PARTD-TRWWHRVV-----------M-KAS-E-VRV-L-DKRLRF--------DG------------G----N----H-------K------APF-----------PAVVVVF------------------------------------------------------------------------| QI18_RS10395_Lactococcus_lactis_746045508 MN-R-E-------LMFSS--------------KTD----L-WSTPWNFFEKLND-E-F---H------FTLDPC-STHE------NA-KCYKHFTI------KE-----DGLL--QDW-G---N--E-VVFCNPPY--------------------------GR-K----I-----K-DWVKKAYEE-SQ---KD-N--T-----------------TVVMLI--PARTD-TIYFHEYV-----------Y-HKA-E-VRF-I-KGRLKF--------GD------------A----K----N-------A------APF-----------PSMVVIFRKDNQ-------------------------------------------------------------------| RN16_RS04075_Chromobacterium_subtsugae_759887196 
LS-E-Q-------IHFSS--------------KTD----E-WPTPQALFDQLHA-E-F---G------FTLDVC-ATQE------NA-KCERFFTR------EQ-----DGLA--QDW-S---R--E-VVWMNPPF--------------------------GH-Q----I-----K-LWMAKAYRS--S---ID-G--A-----------------LVVCLV--PARTD-TRWFHRHA-----------L-KAA-E-IRA-L-DKRLRF--------DG------------A----K----A-------K------APF-----------PAVLVVY------------------------------------------------------------------------| MMA_RS11485_Janthinobacterium_sp_Marseille_501027971 -M-S-K-------VHFSS--------------ATP----E-WYTPQSTFDVLNA-E-F---G------FTLDPC-CTHE------NA-KCDRHFTM------AE-----NGLS--QDW-S---N--E-VTFMNPPY--------------------------GR-E----I-----K-EWMRKAYES--S---LS-G--A-----------------TVVCLV--PARTD-TAWWHDYS-----------I-K-G-E-IRF-L-RGRLKF--------GG------------A----K----T-------N------APF-----------PSAIVIF------R-P--------------------LPIKELA------------------------------------| AWRIB429_RS09790_Oenococcus_oeni_768719850 MN-N-E-------LMFSS--------------KTD----L-WSTPNDFFDKLND-E-F---H------FTLDPC-STHE------NA-KCYKHFTK------EE-----NGLL--QDL-G---N--E-VVFCNPPY--------------------------GR-Q----I-----K-DWVKKSYEE-SQ---KD-N--T-----------------TVVMLI--PARTD-TIYFHEYI-----------Y-HKA-E-IRF-I-KGRLKF--------GN------------A----K----N-------S------APF-----------PSMVVIFE-----------------------------------------------------------------------| TH16_RS01985_Staphylococcus_caprae_488372936 ---M-S-------VHFSS--------------KSN----E-WYTPQYLFDELNE-K-Y---Q------FTLDPC-ASHE------NA-KCDKYFTI------ED-----DGLT--KDW-S---K--D-IVFMNPPY--------------------------GR-N----I-----K-HWIKKAYEE--S---VK-G--A-----------------TVVCLI--PARTD-TTYWHDYI-----------F-NNAYN-IKF-L-KGRIKF--------GG------------A----V----N-------S------APF-----------PSAIVVF------KPKGDGLK----------------------------------------------------------| OR63_RS06485_Clostridium_tetani_737140426 
MN-T-A-------VMFSS--------------ETD----L-WATPQEFYNELNK-E-F---N------FDLDPC-ATHE------NA-KCPKYYTV------VE-----DGLK--QDW-Q---G--H-KVFCNPPY--------------------------GR-E----I-----S-KWVEKAYKE-SK---KE-N--T-----------------TVVMLI--PARTD-TKYFHSYI-----------Y-RKAKE-IRF-I-KGRLKF--------GN------------A----K----N-------S------APF-----------PSMVVVF------------------------------------------------------------------------| G454_RS0114655_Desulfovirgula_thermocuniculi_654109520 ML-N-R-------GLFSS--------------ASS----E-WETPQKFFETLDV-E-F---G------FTLDVC-ARPE------NA-KCPRYFSP------EE-----DGLR--QEW-A---P--E-VCWMNPPY--------------------------GR-E----I-----G-KWIQKAYEE--A---QK-G--A-----------------TVVCLL--PSRTD-TAWWHEYV-----------M-RAA-E-VRF-I-RGRLRF--------GG------------A----E----N-------G------APF-----------PSCVVVF------R-P--GYS--G--------LPV-VKSMAAR------------------------------------| RM98_RS18265_Chromobacterium_violaceum_759932528 LS-E-Q-------VHFSS--------------KTD----E-WPTPQALFDQLHE-E-F---G------FTLDVC-ATAE------NA-KCERFFTR------EQ-----DGLA--QDW-S---R--D-VVWMNPPF--------------------------GH-Q----I-----K-LWMAKAYRS--S---ID-G--A-----------------LVVCLV--PARTD-TRWFHRHA-----------L-KAA-E-IRA-L-DKRLRF--------DG------------A----K----A-------K------APF-----------PAVLVVY------------------------------------------------------------------------| T259_RS08765_Clostridium_botulinum_748203410 MN-T-A-------VMFSS--------------ETD----L-WATPQDFFDKLNK-E-F---N------FDLDPC-ATHE------NA-KCSKYFTK------EI-----DGLK--QDW-Q---G--Y-KVFCNPPY--------------------------GR-V----L-----K-DWVKKCYEE-SL---KP-N--T-----------------TVVMLI--PARTD-TKYFHEYI-----------Y-HKVKE-IRF-V-KGRLKF--------GD------------A----K----N-------S------APF-----------PSMVVVF------------------------------------------------------------------------| DESKU_RS03925_Desulfotomaculum_kuznetsovii_503587829 
ML-N-E-------SMFSS--------------RTG----E-WETPQTFFDALDA-E-F---H------FTLDVC-ARPE------NA-KCARFFTP------EQ-----DGLR--QSW-A---G--E-TCWMNPPY--------------------------GR-E----I-----G-RWVEKAYNE--A---RR-G--A-----------------VVVALL--PARTD-TRWWHRYV-----------M-RAA-E-IRF-V-EGRLKF--------GG------------A----E----N-------S------APF-----------PSVVVVF------T-PEKAVS--D--------GPV-VRSMRVK------------------------------------| CLOSCI_RS06430_[Clostridium]_scindens_748651356 LN-K-A--------LFSS--------------AKE----D-WATPQDFFDELNK-E-F---H------FDLDPC-ADAE------NA-KCKEFFTK------EQ-----NGLL--QDW-G---G--R-CVFCNPPY--------------------------GRTS----T-----G-EWIKKCYEE-AQ---KP-G--T-----------------VVVALI--PARTD-TRFFHDYI-----------Y-HKA-E-IRF-I-KGRLHF--------GG------------C----K----D-------A------APF-----------PSMVVVF---RKGK----ENEEEK-KTGCTAAGHT-EEKAAEKDDGSENGVDGI-------------------------| NZ45_03810_Clostridium_botulinum_700273311 MN-T-A-------VMFSS--------------ETD----L-WATPQDFFDKLNK-E-F---D------FDLDPC-ATHE------NA-KCSKYFTK------EI-----DGLK--QDW-Q---G--H-KVFCNPPY--------------------------GR-G----I-----K-DWVEKAYKE-SK---KE-N--T-----------------TVVMLI--PARTD-TRYFHEYI-----------Y-HKAKE-IRF-V-KGRLKF--------GS------------A----K----N-------S------APF-----------PSMVVVF---RGE------------------------------------------------------------------| MCOL2_RS04700_Listeria_fleischmannii_738104299 MD---R-------VIFSS--------------ERD----D-WETPTDLFNELDK-E-F---L------FDLDAT-ANKN------NA-KCPKFFTK------EQ-----NALV--QEW-----R--G-SVFCNPPY--------------------------GR-E----I-----Q-KFIEKAYIE--SKK-AY-C--E-----------------RVVLLI--PARTD-TKIWHDFI-----------F-PFS-KEIIF-I-KGRLKY----ELN-KI------------S----N----S-------P------APF-----------PSAIIIF---EECNL----------------------------------------------------------------| BN927_RS09785_Lactococcus_lactis_554763517 
MN-K-E-------LMFSS--------------KTD----L-WSTPWNFFDKLND-E-F---H------FTLDPC-STHE------NA-KCYKHFTI------EE-----DGLL--QDW-G---N--E-VVFCNPPY--------------------------GR-Q----I-----K-DWVKKAYEE-SQ---KD-D--T-----------------TVVMLI--PARTD-TIYFHEYI-----------Y-HKA-E-IRF-I-KGRLKF--------GD------------A----K----N-------A------APF-----------PSMVVIF-----RKDNQ--------------------------------------------------------------| PI74_RS05125_Clostridium_botulinum_500994137 MN-T-A-------VMFSS--------------GTD----L-WATPQDFFDKLNK-E-F---D------FDLDPC-ATHK------NA-KCSKYFTK------EI-----DGLK--QDW-Q---G--Y-KVFCNPPY--------------------------GR-S----I-----K-DWVEKAYKE-SK---KE-N--T-----------------TVVMLI--PARTD-TRYFHEYI-----------Y-NKAKE-IRF-V-KGRLKF--------GD------------A----K----N-------S------APF-----------PSMVVVF------------------------------------------------------------------------| Q332_RS01180_Pseudobacteroides_cellulosolvens_739064083 ---T-E-------IMFSS--------------KSD----E-WETPQQFFDKLHK-E-F---N------FQLDVC-ATAE------NA-KCDKYYTK------ID-----DGLS--QSW-H---HWAQ-RCWMNPPY--------------------------GR-N----I-----D-KWIKKAFDE--S---QE-G--A-----------------TVVCLI--PARTD-TKYWHTYC-----------M--KAHE-IRF-V-KGRLKF--------SN------------S----K----D-------C------APF-----------PSAIVVF------K-P--TLK--QLKVSSY-------------------------------------------------| G454_RS0102995_Desulfovirgula_thermocuniculi_654100680 MF-N-R-------VLFSS--------------ATS----E-WETPQELFARLHA-E-F---G------FTLDVC-ARPW------NA-KCTRYFSP------EQ-----NGLI--QEW-A---P--E-TCWMNPPY--------------------------GR-E----I-----S-RWVRKAWEE--A---QK-G--A-----------------TVVCLL--PSRTD-TAWWHEYV-----------M-RAA-E-IRF-I-RGRLHF--------EG------------A----K----N-------G------APF-----------PSCVVVF------R-P--GCT--G--------PPV-IRSMAAR------------------------------------| Phi93_04_Lactococcus_phage_phi93_673939868 
MN-N-E-------LMFSS--------------KTD----L-WSTPNDFFDKLND-E-F---H------FTLDPC-STHE------NA-KCYKHFTK------EE-----NGLL--QDW-G---N--E-VVFCNPPY--------------------------GR-Q----I-----K-EWIKKSYEE-SQ---KD-N--T-----------------TVVMLI--PARTD-TIYFHEYI-----------Y-HKA-E-IRF-I-KGRLKF--------GN------------A----K----N-------S------APF-----------PSMVVIF----E-------------------------------------------------------------------| GAP32_068_Cronobacter_phage_vB_CsaM_GAP32_414086984 NN-M-S-------VHFSS--------------ASN----T-WDTPDDFYQKLHA-V-W---N------FTLDPA-AMDE------TA-KCEKYYTP------ET-----DGLA--HSW-A---G--E-TVWCNPPY--------------------------GR-E----I-----S-KWFKKFDEE-FK---QN-G--T-----------------TIIALP--PARTD-TTYFHKYV-----------R-DSATA-ICF-V-KGRLKF--------DNRSLPSWKEDGSHK----K----T-------G------APF-----------PSMIVIY----D-N----NITQEK-YEVLNSLGFV-VQPFLLG------------------------------------| CO98_RS04645_Staphylococcus_aureus_739716594 ---M-S-------VHFSS--------------KSN----E-WTTPQYLFDELNE-E-F---N------FTLDPC-ATDE------NA-KCSKYFTI------ED-----DGLS--KDW-S---N--D-VVFMNPPY--------------------------GR-E----I-----K-KWIKKAYEE--S---LN-G--A-----------------TVVCLI--PARTD-TTYWHDFI-----------F-DKADD-IRF-L-KGRLKF--------GN------------S----K----N-------S------APF-----------PSSIVIY------E----CKEAEQ-------------------------------------------------------| TS65_RS13365_Aneurinibacillus_migulanus_759006369 MN-T-A-------VMFSS--------------ATD----E-WATPQDFFDQLNQ-E-F---H------FTLDPC-ATHE------SA-KCARYFTE------ED-----NGLA--QDW-T---G--E-IVFMNPPY--------------------------GR-V----L-----G-QWVKKAFEE--S---IK-G--A-----------------TVVCLL--PARTD-TRWFHDYI-----------Y-HRA-E-IRF-V-KGRLKF--------GD------------S----K----N-------S------APF-----------PSMVVIF------N-RA-GVKVGG-------------------------------------------------------| KU40_RS04850_Clostridium_botulinum_737823765 
--------------MFSS--------------KTD----M-WSTPQDFYNKLNQ-E-F---N------FNLDPC-STNE------NA-KCERHYTI------AE-----DGLK--QNW-V---G--S-TVFCNPPY--------------------------GR-V----L-----K-DWVKKCYEE-SK---KD-N--T-----------------TVVMLI--PARTD-TTYFHNYI-----------Y-KKVKE-IRF-I-RGRLKF--------GD------------C----K----N-------A------APF-----------PSMVVVF------------------------------------------------------------------------| SD74_RS18965_Clostridium_botulinum_752703286 MN-T-A-------VMFSS--------------ETD----L-WATPQDFFDELNK-E-F---D------FDLDPC-ATHE------NA-KCDKYYTI------VE-----DGLK--QDW-Q---G--H-KVFCNPPY--------------------------GR-G----I-----K-DWVEKAYKE-SK---KE-N--T-----------------TVVMLI--PARTD-TKYFHSYI-----------Y-HKAKE-IRF-I-KGRLKF--------GD------------A----K----N-------S------APF-----------PSMVVVF------------------------------------------------------------------------| EMTOL_RS19950_Emticicia_oligotrophica_504839093 MN-I-K-------AIFSC--------------KTT----N-WETPQDLFDELDK-Q-Y---N------FTLDVC-ATSE------NA-KCNEFFTP------EI-----DGLK--QEW-K------G-MCWMNPPY--------------------------GR-E----I-----G-KWVRKAHLE--V---IT-G--R----------------CRIIALL--PARTD-TKWFHEWV-----------LNKH--E-IKF-I-KGRLRF--------SD------------S----K----N-------S------APF-----------PSMLVIF------E-G--RP-----------------------------------------------------------| J546_RS10975_Acinetobacter_baumannii_736663998 MA-N-H-------QLFGL-----A------ENRTD----I-WATPQDFFDKLNA-V-F---K------FDLDVC-ALPN------NA-KCERFFSP------ED-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----I-EWVAKAACT--A---KQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----S-------N------APF-----------GCCVVVF------R-P--TLN-------------------------------DV----EWE--N--AG-----------| J532_4398_Acinetobacter_baumannii_940793_630464595 
MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWD--K--GA-----------| J594_4091_Acinetobacter_baumannii_259052_588219826 MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERFFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GR-E----I-----V-DWIAKAAYT--A---EQ-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------| ACIN5021_2863_Acinetobacter_sp_OIFC021_444754682 MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GR-E----I-----V-DWIAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------| J660_0735_Acinetobacter_baumannii_88816_593668543 MA-Q-S-------KLFGL-----A------ENRTD----V-WSTPQDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-S------G-TCWMNPPY--------------------------GC-E----I-----V-DWIAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----S----S-------N------APF-----------GCCVVVF------R-P--SLK-------------------------------DV----QWI--V--TE-----------| PaVLD_ORF117R_Planktothrix_phage_PaV-LD_371496242 
IQ-Q-L-------CLFET---Q-P--SLI---DSN----E-NYTPSDLIDLVHK---F-Y-G-F----PELDPF-SCEQ-AN-Q-II-KAQKIFTI------QD-----DGFK--QNW-R-R-A--K-TLWLNPPY----------SA--------------GF------I-----E-KVVDKLIAT--L---NE-T--E----A------------EAFLLT--NTDNS-TAWYKKAL----NR-----C-DR----FCL-P-STRLTF-Y--SPK-RA--V----E----G--K-K-Q--N-------Q------NRF-----------SQTLFYF------G-L--QPQ-------------------------------RF-EE-IFE--G--WG-----------| M095_RS22645_Bacteroidales_492455391 MN-----------TSFER---S--------KQTTD----E-WYTPKWIVDALGS--------------FDLDPC-APEN-RL---WN-TAKRHITP------SE-----DGLK--TEW-G------GVRVWLNPPY--------------------------SR-P--L-I-----E-RFVEKMVRN------N-----------------------NGIALL--FNRCD-SKMFQDLI----FP-----N-ASA---IMF-V-KGRIKF-Y--RPD-GT------------Q----G----D-------S------PGC-----------GSVLIAF------G----EEN----------------------------------AK-ILE--Y--SN-----------| BN796_00478_Alistipes_sp_CAG:831_547185524 MN-----------TSFER---C--------ANTTD----E-WYTPKWIIDSLGE--------------FDLDPC-SPAN-RL---WN-TAKRHITP------QE-----DGLK--TSW-G------GVRVWLNPPY--------------------------SR-P--L-I-----E-RFVEKMVAN------N-----------------------NGIALL--FNRCD-SKMFQDLI----FP-----N-ASA---ILF-V-RGRIKF-Y--RPD-GT------------Q----G----D-------S------PGC-----------GSVLIAF------G----ESN----------------------------------AE-ALE--K--SN-----------| JCM13658_RS05485_Bacteroides_gallinarum_517496590 MD-----------VRFEG---R-S------STGKN----E-WLTPPDLLERLGP--------------FDLDPC-APVN-RP---WA-TAAHHYTI------ED-----DGLK--QPW-F------G-RVFCNPPY--------------------------DT-S--L-I-----V-QFIRRCSEH------G-----------------------NAVALT--FARTD-TRLFHEWI----FP-----R-ADS---VLF-I-KGRLSF-H--HVS-GE------------R----G----S-------T------AGA-----------PSCLIAF------G----KAN----------------------------------TA-VLK--S--CG-----------| HMPREF9447_RS03430_Bacteroides_oleiciplenus_496419253 
MN-----------VTFEG---N-S------HTGKN----E-WLTPPDLLKKLGH--------------FDLDPC-SPVN-RP---WS-TAHRHYTI------LD-----NGLE--QEW-T------G-RVFCNPPY--------------------------DT-N--L-I-----V-RFIHRCAEH------G-----------------------NAIALT--FARTD-TRLFHDEI----FR-----K-ADS---ILF-I-KGRLRF-Y--HVN-GE------------Q----G----G-------T------AGA-----------PSCLIAF------N----KEN----------------------------------TE-VLR--N--CG-----------| BN938_RS08150_Mucinivorans_hirudinis_740870005 MN-----------VTFEG---N-S------STGKN----E-WLTPPDILAKLGE--------------FDLDPC-APIN-RP---WA-TANNHFTI------ED-----DGLV--QPW-Q------G-RVFCNPPY--------------------------DT-R--L-I-----I-QFIERCIEH------K-----------------------NAIALT--FARTE-TKLFQELI----FR-----H-AHS---ILF-I-KGRLSF-H--HVT-GE------------R----G----G-------T------AGA-----------PSCLIAF------D----EAN----------------------------------SQ-VLK--N--CG-----------| TY03_RS11290_Bacteroides_graminisolvens_640565353 MN-----------VTFEG---N-S------ATGKN----Q-WLTPPELLAKLGQ--------------FDLDPC-APIN-RP---WP-TATQHYTI------ED-----DGLK--QPW-F------G-RCWVNPPY--------------------------DT-Q--L-I-----I-QFIERCVEH------K-----------------------NAIALT--FSRTE-TKLFQELI----FK-----K-AHS---ILF-I-KGRLSF-H--HVT-GE------------R----G----G-------T------AGA-----------PSCLISF------N----EVN----------------------------------SE-ILK--S--CG-----------| M120_RS21850_Bacteroides_fragilis_695522728 MN-----------VTFEG---K-S------STGKN----E-WLTPPCLLDRLGE--------------FDLDPC-SPVN-RP---WD-TARHHYTV------GD-----DRLR--QPW-F------G-RVFCNPPY--------------------------DT-P--L-I-----V-RFIRKCVEH------R-----------------------NAIALT--FARTD-TRLFHELI----FP-----Y-ADT---ILF-I-RGRLRF-Y--HVT-GE------------Q----G----G-------T------AGA-----------PSCLISF------N----REN----------------------------------TA-ALK--M--CG-----------| M121_RS02435_Bacteroides_fragilis_695336745 
MN-----------VTFEG---K-S------STGKN----E-WLTPPCLLDRLGE--------------FDLDPC-SPVN-RP---WD-TARHHYTV------GD-----DRLR--QPW-F------G-RGFCNPPY--------------------------DT-P--L-I-----V-RFIRKCVEH------R-----------------------NAIALT--FARTD-TRLFHELI----FP-----Y-ADT---ILF-I-RGRLRF-Y--HVT-GE------------Q----G----G-------T------AGA-----------PSCLISF------N----REN----------------------------------TA-ALK--M--CG-----------| H599_RS0112420_Flavobacterium_daejeonense_652309842 MN-----------TSFER-----C------ENTKV----E-WLTPPELVKKLGE--------------FDLDPC-SPIN-AP---FL-HAKNNFTV------LD-----NGLS--QKW-F------G-RVYLNPPY--------------------------GR-G--M-E-----L--WLEKLKFH------G-----------------------NGIALI--FARTE-TKCFFEHI----WN-----D-ADA---VLF-V-KGRIRF-Y--HIS-GI------------Q----A----G-------T------PGA-----------PSVFIAY------G----KEN----------------------------------AF-ALK--N--CG-----------| D478_RS25245_Brevibacillus_agri_748713908 --------------MFTS--------------ERE----E-WETPQDFFEKLNK-E-F---G------FQLDVC-ALPT------NA-KCERYFTP------DE-----DGLK--QEW-T------G-VCWMNPPY--------------------------GR-E----I-----G-KWVKKAYES--A---KQ-G--A-----------------TVVCLL--PARTD-VKWWHDYC-----------M-KG--E-IRL-V-RGRMKF--------VG------------A----D----N-------M------APF-----------PNAVVIF------S-P--ASA-------------------------------GC----SYK--A--ID-----------| M655_RS0109725_Bacillus_sp_NSP21_737442515 --------------MFKS--------------ERE----E-WETPQEFFDKLND-E-F---G------FQLDVC-ALPT------NA-KCERYFTP------DD-----DGLH--QEW-T------G-VCWMNPPY--------------------------GR-E----I-----G-KWVKKAYES--A---KQ-G--A-----------------TVVCLL--PARTD-VKWWHDYC-----------M-KA--E-IRL-V-RGRMKF--------VG------------A----D----N-------M------APF-----------PNAVVIF------S-P--ASA-------------------------------GC----SYK--A--ID-----------| BTS2_RS02440_Bacillus_sp_TS-2_780117918 
MN-Q---------AMFSS--------------STD----K-WSTPQSFYDKLNQ-E-F---Q------FDIDVC-ATDS------DK-KCERYFSP------EQ-----DGLK--QEW-T------G-ICWMNPPY--------------------------GR-G----I-----G-PWIQKAYES--S---QQ-G--A-----------------TVVCLL--PSRTD-TKWWHEYC-----------M-KG--E-IRF-I-KGRLKF--------GD------------S----K----N-------S------APF-----------PSVVVIF------R-P--KVV-------------------------------SM-------------------------| SBVP3_0091_Vibrio_phage_phi_3_751186426 ----------------MN--------------SND----E-WYTPEFIMDKVRR-V-L-G-E------IDLDPA-SNPT-AN-T-IV-RAKTYYTK------EQ-----NGLN--YPW-L------G-KVWCNPPY--------------------------SA-A--L-I-----K-KFTKYFAEE--Y---KR-G--V--------------MT-EGIMLT--NSGTD-TQWNIAL--------------QGG-V-QAY-T-NGRISF--------LQ----P--DL---T----P----K-------G------KGS-----------RGQCFTY--F---G-P--NPE-------------------------------LF-IK-VFTEDN--FC-----------| METEXDRAFT_RS01570_Methylobacterium_extorquens_489692296 MG------------ETLG---I-GGHQRPRKERTD----T-WLTPPGIVRALGP--------------FDLDPCAAPDP-KP---WA-TAATHYTW---P--AQ-----DGLL--LPW-Y------G-RVWLNPPY--------------------------GR-A----L-----G-TWLAKMARH------GC-----------------------GTAFT--FARTE-TKAFFDHV----WN-----E-ADA---ILF-L-KGRVSF-H--HQD-GS------------P----A----R-------N------GGA-----------PSVLIAF------G----ADD----------------------------------VE-RLM--E--SG-----------| MICLODRAFT_RS13290_Microvirga_lotononidis_497160926 MT------------LNKG---M-GGHHSA-AAMTE----T-WLTPPGIIQALGSSSS-----------FDLDPCAAPKS-RP---WD-TARNHYTW---P--EQ-----DGLR--LPW-E------G-RVWLNPPY--------------------------GR-A----M-----T-DWLKKMSRH------NK-----------------------GTALI--FARTE-TEAYHEFV----WP-----Y-ASG---LLF-L-RGRLHF-H--YPD-GR------------R----A----E------AN------SGA-----------PSVLVAY------G----EED----------------------------------VE-RLI--Q--SG-----------| K035_3825_Acinetobacter_baumannii_691039509 
---------------------------------------------QDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-----------------------------------------------------------------| ND2E_RS09310_Colwellia_psychrerythraea_696562339 ----------------SD---------------------S-WYTPNIYTDMTRK-V-L-G-T------IDLDPF-SSSL-AN-E-YV-KAERYFDA------DS-----NAFK--QIW-F-K-EQ-G-TVFMNPPY--SRKLI-------------------DK-A----V-----E-IFLQNISDS--S-I-S-----------------------QAVVLV--NNATE-TKWFQSLT--R--------K-SDA---LCL-V-DKRIPF-E--SFD-GK-----------------H----S-------S------GNT-----------RGQVFLY--Y---G-V--NKK-------------------------------AF-KK-VFK--E--IG-----------| K035_3825_Acinetobacter_baumannii_42057_4_629017472 ---------------------------------------------QDFFEKLDR-V-F---N------FDLDVC-ALPE------NA-KCERYFTP------EI-----DGLK--QEW-T------G-TCWMNPPY--------------------------GK-E----I-----I-DWVAKAAET--A---SK-G--H-----------------TVVALV--PVRTD-ARWFQDYC-----------L-GR--E-IHF-I-RGRLKF--------GG------------S----K----T-------N------APF-----------GCCVVVF------R-P--SLI-------------------------------DV----SWE--K--SA-----------| CLOM621_08346_Clostridium_sp_M62/1_291074040 -----------------M---------------------C-WCTPPDFFAELDR-E-F---H------FELDPA-STDK------SA-KCAKHFTP------DD-----DGLK--QDW-----G--GYCVFCNPPY--------------------------GR-A----I-----A-DWVRKGYEE--S-R-KP-G--T-----------------TVVMLI--PSRTD-TAYFHDWI-----------F-GKA-SEVRF-L-RGRLKF-T--DED-GN------------G----E----D-------A------APF-----------PSAVIVW-RSPE-S-T--GRE----------------------------------FA-TWH--I---------------| C471_08405_Halorubrum_saccharovorum_490147912 
----------------------------------------------RIGRPLSW-A-V---DG-----FDLDPA-SGAE------PVPIADQRYTE------AD-----DGLA--QPW-H------G-DVFLNPPW--------------------------TS-E----DSDGTPKRRWLRKARNE--A-Q-RD-A-VD-----------------TMIVLL--PAATE-AGWFRDHM-----------W-GAP-A-LCF-VGPGRIPF--------IG------------E----D----R-------N------PSFP-----------LAIAAF------G-D--VPA-------------------------------AL-LD-VLD--S--FG-----------| RIV7116_RS23705_Rivularia_sp_PCC_7116_763462851 ----------------SD---------------------E-WYTPPHISDLVTQ-V-L---GQ-----ITLDPC-ADEG------KHIRAAQHYTV------LD-----DGLI--QEW-N------G-RIFMNPPY--------------------------SA-P----S-------VWIKKLQAE--F-E-SG-R-VT-----------------EAIALV--PAATD-TRWLSPLL-----------K-SQP---VCF-W-TGRIKF--------LD------------M----S----Y-------K------PRLSARQ-------SHCLVYW------G-G--NWE-------------------------------RF-KE-VF-------------------| OPIT5_RS20660_Opitutaceae_bacterium_TAV5_763429761 ---------------MDM---------------------T-WGTPQVWFDYLHL-E-F---G------FTLDPC-CLHQ------TA-KCKKHYTP------AE-----DGLA--QSW-A---E--E-RVFMNPPY--------------------------GR-D----L-----P-KWMKKAYEE--A---RDNG--T-----------------LIVCFV--PARVD-TEWWHRYA-----------T-K-G-E-VRF-P-KGRVKF--------AD------------A----L----D-------S------APF-----------PVAVVIF------R-S--RL-----------------------------------------------------------| DALK_RS23730_Desulfatibacillum_alkenivorans_506429612 ---------------MNC---------------------E-WATPQDLFDSLNK-E-F---H------FTLDPC-CTIE------NA-KCERFYTK------AE-----DGLS--QDW-T---G--E-TVFMNPPY--------------------------SRSE----M-----P-KWIQRAYES--S---LA-G--S-----------------KVVCLL--PAKTD-TRWFHDFC-----------L-K-G-E-IRF-I-KGRICF--------GS------------G----E----G-------R------APF-----------PSMVVIF------N-G--------------------AK-----------------------------------------| VE20213_RS09880_Clostridiales_bacterium_VE202-13_639741003 
-----------------M---------------------D-YCTPQDFFDKLNQ-E-F---H------FTLDAA-ATSK------SA-KCPQYYTP------EI-----DGIK--NPW-SIAGG--G-AVFCNPPY--------------------------GR-K----I-----G-KWVRKAYEE-S----RN-G--T-----------------TVVLLI--PARTD-TAYFHDYI-----------Y-GCA-E-IRF-V-RGRLHF--------TD------------E----DGNTYD-------R------APF-----------PSMVVIY------N----G----N-RVG---------------------------------------------------/
consensus/100% .......................................................................D...s.................................s.h.....h.......................................................................................................h....s.......................................................................................................................................................................................
consensus/95% .........................................b.pP..b...h..................lDss.s.............s...hs..............suL.....W...........hhhNPPa...........................................ah.+h...................................lhl....s.sp...ha.........................h.....ch.b...............................................s................hhhh........................................................................
consensus/90% .................................p.......a.TP..hhp.l.....h...........pLDss.u.............s.p.as..............suL...p.W...........hahNPPa..........................s................ah.+h..p.........s......................lhLh...s.os.s.hap..h...................h.h.h.p.Rl.F...............................................sshs.............lhha........................................................................
consensus/85% ...............b.................p.......a.TP..hhp.l.....h..........hsLDss.u.............s.p.as..............sGL...p.W...........hahNPPY..........................sp......h........ah.+h.pp.........s.....................hlhLl...spT-.s.aapphh...................l.a.l.c.RlpF........................................s......sshss............lhha........................................................................
consensus/80% ...............h.................pp......W.TP..hhc.lp....h..........hsLDss.u..p......ps..sppaao.......p......sGL...p.W...........hahNPPY..........................up......l........ah.+hhpp.........s.....................hlhLl...scT-.s.aapphh...................l.a.l.+uRlpF........s.......................p.......s......ushss...........hllha........................................................................
consensus/75% ...............a.................ps......W.TP..hhc.Ls....a..........hsLDss.u.sp......ss.psppaaT.......pp.....DGL...ppW...........hahNPPY..........................up......l.......pWlpKhhpp.........s.....................hlhLl..ssRTD.spaapchh...................lpF.l.+GRl+F........s..................p....s.......s......ushss...........hllla........................................................................
consensus/70% ...............a.................ss......W.TPpphh-bLsp...a..........hsLDsC.ussp......ss.psp+aaT.......cp.....DGLp..ppW..........plahNPPY..........................uc.p....l.......cWlpKuhpp.......p.u.....................lVhLl..PuRTD.opaapchh...........b.......lpF.l.+GRL+F........u.............s....p....s.......s......APhss...........hllla........................................................................
GI Gene neighborhood Domain arch Pfam-architectures Gene name Len Taxonomy Species name Genbank # ; Eukaryotic Chlorophyte DAM -ParB-HTH fused 302838997 <-ParB-HTH+N6-MTase* ParB-HTH+N6-MTase HSP90+Dam VOLCADRAFT_91459 473 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_91459 [Volvox carteri f. nagariensis]. <-302838989_?||302838783_?->302838785_?-><-302838991_?<-302838993_?<-302838995_?||302838787_?-><-302838997_ParB-HTH+N6-MTase*<-302838999_?||302838789_?->302838791_?->302838793_?-><-302839001_?||302838795_?-><-302839003_? 302842945 <-ParB-HTH+N6-MTase*||?->?-><-?||?-><-?||?-><-Guanylate_kin ParB-HTH+N6-MTase Dam VOLCADRAFT_118198 357 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_118198 [Volvox carteri f. nagariensis]. <-302842935_?<-302842937_?||302842757_?-><-302842939_?<-302842941_?<-302842943_?||302842759_?-><-302842945_ParB-HTH+N6-MTase*||302842761_?->302842763_?-><-302842947_?||302842765_?-><-302842949_?||302842767_?-><-302842951_Guanylate_kin 302845993 <-ParB-HTH+N6-MTase* ParB-HTH+N6-MTase - VOLCADRAFT_106408 816 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_106408 [Volvox carteri f. nagariensis]. 302845869_?->302845871_?-><-302845987_?||302845873_?-><-302845989_?||302845875_?-><-302845991_?<-302845993_ParB-HTH+N6-MTase*||302845877_?->302845879_?->302845881_?-><-302845995_?<-302845997_?<-302845999_?||302845883_?-> 302846292 <-ParB-HTH+N6-MTase*<-?<-?||?->?->?-><-METHYLASE ParB-HTH+N6-MTase Dam VOLCADRAFT_106473 528 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_106473 [Volvox carteri f. nagariensis]. 302846154_?-><-302846286_?||302846156_?->302846158_?-><-302846288_?||302846160_?-><-302846290_?<-302846292_ParB-HTH+N6-MTase*<-302846294_?<-302846296_?||302846162_?->302846164_?->302846166_?-><-302846298_METHYLASE<-302846300_? 
302854263 <-ParB-HTH+N6-MTase*||?->?-><-?<-RelA ParB-HTH+N6-MTase Dam VOLCADRAFT_108225 570 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_108225 [Volvox carteri f. nagariensis]. <-302854251_?<-302854253_?<-302854255_?<-302854257_?||302854221_?-><-302854259_?<-302854261_?<-302854263_ParB-HTH+N6-MTase*||302854223_?->302854225_?-><-302854265_?<-302854267_RelA<-302854269_?||302854227_?->302854229_?-> 302838546 <-ParB-HTH+N6-MTase*||?->?-><-?<-?<-P-kinase ParB-HTH+N6-MTase Dam VOLCADRAFT_104840 490 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_104840 [Volvox carteri f. nagariensis]. 302838326_?->302838328_?-><-302838542_?||302838330_?-><-302838544_?||302838332_?->302838334_?-><-302838546_ParB-HTH+N6-MTase*||302838336_?->302838338_?-><-302838548_?<-302838550_?<-302838552_P-kinase||302838340_?-><-302838554_? 302855015 <-N6-MTase+N6-MTase* N6-MTase+N6-MTase - VOLCADRAFT_108410 146 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_108410 [Volvox carteri f. nagariensis]. 302854993_?->302854995_?->302854997_?-><-302855013_?||302854999_?-><-302855015_N6-MTase+N6-MTase*<-302855017_?||302855001_?-><-302855019_?<-302855021_?<-302855023_?<-302855025_?<-302855027_? 302839284 <-ParB-HTH+N6-MTase* ParB-HTH+N6-MTase Dam VOLCADRAFT_104970 364 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_104970 [Volvox carteri f. nagariensis]. <-302839284_ParB-HTH+N6-MTase*<-302839286_?||302839130_?-><-302839288_?<-302839290_?<-302839292_?||302839132_?-><-302839294_? 302855367 <-ParB-HTH+N6-MTase* ParB-HTH+N6-MTase Dam VOLCADRAFT_100579 287 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_100579 [Volvox carteri f. nagariensis]. 
302855347_?-><-302855357_?<-302855359_?<-302855361_?<-302855363_?||302855349_?-><-302855365_?<-302855367_ParB-HTH+N6-MTase* 302838722 <-N6-MTase* N6-MTase Dam VOLCADRAFT_104908 495 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_104908 [Volvox carteri f. nagariensis]. 302838498_?->302838500_?-><-302838714_?<-302838716_?<-302838718_?||302838502_?-><-302838720_?<-302838722_N6-MTase*||302838504_?-> 302843631 <-ParB-HTH* ParB-HTH PAT1 VOLCADRAFT_105875 819 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_105875 [Volvox carteri f. nagariensis]. 302843457_?-><-302843625_?<-302843627_?||302843459_?->302843461_?-><-302843629_?||302843463_?-><-302843631_ParB-HTH*||302843465_?-><-302843633_?||302843467_?-><-302843635_?||302843469_?-><-302843637_?||302843471_?-> 302839946 METHYLASE->?-><-?||?-><-?||N6-MTase+N6-MTase*-> N6-MTase+N6-MTase TM VOLCADRAFT_92117 1075 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_92117 [Volvox carteri f. nagariensis]. 302839936_?->302839938_?->302839940_METHYLASE->302839942_?-><-302840106_?||302839944_?-><-302840108_?||302839946_N6-MTase+N6-MTase*-><-302840110_?||302839948_?-><-302840112_?||302839950_?-><-302840114_?||302839952_?->302839954_?-> 159472462 <-ParB-HTH+N6-MTase*||?-><-?||?-><-?||?->?-><-Guanylate_kin ParB-HTH+N6-MTase Dam CHLREDRAFT_191158 321 eukaryota>viridiplantae>chlorophyta Chlamydomonas reinhardtii predicted protein [Chlamydomonas reinhardtii]. <-159472452_?<-159472454_?||159472222_?-><-159472456_?<-159472458_?<-159472460_?||159472224_?-><-159472462_ParB-HTH+N6-MTase*||159472226_?-><-159472464_?||159472228_?-><-159472466_?||159472230_?->159472232_?-><-159472468_Guanylate_kin 302852824 N6-MTase*-> N6-MTase Nop14 VOLCADRAFT_99082 300 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_99082 [Volvox carteri f. nagariensis]. 
<-302852874_?<-302852876_?||302852822_?-><-302852878_?||302852824_N6-MTase*-><-302852880_?<-302852882_?<-302852884_?||302852826_?-><-302852886_?<-302852888_?||302852828_?-> 575474002 APSES->?-><-MBL<-?||P-kinase-><-?||?-><-ParB-HTH+N6-MTase* ParB-HTH+N6-MTase Dam BATDEDRAFT_85509 496 eukaryota>fungi>chytridiomycota Batrachochytrium dendrobatidis JAM81 hypothetical protein BATDEDRAFT_85509 [Batrachochytrium dendrobatidis JAM81]. 575471282_APSES->575472862_?-><-575472060_MBL<-575472062_?||575471354_P-kinase-><-575474000_?||575472864_?-><-575474002_ParB-HTH+N6-MTase*||575472064_?->575471574_?->575472666_?-><-575472066_?||575473118_C6FunFin-><-575471562_?||575472708_?-> # ; Eukaryotic Chlorophyte DAM 760440511 BMB+PHD+N6-MTase*-> BMB+PHD+N6-MTase+ZFCW PHD+zf-CW F751_3154 516 eukaryota>viridiplantae>chlorophyta Auxenochlorella protothecoides hypothetical protein F751_3154 [Auxenochlorella protothecoides]. 760440497_?-><-760440499_?||760440501_?-><-760440503_?||760440505_?-><-760440507_?<-760440509_?||760440511_BMB+PHD+N6-MTase*-><-760440513_?||760440515_?->760440517_?->760440519_?-><-760440521_?<-760440523_?<-760440525_? 612396523 <-FtsJ_methylase||?-><-?||?-><-?<-?<-RAMA+N6-MTase+ZFCW*<-RNase_T RAMA+N6-MTase+ZFCW Nucleoplasmin+Drf_FH1 Bathy04g03050 1310 eukaryota>viridiplantae>chlorophyta Bathycoccus prasinos predicted protein [Bathycoccus prasinos]. 612396033_?-><-612396669_FtsJ_methylase||612396113_?-><-612395985_?||612395879_?-><-612396147_?<-612396601_?<-612396523_N6-MTase*<-612395721_RNase_T||612396527_?-><-612396483_?||612395921_?-><-612395871_?||612396661_?->612395853_?-> 159476182 <-Histone||?->?->BMB+N6-MTase*-><-?||Histone-><-?||?-><-?||ABC-transporter-> BMB+N6-MTase+ZFCW PWWP+zf-CW CHLREDRAFT_167032 1174 eukaryota>viridiplantae>chlorophyta Chlamydomonas reinhardtii hypothetical protein CHLREDRAFT_167032, partial [Chlamydomonas reinhardtii]. 
<-159476714_?<-159476716_?<-159476718_?||159476176_?-><-159476720_Histone||159476178_?->159476180_?->159476182_BMB+N6-MTase*-><-159476722_?||159476184_Histone-><-159476724_?||159476186_?-><-159476726_?||159476188_ABC-transporter->159476190_?-> 552817679 <-METHYLASE||?->?-><-?||?->?->?-><-BMB+PHD+N6-MTase+ZFCW* BMB+PHD+N6-MTase+ZFCW PHD+zf-CW CHLNCDRAFT_138470 865 eukaryota>viridiplantae>chlorophyta Chlorella variabilis hypothetical protein CHLNCDRAFT_138470 [Chlorella variabilis]. <-552817673_METHYLASE||552817310_?->552817313_?-><-552817676_?||552817317_?->552817320_?->552817323_?-><-552817679_BMB+PHD+N6-MTase+ZFCW*<-552817682_?<-552817686_?||552817327_?-><-552817690_?||552817330_?->552817333_?-><-552817694_? 545372676 BMB+N6-MTase*-> BMB+N6-MTase - COCSUDRAFT_83615 537 eukaryota>viridiplantae>chlorophyta Coccomyxa subellipsoidea C-169 hypothetical protein COCSUDRAFT_83615 [Coccomyxa subellipsoidea C-169]. 545372133_?-><-545372135_?<-545372137_?<-545372140_?||545372142_?->545372145_?->545372147_?->545372676_BMB+N6-MTase*-><-545372150_?<-545372152_?<-545372155_?||545372157_?-><-545372160_?<-545372162_?<-545372165_? 633905054 PHD+N6-MTase+ZFCW*-> PHD+N6-MTase+ZFCW PHD+zf-CW H632_c3034p0 488 eukaryota>viridiplantae>chlorophyta Helicosporidium sp. ATCC 50920 hypothetical protein H632_c3034p0, partial [Helicosporidium sp. ATCC 50920]. 633905054_PHD+N6-MTase+ZFCW*-> 303277723 <-BMB+N6-MTase+ZFCW* BMB+N6-MTase+ZFCW Nucleoplasmin+zf-CW MICPUCDRAFT_57353 1004 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 303276985_?-><-303277717_?||303276987_?-><-303277719_?||303276989_?-><-303277721_?||303276991_?-><-303277723_BMB+N6-MTase+ZFCW*<-303277725_?||303276993_?-><-303277727_?<-303277729_?||303276995_?->303276997_?->303276999_?-> 255071987 BMB+N6-MTase+ZFCW*-> BMB+N6-MTase+ZFCW DUF3987+zf-CW MICPUN_55980 1012 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. 
255071979_?-><-255072943_?<-255072945_?||255071981_?->255071983_?-><-255072947_?||255071985_?->255071987_BMB+N6-MTase+ZFCW*->255071989_?-><-255072949_?||255071991_?-><-255072951_?||255071993_?-><-255072953_?||255071995_?-> 145349057 <-FtsJ_methylase||?->?-><-BMB+N6-MTase*<-RNase_T||?-><-?||?-><-?||?->ABC-transporter-> BMB+N6-MTase SP+PWWP+SMC_N OSTLU_87805 847 eukaryota>viridiplantae>chlorophyta Ostreococcus lucimarinus CCE9901 predicted protein [Ostreococcus lucimarinus CCE9901]. 145348583_?-><-145349053_?||145348585_?->145348588_?-><-145349055_FtsJ_methylase||145348590_?->145348592_?-><-145349057_BMB+N6-MTase*<-145349060_RNase_T||145348594_?-><-145349062_?||145348596_?-><-145349064_?||145348598_?->145348600_ABC-transporter-> 308806169 <-ubiquitin||?->?-><-FtsJ_methylase||?->?-><-BMB+N6-MTase*<-RNase_T||?-><-?||?->?->ABC-transporter-> BMB+N6-MTase PWWP Ot07g02900 854 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri Actin filament-coating protein tropomyosin (ISS) [Ostreococcus tauri]. 308806155_?-><-308806157_ubiquitin||308806159_?->308806161_?-><-308806163_FtsJ_methylase||308806165_?->308806167_?-><-308806169_BMB+N6-MTase*<-308806171_RNase_T||308806173_?-><-308806175_?||308806177_?->308806179_?->308806181_ABC-transporter-><-308806183_? 693499233 <-ubiquitin||?->?-><-FtsJ_methylase||?->?-><-BMB+N6-MTase*<-RNase_T||?-><-?||?->?->ABC-transporter-> BMB+N6-MTase PWWP OT_ostta07g03040 1018 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri Zinc finger, CW-type [Ostreococcus tauri]. 693499229_?-><-116058850_ubiquitin||693499230_?->116058852_?-><-693499231_FtsJ_methylase||693499232_?->116058855_?-><-693499233_BMB+N6-MTase*<-693499234_RNase_T||693499235_?-><-693499236_?||693499237_?->693499238_?->116058862_ABC-transporter-><-116058863_? 302835622 BMB+BMB+PHD+N6-MTase+ZFCW*-> BMB+BMB+PHD+N6-MTase+ZFCW PWWP+MSP1_C+PWWP+PHD+zf-CW VOLCADRAFT_89771 1214 eukaryota>viridiplantae>chlorophyta Volvox carteri f. 
nagariensis hypothetical protein VOLCADRAFT_89771 [Volvox carteri f. nagariensis]. <-302835846_?||302835614_?-><-302835848_?||302835616_?->302835618_?-><-302835850_?||302835620_?->302835622_BMB+BMB+PHD+N6-MTase+ZFCW*-><-302835852_?||302835624_?-><-302835854_?||302835626_?-><-302835856_?||302835628_?-><-302835858_? ------------------------Prokaryotic homologs------------------------ # 2; Versions somewhat closer to the eukaryotic ones 491011364 <-Methylase<-?<-RusA<-?<-?<-?<-?<-ParB+N6-MTase*<-?<-?||Peptidase_S24-> ParB+N6-MTase ParBc+Dam ACAty_RS09645 388 bacteria>proteobacteria>gammaproteobacteria Acidithiobacillus caldus hypothetical protein [Acidithiobacillus caldus]. <-491011357_Methylase<-491011359_?<-740686876_RusA<-740686878_?<-491011362_?<-740687613_?<-740686880_?<-491011364_ParB+N6-MTase*<-740686882_?<-491011365_?||740686884_Peptidase_S24-><-740686886_?||491011368_?->491011369_?->740687616_?-> 503768726 <-ArdC+MPTase<-Methylase<-?<-RusA<-?<-?<-?<-ParB+N6-MTase*<-?<-?||Peptidase_S24-> ParB+N6-MTase ParBc+Dam ATC_RS06425 388 bacteria>proteobacteria>gammaproteobacteria Acidithiobacillus caldus hypothetical protein [Acidithiobacillus caldus]. <-503768720_ArdC+MPTase<-503768721_Methylase<-503768722_?<-753905032_RusA<-503768724_?<-753905034_?<-753902617_?<-503768726_ParB+N6-MTase*<-503768727_?<-503768728_?||753905037_Peptidase_S24->503768729_?->753902625_?->491011368_?->503768731_?-> # 3; Packaging associated 500114172 Terminase_LS->Phage_portal-><-?<-?||?-><-?<-N6-MTase* N6-MTase Dam SPUTW3181_RS15120 208 bacteria>proteobacteria>gammaproteobacteria Shewanella sp. W3-18-1 hypothetical protein [Shewanella sp. W3-18-1]. 
<-500114165_?||500114166_Terminase_LS->500114167_Phage_portal-><-500114168_?<-500114169_?||500114170_?-><-500114171_?<-500114172_N6-MTase*<-500114173_?<-500114174_?<-752761115_?<-500114175_?<-500114176_?||500114177_?->500114178_?-> 739569226 TET-JBP->?-><-?<-?<-?<-?<-?<-N6-MTase* N6-MTase Dam SHEWPOL2_RS06540 196 bacteria>proteobacteria>gammaproteobacteria Shewanella sp. POL2 hypothetical protein, partial [Shewanella sp. POL2]. 739569175_TET-JBP->739569177_?-><-739569179_?<-739569180_?<-739569181_?<-739569224_?<-739569183_?<-739569226_N6-MTase*<-739569228_?<-739569184_?<-739569186_?<-739569229_?<-739569187_?<-739569189_?<-739569192_? 446980525 <-N6-MTase* N6-MTase - VII_RS00060 195 bacteria>proteobacteria>gammaproteobacteria Vibrio mimicus hypothetical protein [Vibrio mimicus]. <-694128903_?<-446367425_?<-447182778_?<-447051858_?<-446937185_?<-694128904_?<-446915745_?<-446980525_N6-MTase*<-446925120_?<-447034144_?<-694128905_?||446144374_?->446829364_?->446123684_?->694128906_?-> # 4; the circularly permuted methylase is of the HpaI family 694338559 <-DpnII-likeRE<-cpDAM<-N6-MTase* N6-MTase SP+Dam ND2E_3441 184 bacteria>proteobacteria>gammaproteobacteria Colwellia psychrerythraea DNA N-6-adenine-methyltransferase [Colwellia psychrerythraea]. 694338552_?->694338553_?-><-694338554_?<-694338555_?||694338556_?-><-694338557_DpnII-likeRE<-694338558_cpDAM<-694338559_N6-MTase* 696562339 <-DpnII-likeRE<-cpDAM<-N6-MTase* N6-MTase Dam ND2E_RS09310 162 bacteria>proteobacteria>gammaproteobacteria Colwellia psychrerythraea hypothetical protein, partial [Colwellia psychrerythraea]. 696562293_?->696562294_?-><-696562295_?<-696562337_?||696562296_?-><-696562297_DpnII-likeRE<-696562338_cpDAM<-696562339_N6-MTase* # 4; phage/prophage 749448467 <-N6-MTase*<-?<-Phage_integrase N6-MTase Dam JCM19241_5986 168 bacteria>proteobacteria>gammaproteobacteria Vibrio sp. JCM 19241 modification methylase Bsp6I [Vibrio sp. JCM 19241]. 
<-749448460_?<-749448461_?<-749448462_?<-749448463_?<-749448464_?<-749448465_?<-749448466_?<-749448467_N6-MTase*<-749448468_?<-749448469_Phage_integrase||749448470_?->749448471_?->749448472_?->749448473_?->749448474_?-> 751186426 <-N6-MTase* N6-MTase Dam SBVP3_0091 167 viruses>dsdna viruses, no rna stage>caudovirales Vibrio phage phi 3 hypothetical protein SBVP3_0091 [Vibrio phage phi 3]. <-751186419_?<-751186420_?<-751186421_?<-751186422_?<-751186423_?<-751186424_?<-751186425_?<-751186426_N6-MTase*<-751186427_?<-751186428_?<-751186429_?||751186430_?->751186431_?->751186432_?->751186433_?-> # 1; Prophage 333734957 <-N6-MTase*<-?<-?<-?||SFII-RAD3-> N6-MTase Dam TREAZ_0592 238 bacteria>spirochaetes Treponema azotonutricium ZAS-9 gp44 [Treponema azotonutricium ZAS-9]. <-333734241_?<-333735058_?<-333734368_?<-333735683_?<-333736204_?<-333736985_?<-333737240_?<-333734957_N6-MTase*<-333736693_?<-333737435_?<-333734288_?||333734581_SFII-RAD3-><-333735236_?<-333734247_?<-333736610_? # 1; 497315962 <-Phage_integrase<-?||?-><-N6-MTase* N6-MTase Dam SYN7509_RS0224085 273 bacteria>cyanobacteria Synechocystis sp. PCC 7509 DNA N-6-adenine-methyltransferase (Dam) [Synechocystis sp. PCC 7509]. <-497315669_?<-497315670_?<-497315671_?||655839734_?-><-497315638_Phage_integrase<-740179817_?||497315960_?-><-497315962_N6-MTase*||497315963_?->497315964_?-><-655839735_?<-497315966_?<-497315967_?<-740179819_?<-740179822_? 505099336 <-N6-MTase* N6-MTase Dam Metfor_2481 181 archaea>euryarchaeota Methanoregula formicica DNA N-6-adenine-methyltransferase (Dam) [Methanoregula formicica]. 432331833_?->432331834_?->432331835_?->432331836_?->432331837_?->432331838_?-><-432331839_?<-505099336_N6-MTase*<-432331841_?<-432331842_?<-432331843_?<-432331844_?<-432331845_?<-432331846_?<-432331847_? 490177569 <-N6-MTase* N6-MTase SP+Dam Metlim_0419 212 archaea>euryarchaeota Methanoplanus limicola hypothetical protein [Methanoplanus limicola]. 
490177562_?->490177563_?-><-490177564_?<-490177565_?<-490177566_?<-490177567_?<-490177568_?<-490177569_N6-MTase*<-490177570_?<-490177571_?||490177572_?->490177573_?-><-490177574_?<-490177575_?||490177576_?-> # 1; 427370342 N6-MTase*-> N6-MTase Dam Riv7116_1753 519 bacteria>cyanobacteria Rivularia sp. PCC 7116 DNA N-6-adenine-methyltransferase (Dam) [Rivularia sp. PCC 7116]. 427370335_?->427370336_?->427370337_?->427370338_?-><-427370339_?<-427370340_?||427370341_?->427370342_N6-MTase*-><-427370343_?||427370344_?-><-427370345_?||427370346_?->427370347_?-><-427370348_?<-427370349_? # 1; Type IV system 763462851 AAA-><-?||ASCH->N6-MTase*-> N6-MTase Dam RIV7116_RS23705 144 bacteria>cyanobacteria Rivularia sp. PCC 7116 hypothetical protein, partial [Rivularia sp. PCC 7116]. 504933756_?->504933757_?->504933758_?->504933759_?->504933760_AAA-><-504933761_?||763462848_ASCH->763462851_N6-MTase*-><-504933763_?||504933764_?->504933765_?->504933766_?-><-504933768_?||763462853_?->763461708_?-> 427373349 AAA-><-?||ASCH+N6-MTase*-> ASCH+N6-MTase Dam Riv7116_4895 629 bacteria>cyanobacteria Rivularia sp. PCC 7116 ASCH domain-containing protein [Rivularia sp. PCC 7116]. 427373342_?->427373343_?->427373344_?->427373345_?->427373346_?->427373347_AAA-><-427373348_?||427373349_ASCH+N6-MTase*-><-427373350_?||427373351_?->427373352_?->427373353_?->427373354_?-><-427373355_?||427373356_?-> # 1; 186465327 ParB-HTH->?->?->?->?->?->DCM+N6-MTase*-> DCM+N6-MTase DNA_methylase+Dam Npun_F2574 1180 bacteria>cyanobacteria Nostoc punctiforme PCC 73102 C-5 cytosine-specific DNA methylase [Nostoc punctiforme PCC 73102]. 186465320_?->186465321_ParB-HTH->186465322_?->186465323_?->186465324_?->186465325_?->186465326_?->186465327_DCM+N6-MTase*->186465328_?->186465329_?-><-186465330_?<-186465331_?<-186465332_?<-186465333_?<-186465334_?
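The gene-neighborhood strings in the table above use a compact notation that can be read mechanically. The following is a minimal sketch of a parser, not the authors' own tooling. It assumes, based only on how the entries in this table look, that a gene written as `<-GI_label` lies on the reverse strand, a gene written as `GI_label->` lies on the forward strand, `||` marks the boundary between a reverse-strand gene and the following forward-strand gene, `*` flags the query gene, and `?` means no domain was assigned. The names `Gene` and `parse_neighborhood` are hypothetical and chosen for illustration.

```python
from typing import List, NamedTuple, Optional


class Gene(NamedTuple):
    gi: str                # GI number as written in the table
    label: Optional[str]   # domain architecture, or None when the table shows "?"
    strand: str            # "+" for "GI_label->", "-" for "<-GI_label" (assumed reading)
    is_query: bool         # True when the entry carries the "*" marker


def parse_neighborhood(text: str) -> List[Gene]:
    """Split one GI-annotated neighborhood string into per-gene records."""
    # Put a NUL separator at every gene boundary: after each "->",
    # before each "<-", and in place of each "||".
    marked = text.replace("||", "\0").replace("->", "->\0").replace("<-", "\0<-")
    genes = []
    for token in marked.split("\0"):
        token = token.strip()
        if not token:
            continue
        if token.startswith("<-"):
            strand, core = "-", token[2:]
        elif token.endswith("->"):
            strand, core = "+", token[:-2]
        else:
            raise ValueError("no strand marker in token: %r" % token)
        is_query = core.endswith("*")
        if is_query:
            core = core[:-1]
        # Only the first underscore separates GI from label; labels such as
        # "Terminase_LS" keep their own underscores.
        gi, _, label = core.partition("_")
        genes.append(Gene(gi, None if label == "?" else label, strand, is_query))
    return genes


# Neighborhood of VOLCADRAFT_100579 (GI 302855367), copied from the table.
example = ("302855347_?-><-302855357_?<-302855359_?<-302855361_?"
           "<-302855363_?||302855349_?-><-302855365_?"
           "<-302855367_ParB-HTH+N6-MTase*")
genes = parse_neighborhood(example)
query = [g for g in genes if g.is_query][0]
print(len(genes), query.gi, query.label, query.strand)
```

Records parsed this way can be filtered by label, for example to list every neighborhood in which the query methylase sits next to a `Phage_integrase` or `RNase_T` gene.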
Alignment of eukaryotic members only. FINAL -H--HHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHH-------HHHEEE-----E-------HHHHHHH--HHHHHHHHH----------HHHH-----------------------HHHHH------HHHHHH-HH---------EEEEEEEEEE--E-------------- ALIGN ----HHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHH------HEEHHH-----HHH-----HHHHHHH--HHHHHHHHH-------------------------------------HHHHH------HHHHHH-HHH---------EEEEEEE-E--EE--E---------- HMM ----HHHHHHHHHHHHH----HHHHHHHHHHHHHH---HHE------EEEEEE-----E---E---HHHHHHH--HHHHHHHHH-----EE-----EE---------------EEEE------HHH------HHHHHH-HH------EEEEEEEEEEEE---E--EEEEEEE----- FREQ -H--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHHHHHH-----HH------HHHHHHH--HHHHHHHHH----------H-H-----------------HH-----HHHHH------HHHHHH-HH---------EEEEEEEHHH--H-------------- PSSM -H--HHHHHHHHHHHHHH---HHHHHHHHHHHHHH-------------HHEEH-----------------HHH--HHHHHHHHH-----------HH------------------------HHHHH------HHHHHH-HH--------EEEEEHHEE------------------- CHLREDRAFT_191158_Creinhardtii_159472462 RP--EDLEGCEKLIEGSSLKNNFLSIGQALVTINDRKLYKDSG-YTSFTQYIE-----QKGDFGFGPRQALRL--LAATRLVRNFPPNIALPSSERQV---------------RALVGLEQAEAIE------VWSKAT-KISQDTNTPLTHRLVESVLGK--ELPTTYRQVTRDWQD VOLCADRAFT_118198_Vcarteri_302842945 QP--EDLEGCEKLIEGSSLKNNFLRIGQALVTINDRKLYKRAG-SSSFTQYIE-----QKSDFGFGPRQALRL--LAATRLVRNFPPTIALPTSERQV---------------RALVGLEQQQAVK------VWVKAN-LIAQETGVPLTHRLVESVLGK--ELPASYRQAARDWQD VOLCADRAFT_91459_Vcarteri_302838997 EG--NALASAERRVARAA-PAYFAEASLAMLEIAEGKLYSFAG-HASFQDYIR---K-SSAVLGFGLRQARNL--IAAARVIRNLPADVARPSNERQV---------------RPLVGCHPDVQLK------VWVLALERADGDTRHEDALSLQYGGL-P--VITRSDTCEWYTPDF VOLCADRAFT_108225_Vcarteri_302854263 RA--RTLGEAEARVTMAA-GGFFVEASCALLDIAEQHLFEAEG-YKSFRAYIL-----ERKSLGFGYRQARAW--VAAARFIRSLPPSMPVPQYERLV---------------RPLTRCEPGVARL------AWERVL-RYHNEEGRRMTAELVVSCI-Q--EVGTASTVAQSDSED VOLCADRAFT_105875_Vcarteri_302843631 
RA--RTLGEAEARVTMAA-GGFFVEASCALLDIAEQHLFEAEG-YKSFRAYIL-----ERKSLGFGYRQARAW--VAAARFIRSLPPSMPVPQYERLV---------------RPLTRCEPGVARL------AWERVL-RYHNEEGRRMTAELVVSCI-Q--EVGTASTVAQSDSED VOLCADRAFT_106408_Vcarteri_302845993 EM--ALLLRAEQVVQGS--GSQFRQMANSLLDIQERRLYSCLG-FGSFVQYVS-----ESGRIDIAPRYAQAL--VAAAHFLRLLSATDVVPNSECQV---------------RPLTALPPCDALG------AWRLAV-AHSVAESRLLSGRLVEQCA-L--EVTGRTGSDNSSSDM VOLCADRAFT_104970_Vcarteri_302839284 -M--ARLLRAEQLVQGC--GSQFRQMANSLLDIQERRLYSCLG-FGSFVQYVS-----ESGRIDIAPRYAQAL--VAAARFLRLLSATDVVPNSGRQV---------------RPLTALPPCDALG------AWRLAV-ARSVAESRLLGGRLVEQCA-L--EVTGRSRLLRGTGSD VOLCADRAFT_104840_Vcarteri_302838546 EG--NALASAERRVPMAA-LGFFAEASLALLDIAEGKLYSFAG-HASFKAYIR---R-SLAVLGFGLHQARNL--IAAARVIRNLPAGVARPSNQMQV---------------RPFVGCHPDVQLK------AWVLALGRAGGLESARVSGRLVRECL-R--EVKGSLADDVLAGSA BATDEDRAFT_85509_Bdendrobatidis_575474002 HL--ERLHNLEKAITEHLSTGKFFIVAAALRCIEEERLF-----YPERTVYSY-----AKSRFGFSRRTTNTY--LCSSYVYESITEDKTLPIPVNIS-----------HV--RSLHKYPPEVRRQ------IW-----KQLNDSGLTITEENVVAM-----TIKYETGVSFTELNN BATDEDRAFT_90358_Bdendrobatidis_575483232 EY--ARIPLEQK-------QKQFVETVTAIRAIICRKLYRDEG-YDSLQTYFL-------SKWDVSRAQVYRL--MDCWPILTTICKAHVIPYKERLC---------------RTLKQCTRSPSELVL----LWDNVI---GSCDPAFVSPKFIFDVW-D--RLQSTLHTFLDQDTE VOLCADRAFT_108630_Vcarteri_302856103 EM--ARLLRAEQLVQGC--GSQFRQMANSLLDIQERRLYSCLG-FGSFVQYVS-----ESGRIDIAPRYAQAL--VAAARFLRLLSATDVVPNSGRQPSHDSWVGGSLSNVHWRSQAGAVCCGVLAVTTAAATWMSAI-ASLVRESIWFEVEGVGMCA-CFSPVRVRSGSPH----- MVEG_03971_Mverticillata_672826234 DS--SVIRGSSKTSHQSK-PTLFETTVLAFRDIIVRRLWRSDNRFQSESEFCK-------HHWEIQRSRRDEL--IECAELLVELSRIPCRPTSESVC---------------RVLA---------------NWSA---KYQHEQPSLELGKTTVSV-----KIWTKVLDE------ VOLCADRAFT_106473_Vcarteri_302846292 RRPMRVRGRGLAWL-------LAVVIKCWARGEHGRWNWRRWG-LPPLVHQVVVVAKDETRSEHGGLTSRPMWPMPGSRRLQVAIGRSMPVPQYERLV---------------RPLTRCEPGVARL------AWERVL-RYHNEEGRRMTAELVVSCI-Q--EVGTASTVAQSDSED VOLCADRAFT_100579_Vcarteri_302855367 
------------------------------------------------------------------------------------------MPNSERQV---------------RPLTALPPCDALG------AWRLAV-ARSEAESRLLSGRLVEQCA-L--EVTGRSRLLRGTGSD consensus/100% ...........................................................................................P.....s...............Rsb.................W..........p......p...........l............. consensus/95% ...........................................................................................P.....s...............Rsb.................W..........p......p...........l............. consensus/90% ......b...............h......h..b.....a........b..b...........b.........h....s..h...hs...s.P...pbs...............Rsh..h..............W..........ps.....c....s......l............. consensus/85% p.....l...b...........F...s.uh.sI...+Lap..s...sb..ah..........h.hu.p....h..hss..hh..hs.s.s.Pp..pbs...............RsLs.h.....b.......sW..s.......ps..hs.c.l..sh....pl.s.........p. consensus/80% p.....l...b...........F...s.uh.sI...+Lap..s...sb..ah..........h.hu.p....h..hss..hh..hs.s.s.Pp..pbs...............RsLs.h.....b.......sW..s.......ps..hs.c.l..sh....pl.s.........p. consensus/75% c.....l..sb..l......sbF.phs.uhbsI.pb+Lap..G.a.Sh..Yh.........phshu.pbs..h..lsuschlp.ls.s.shPp.bpbs...............RsLs.h..ss.b.......sW..s.......ps.bhs.chV.psh....cl.sp......p.p. consensus/70% c.....L..sEb.l..s...sbF.phu.ulbsI.-b+Lap..G.asSh..Yl......pp.phshu.+bsb.h..luus+hlc.ls.s.slPpsERbV...............RsLs.h.ssssb.......sW.bsl....sppu.bhoscLV.psh....cl.sps.s...s.p. Alignment with prokaryotic homologs. 
RES R-P--ED-LEGCEKL---IEGSSLK---NNF-------LSIGQALVTIND-----RKLYK--D---------SG-YT--S-FTQYIE-----QKGDF-GF-GPRQALRLL--AATRLVRNF-----------------------------------------------------------PPN-I------------------------------------------------A-------LP-S-------------SERQV---------------RAL----------VG--L--EQ-AE-A---IE------VW--S-KAT-KISQD-TN-----T-PLTHRLVESVLG ALIGN ------H-HHHHHHH---HHHHHHH---HHH-------HHHHHHHHHHHH-------HHH--H-------------------EHEHH--------HH-HH-HHHHHHHHH--HHHHHHHHH-----------------------------------------------------------HH------------------------------------------------------------------------------------------------------------------------------HH------HH--H-HHH-HHH--------------HHHHHH---- HMM -----HH-HHHHHHH---HHHHHHH---HHH-------HHHHHHHHHHHH---------E---------------------HHHHHH-----HHHHH-----HHHHHHHH--HHHHHHHH---------------------------------------------------------------------------------------------------------------------------------------------EE---------------EEE----------E---------H--H---HH------HH--H-HHH-HH---------------HHHHHHHH-- FREQ -----HH-HHHHHHH---HHHHHHH---HHH-------HHHHHHHHHHHH-----HHHHH-------------------H-HHHHHH-----HHH-------HHHHHHHH--HHHHHHHH-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------HH-H---HH------HH--H-HHH-HHH---------------EEEEEEE-- PSSM -----HH-HHHHHHH---HHHHHHH---HHH-------HHHHHHHHHHHH-----H--H-----------------------HHHHH-----HH--------HHHHHHHH--HHHHHHHHH-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------HH------HH--H-HHH-HHH---------------HHHHHH--- FINAL 
-----HH-HHHHHHH---HHHHHHH---HHH-------HHHHHHHHHHHH-----HHHHH---------------------HHHHHH-----HHH-------HHHHHHHH--HHHHHHHHH------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------HH-H---HH------HH--H-HHH-HHH---------------HHHHHHH-- CHLREDRAFT_191158_Creinhardtii_159472462 R-P--ED-LEGCEKL---IEGSSLK---NNF-------LSIGQALVTIND-----RKLYK--D---------SG-YT--S-FTQYIE-----QKGDF-GF-GPRQALRLL--AATRLVRNF-----------------------------------------------------------PPN-I------------------------------------------------A-------LP-S-------------SERQV---------------RAL----------VG--L--EQ-AE-A---IE------VW--S-KAT-KISQD-TN-----T-PLTHRLVESVLG VOLCADRAFT_118198_Vcarteri_302842945 Q-P--ED-LEGCEKL---IEGSSLK---NNF-------LRIGQALVTIND-----RKLYK--R---------AG-SS--S-FTQYIE-----QKSDF-GF-GPRQALRLL--AATRLVRNF-----------------------------------------------------------PPT-I------------------------------------------------A-------LP-T-------------SERQV---------------RAL----------VG--L--EQ-QQ-A---VK------VW--V-KAN-LIAQE-TG-----V-PLTHRLVESVLG VOLCADRAFT_91459_Vcarteri_302838997 E-G--NA-LASAERR---VARAA-P---AYF-------AEASLAMLEIAE-----GKLYS--F---------AG-HA--S-FQDYIR---K-SSAVL-GF-GLRQARNLI--AAARVIRNL-----------------------------------------------------------PAD-V------------------------------------------------A-------RP-S-------------NERQV---------------RPL----------VG--C--HP-DV-Q---LK------VW--V-LALERADGD-TR-----H-EDALSLQYGGLP VOLCADRAFT_108225_Vcarteri_302854263 R-A--RT-LGEAEAR---VTMAA-G---GFF-------VEASCALLDIAE-----QHLFE--A---------EG-YK--S-FRAYIL-----ERKSL-GF-GYRQARAWV--AAARFIRSL-----------------------------------------------------------PPS-M------------------------------------------------P-------VP-Q-------------YERLV---------------RPL----------TR--C--EP-GV-A---RL------AW--E-RVL-RYHNE-EG-----R-RMTAELVVSCIQ VOLCADRAFT_105875_Vcarteri_302843631 
R-A--RT-LGEAEAR---VTMAA-G---GFF-------VEASCALLDIAE-----QHLFE--A---------EG-YK--S-FRAYIL-----ERKSL-GF-GYRQARAWV--AAARFIRSL-----------------------------------------------------------PPS-M------------------------------------------------P-------VP-Q-------------YERLV---------------RPL----------TR--C--EP-GV-A---RL------AW--E-RVL-RYHNE-EG-----R-RMTAELVVSCIQ VOLCADRAFT_106408_Vcarteri_302845993 E-M--AL-LLRAEQV---VQGS--G---SQF-------RQMANSLLDIQE-----RRLYS--C---------LG-FG--S-FVQYVS-----ESGRI-DI-APRYAQALV--AAAHFLRLL-----------------------------------------------------------SAT-D------------------------------------------------V-------VP-N-------------SECQV---------------RPL----------TA--L--PP-CD-A---LG------AW--R-LAV-AHSVA-ES-----R-LLSGRLVEQCAL VOLCADRAFT_104970_Vcarteri_302839284 --M--AR-LLRAEQL---VQGC--G---SQF-------RQMANSLLDIQE-----RRLYS--C---------LG-FG--S-FVQYVS-----ESGRI-DI-APRYAQALV--AAARFLRLL-----------------------------------------------------------SAT-D------------------------------------------------V-------VP-N-------------SGRQV---------------RPL----------TA--L--PP-CD-A---LG------AW--R-LAV-ARSVA-ES-----R-LLGGRLVEQCAL VOLCADRAFT_104840_Vcarteri_302838546 E-G--NA-LASAERR---VPMAA-L---GFF-------AEASLALLDIAE-----GKLYS--F---------AG-HA--S-FKAYIR---R-SLAVL-GF-GLHQARNLI--AAARVIRNL-----------------------------------------------------------PAG-V------------------------------------------------A-------RP-S-------------NQMQV---------------RPF----------VG--C--HP-DV-Q---LK------AW--V-LALGRAGGL-ES-----A-RVSGRLVRECLR BATDEDRAFT_85509_Bdendrobatidis_575474002 H-L--ER-LHNLEKA---ITEHLST---GKF-------FIVAAALRCIEE-----ERLF----------------YP--E-RTVYSY-----AKSRF-GF-SRRTTNTYL--CSSYVYESI-----------------------------------------------------------TED-K------------------------------------------------T-------LP-I-------------PVNIS-----------HV--RSL----------HK--Y--PP-EV-R---RQ------IW--------KQLND-SG-----L-TITEENVVAMTI BATDEDRAFT_90358_Bdendrobatidis_575483232 
E-Y--AR-IPLEQK----------Q---KQF-------VETVTAIRAIIC-----RKLYR--D---------EG-YD--S-LQTYFL-------SKW-DV-SRAQVYRLM--DCWPILTTI-----------------------------------------------------------CKA-H------------------------------------------------V-------IP-Y-------------KERLC---------------RTL----------KQ--C--TR-SP-S---EL---VL-LW--D-NVI---GSC-DP-----A-FVSPKFIFDVWD VOLCADRAFT_108630_Vcarteri_302856103 E-M--AR-LLRAEQL---VQGC--G---SQF-------RQMANSLLDIQE-----RRLYS--C---------LG-FG--S-FVQYVS-----ESGRI-DI-APRYAQALV--AAARFLRLL-----------------------------------------------------------SAT-D------------------------------------------------V-------VP-N-------------SGRQPSHDSWVGGSLSNVHWRSQ----------AG--A--VC-CG-V---LAVTTAAATW--M-SAI-ASLVR-ES-----I-WFEVEGVGMCAC MVEG_03971_Mverticillata_672826234 D-S--SV-IRGSSKT---SHQSK-P---TLF-------ETTVLAFRDIIV-----RRLWR--S---------DNRFQ--S-ESEFCK-------HHW-EI-QRSRRDELI--ECAELLVEL-----------------------------------------------------------SRI-P------------------------------------------------C-------RP-T-------------SESVC---------------RVL----------A------------------------NW--S-AKY--QHEQ-PS-----L-ELGKTTVSVKIW VOLCADRAFT_106473_Vcarteri_302846292 R-RPMRV-RGRGLAW---L----------LA-------VVIKCWARGEHG-----RWNWR--R---------WG-LP--P-LVHQVVVVAKDETRSE-HG-GLTSRPMWPMPGSRRLQVAI-----------------------------------------------------------GRS-M------------------------------------------------P-------VP-Q-------------YERLV---------------RPL----------TR--C--EP-GV-A---RL------AW--E-RVL-RYHNE-EG-----R-RMTAELVVSCIQ -_Fischerella_sp_PCC_9431_737132827 L-T--EE-EQCLRLH---LERKV-E---RAF-------YEAGKALRELRD-----RKLYR--S---------T--HQ--T-FEEYCR-------DRF-GY-SRRHPYLLM--EAAVIVDNL-SE--------------------------------------------------------KCD-P---------------------------------MDH------------I-------PP-T-------------SERQV---------------RPL----------TK--L--DP-DT-Q---CE------AW--Q-QAV-SEAGG--------K-VPSSRIVKDIVQ -_Nostoc_sp_PCC_7120_764953510 
L-T--EQ-EQSDRLF---LKRKV-E---RAF-------FEAGKALMELRD-----RRLYR--S---------T--HA--T-FEEYCK-------DRF-GY-NRSRSYQLI--DAAIVVDNL-Q---------------------------------------------------------KCP-Q---------------------------------FVD------------I-------LP-T-------------AEGQV---------------RPL----------TK--L--EP-QE-Q---QE------AW--P-TAV-EETGG--------K-VPTGRIVKDVVQ -_Nostoc_punctiforme_501381481 L-T--EE-EQRDRLH---LERRV-E---RAF-------FEAGKALAELRD-----RRLYR--S---------S--HR--T-FEEYCR-------DRF-GH-SRRQSYLLM--DAAVIFDNL-EQ--------------------------------------------------------KCD-R---------------------------------SDH------------I-------LP-T-------------NEWQV---------------RPL----------SK--L--DP-DI-Q---PE------AW--E-QAV-ESANG--------K-VPSHRIVKDVVQ -_[Scytonema_hofmanni]_UTEX_B_1581_740464136 L-S--DA-EAVELRR---LEAKV-ELGLKAF-------WEIGQALSQIRD-----KRLYR--E---------T--HK--T-FEEYCI-------TRW-EM-SRRSAYQLI--GAAIVVENV----R------------------------------------------------------NCA-Q------------------------------------------------I-------LP-L-------------NEAQA---------------RPL----------VA--L--PP-EQ-Q---RE------AW--K-TAV-STAAN--------G-KVTALHVAQVAR -_Chroococcales_cyanobacterium_CENA595_769921346 L-S--DD-ELSDRHR---LELRV-E---RVF-------YEAGTALRELRD-----RKLYR--D---------T--HR--T-FEDYCK-------NRF-GY-HRRHCYQLI--DAADVVENL------C----ANS---------------------------------------------AQK-K----------------S-GTS------------GAH------------I-------LP-T-------------NEYQV---------------RPL----------TK--L--EP-AQ-Q---IM------IW--Q-QAV-ESAGG--------K-APSGRIVKSIVE alr7299_Nostoc_sp_PCC_7120_17135837 L-T--EQ-EQSDRLF---LKRKV-E---RAF-------FEAGKALMELRD-----RRLYR--S---------T--HA--T-FEEYCK-------DRF-GY-NRSRSYQLI--DAAIVVDNL-Q---------------------------------------------------------KCP-Q---------------------------------FVD------------I-------LP-T-------------AEGQV---------------RPL----------TK--L--EP-QE-Q---QE------AW--P-TAV-EETGG--------K-VPTGRIVKDVVQ Sta7437_4876_Stanieria_cyanosphaera_PCC_7437_428272365 
L-T--DE-EQQERLH---LERQV-E---RSF-------YVAGKALQQLRD-----RRLYR--S---------T--HS--T-FEDYCR-------ERF-GY-SRRHPYLLI--DAAIVVDNL-SQ--------------------------------------------------------KCD-P---------------------------------LDH------------I-------LP-T-------------SERQV---------------RPL----------SK--L--DR-YQ-Q---VE------VW--Q-QAV-EEAGG--------V-VPSSRIVRDLVQ -_Tolypothrix_campylonemoides_751574024 L-T--DG-ELRLRLE---LERQV-E---SAF-------YEAGKALRELRD-----KRLYR--S---------T--HK--T-FEEYCK-------DRF-GF-ERRHPYRLI--DGADIVDNL-IQ--------------------------------------------------------MCP-N---------------------------------GTQ------------I-------LP-T-------------SERQV---------------RPL----------TK--L--ER-EE-Q---RQ------AW--Q-MAL-EQAGG--------K-VPTGNIVKDIVQ -_Microcystis_aeruginosa_501223295 L-S--EE-EVRDRER---LERTV-E---RAF-------YQAGSALQELRD-----RRLYR--D---------G--YD--S-FEDYCR-------GRF-GH-SRQKANYLI--TGAAIYRTL-----------------------------------------------------------SAA-N----------------------------------CP------------L-------LP-S-------------SEYQV---------------RPL----------AV--L--TP-QQ-Q---PT------VW--N-EAV-AVAGG--------R-TPDHRIVRETVG -_Fischerella_sp_PCC_9431_737134277 L-T--LE-EQRDRLH---LERKV-E---RAF-------YEAGKALRELRD-----RRLYR--S---------T--HK--T-FEEYCR-------DRF-GH-SRQKSNYLI--AAAGVFDNL-----------------------------------------------------------TTI-G-----CQNLPSED---L-TTN------------GSQ------------I-------LP-T-------------NERQV---------------RPL----------TQ--L--EP-DQ-Q---RE------VW--Q-QAV-TEAGG--------K-VPSGRIVKDIVQ -_Cylindrospermum_stagnale_505141377 L-T--EE-EERDRLH---LERQV-E---RAF-------YEAGKALRQLRD-----RKLYR--N---------T--HK--T-FEEYCK-------DRF-SY-NRSRSYQLI--DAAFVVDNL-E---------------------------------------------------------ECP-Q---------------------------------IVD------------I-------LP-T-------------AEGQV---------------RPL----------TK--L--EA-EE-Q---VT------CW--Q-EAV-ESAGG--------K-VPSGRIVKSIVD -_Scytonema_millei_748136445 
L-S--ET-EAAERHR---LELRV-E---RAF-------YEAGRALRELKQ-----KKLYR--S---------T--HN--T-FEDYCI-------ERF-GF-SRRHPYRLI--EAASVFENL------C----PIG---------------------------------------------TQN-D----------------L-PTN------------ERQ------------I-------LP-T-------------SERQI---------------RDL----------VS--L--EP-QQ-Q---RE------IW--Q-SAV-LIANG--------K-VPSSRIVKGIVE Npun_BR102_Nostoc_punctiforme_PCC_73102_186469442 L-T--EA-EERDRLS---LERKV-E---RAF-------FEAGKALMELRD-----RRLYR--S---------T--HK--T-FEEYCR-------FRF-AY-TYRHVNYLI--AGSVIVDNI------------------------------K----------------------------MGT-N-----SSQNEKSH--EM-GTN------------SSQ------------I-------LP-T-------------SEVQV---------------RPL----------AK--L--EP-QQ-Q---PE------AW--Q-QAV-EQAEG--------K-VPSGRIVKDVVQ -_Nostoc_punctiforme_753811080 L-T--EA-EERDRLS---LERKV-E---RAF-------FEAGKALMELRD-----RRLYR--S---------T--HK--T-FEEYCR-------FRF-AY-TYRHVNYLI--AGSVIVDNI------------------------------K----------------------------MGT-N-----SSQNEKSH--EM-GTN------------SSQ------------I-------LP-T-------------SEVQV---------------RPL----------AK--L--EP-QQ-Q---PE------AW--Q-QAV-EQAEG--------K-VPSGRIVKDVVQ CWATWH0402_1907_Crocosphaera_watsonii_WH_0402_543531309 L-T--HS-EERDRLH---LERKV-E---RAF-------YEAGKALQELRD-----RRLYR--S---------T--HK--T-FERYCR-------ERF-GY-NRSRSYQLI--DAGMVVDNL-Q---------------------------------------------------------KCP-Q---------------------------------IVD------------I-------FP-T-------------KESLV---------------RPL----------AS--L--NP-SQ-Q---VE------VW--T-KAV-ELVNG--------Q-VPPARVVKNIVD -_Scytonema_millei_748134961 L-T--ED-EERDRHR---LELRV-E---QAF-------YQAGAALRELKE-----RRLYR--S---------T--HS--T-FEEYCQ-------DRF-GY-HRRHSYQLI--DAAVVFENL------C----AIG---------------------------------------------AQK-N----------------A-DTR------------GAR------------I-------LP-T-------------SERQC---------------RPL----------TQ--L--EP-AQ-Q---VK------AW--Q-QAI-ELTGG--------K-APSGRTVKGIVE -_Pleurocapsa_sp_PCC_7319_518335686 
L-T--TE-EEGDRLH---LERKV-E---RAF-------YEAGMALMQLRD-----RRLYR--S---------T--HA--T-FEDYCR-------DRF-DY-VRRRSYQLI--DAAKIYNNL-SE--------------------------------------------------------KCV-Q---------------------------------FVH------------I-------LP-T-------------REGQV---------------RPM----------SQ--L--NA-EE-Q---VL------AW--E-TAV-EEAGG--------K-VPTGKIVKDVVQ -_Anabaena_cylindrica_505030514 L-T--EE-EERDRFR---LERQV-E---RAF-------SAAGKALRQLRD-----RKLYR--S---------T--HK--T-FEEYCK-------DRF-SY-NRSRSYQLI--DAADVVDNL-E---------------------------------------------------------ECP-Q---------------------------------IVD------------I-------LP-T-------------AEGQV---------------RPL----------TK--L--EA-EE-Q---VS------CW--Q-EAV-AAVGG--------K-VPSGRIVKSIVD -_Stanieria_cyanosphaera_753865019 L-T--DE-EQQERLH---LERQV-E---RSF-------YVAGKALQQLRD-----RRLYR--S---------T--HS--T-FEDYCR-------ERF-GY-SRRHPYLLI--DAAIVVDNL-SQ--------------------------------------------------------KCD-P---------------------------------LDH------------I-------LP-T-------------SERQV---------------RPL----------SK--L--DR-YQ-Q---VE------VW--Q-QAV-EEAGG--------V-VPSSRIVRDLVQ -_Calothrix_sp_PCC_7103_737188140 L-S--YD-EEQERII---LEKQV-E---RSF-------YVAGRALRILRD-----KKLYR--N---------S--HK--N-FEEYCQ-------YKF-AF-TRRNVNYLI--ASSQVVDNL--------------------------------------------------AGTNI----EGT-E--FL-GTNCSQ-------------------------------------I-------LP-T-------------NECQV---------------RPL----------TK--L--EP-SE-Q---RE------CW--H-QAV-EAAGN--------K-VPSGRQVKDIVT -_Synechocystis_sp_PCC_7509_740179759 L-T--ED-EEKERHW---LERKV-E---LAF-------VEAGTALRRLRD-----ERLYR--S---------T--HK--T-FEAYCR-------DRF-GF-TRRRPYQLI--DAANVIENL------C----TNG---------------------------------------------TQ---------------------------------------------------I-------LP-S-------------SERQI---------------RDL----------IE--L--NP-KE-Q---CK------VW--Q-QAV-DESGG--------K-VPSGRIVKGIVE DA73_0201705_Tolypothrix_bouteillei_VB521301_744453553 
L-T--PE-EQKERLR---LERVI-E---RSF-------YEAGKALRELRD-----RRLYR--S---------T--HK--T-FEEYCK-------NRF-GY-NRSRSYQFI--DAATVVDNL-Q---------------------------------------------------------KCP-Q---------------------------------FVD------------I-------FP-T-------------AESQV---------------RPL----------VP--L--ES-DQ-Q---WE------AW--Q-LSV-EAAGK--------K-VPSARIVKDIVE -_Scytonema_millei_748135946 L-S--EP-EAAERHR---LEQKV-E---RAF-------YEAALALRELHE-----RKLYR--S---------T--HS--R-FDHYCR-------DRF-GF-SQQNADLLI--RAAGVIDNL-----------------------------------------------------------KIT-T----------------------I----------GCN------------F-------XP-T-------------NERQV---------------RPL----------TK--L--EP-NE-Q---RQ------VW--Q-QAI-EAAGN--------R-VPSGRVVKDIVV -_Anabaena_variabilis_499635872 L-T--PE-EQSDRLL---LERKV-E---RAF-------FEAGKALAELRD-----RRLYR--S---------T--HR--T-FEEYCK-------DRF-SY-THRHVNYLI--AASLIVDNI------------------------------I----------------------------MGT-N-----SSQIEEAQADEM-GTN------------SSQ------------I-------FP-I-------------SEVQV---------------RPL----------SK--L--EP-QQ-Q---RK------AW--Q-DAV-QEAGD--------K-VPTGRIVKDVVQ -_Tolypothrix_sp_PCC_7601_797212730 I-S--ET-EAQELRR---LEATV-ERGLRAF-------WEIGQALRQIQD-----QRLYR--Q---------D--YK--N-FEEYCI-------TRW-EM-SRRSAYQLI--EAASVYENV----R------------------------------------------------------HGA-Q------------------------------------------------I-------LP-A-------------NERQA---------------RPL----------TV--L--PP-EK-Q---RE------AW--N-KAV-STAPS--------G-KVTSVHVAQVAK -_Tolypothrix_campylonemoides_751574204 L-S--EA-EAQELRK---LEATV-ERCLKAF-------WQIGQALRGIRD-----KHLYR--Q---------Q--YK--T-FEEYCI-------TRW-EM-SRRSAYQLI--EAASVYENV----R------------------------------------------------------HGA-Q------------------------------------------------I-------LP-A-------------NERQA---------------RPL----------VA--L--SP-EQ-Q---RE------AW--A-KAV-STAPS--------G-KVTAVHVTQVAR -_Fischerella_muscicola_515347403 
L-T--EE-EKADRHR---LELKI-E---RAF-------YEAGCALKELWE-----RRLYR--S---------T--HK--T-FEEYCR-------DRF-NY-SRDTAYLKM--AAAVVYDNI---------------QKF-----------------------------------------LPT-I-----GRQTP----------------------------------------------MP-T-------------NERQL---------------RDL----------AKANL--EP-EL-Q---AA------TW--L-QGV-EEAGG--------K-VPSGRIIKGIVE -_Oscillatoria_nigro-viridis_504992580 L-T--DA-EALELSS---LEATV-ERSLKAF-------WEIGQALRQIRD-----RRLYR--Q---------D--FS--T-FEDYCT-------NRW-EM-SRRWAYQLI--EAATVYENV----R------------------------------------------------------HGA-P------------------------------------------------I-------LP-A-------------NERQV---------------RPL----------TA--L--PS-QE-Q---PR------AW--A-QAV-STAPN--------G-KLTAFHVARVVE -_Tolypothrix_sp_PCC_7601_797208446 L-T--PE-EQSDRLH---LERKV-E---RAF-------FEAGKALAELRD-----RRLYR--S---------T--HR--T-FEDYCR-------DRF-GH-SRQQSNYLI--AAAGVYENL-----------------------------------------------------------TTI-G-----CQNVENEN---L-TTI------------CCQ------------I-------LP-T-------------NERQV---------------RPL----------TK--L--EP-QQ-Q---QE------VW--Q-QAV-EEAGG--------K-VPTGKIVKDVVQ -_Stanieria_cyanosphaera_505024902 L-T--VE-EESDRYS---LERKV-E---RAF-------YEAGMALMELRD-----RKLYR--S---------T--HA--T-FEDYCR-------DRF-DY-TRRRPYQLI--EAALIYDNL-SE--------------------------------------------------------KCV-K---------------------------------FLH------------I-------LP-T-------------KEGQV---------------QPL----------TQ--L--EW-ES-Q---PS------AW--E-TAV-EEAGG--------K-VPTGRIVKDVVR -_Anabaena_sp_PCC_7108_515515560 L-T--EE-EERDRFR---LERQV-E---RAF-------SAAGKALRELRD-----RKLYR--N---------S--HQ--T-FEEYCK-------DRF-SY-NRSRSYQLI--DAADVVDNL-E---------------------------------------------------------ECP-Q---------------------------------FVD------------I-------LP-T-------------AEGQV---------------RPL----------TK--L--EA-EE-Q---VS------CW--Q-EAV-EAAGG--------K-VPSGRIVKSIVD FDUTEX481_04373_Tolypothrix_sp_PCC_7601_407266820 
I-S--ET-EAQELRR---LEATV-ERGLRAF-------WEIGQALRQIQD-----QRLYR--Q---------D--YK--N-FEEYCI-------TRW-EM-SRRSAYQLI--EAASVYENV----R------------------------------------------------------HGA-Q------------------------------------------------I-------LP-A-------------NERQA---------------RPL----------TV--L--PP-EK-Q---RE------AW--N-KAV-STAPS--------G-KVTSVHVAQVAK BegalDRAFT_1574_Beggiatoa_alba_B18LD_386428626 L-S--PE-EFQALAQ---HEAIV-KAGLQTF-------YDIGEALLTIRD-----KRLYR--A---------E--FN--S-FEEYCQ-------EKW-GF-VRRQADRLI--QAFEVTENL----R------------------------------------------------------PVG-L------------------------------------------------S-------MP-H-------------NEAQA---------------RPL----------VK--L--EP-EL-Q---RQ------AW--Q-KAV-EMAPD--------G-KPTSSLVKKIVK MICAB_900014_Microcystis_aeruginosa_PCC_9717_389714985 L-S--EE-EVRDRER---LERTV-E---RAF-------YQAGSALQELRD-----RRLYR--D---------G--YD--S-FEDYCR-------GRF-GH-SRQKANYLI--TGAAIYRTL-----------------------------------------------------------SAA-N----------------------------------CP------------L-------LP-S-------------SEYQV---------------RPL----------AV--L--AP-QQ-Q---PT------VW--N-EAV-AEAGG--------R-TPDHRVVRETVG -_Calothrix_sp_PCC_7103_737188608 L-S--EA-EVLELES---LESTV-QRGLRAF-------WEIGQALRILRD-----KRLYR--Q---------C--YD--T-FEEYCI-------NRW-EM-SRRSAYYLI--DAAAVYENV----N------------------------------------------------------HGS-Q------------------------------------------------I-------LP-A-------------NERQA---------------RPL----------TA--L--TP-SE-Q---QK------VW--Q-QAV-STAPN--------G-KITATHIIQVVK -_Crocosphaera_watsonii_737857352 L-S--EA-EQSEKKR---LEGVV-S---EAV-------WNAGKALRELRD-----KKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--AGANIYHNL-----------------------------------------------------------TTN-R----------------------------------CE------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KSV-EIANG--------K-VPTHRIVKQVVR -_Chlorogloeopsis_fritschii_515383623 
L-T--ED-EQRDRLY---LERKI-E---RAF-------FEAGKALMELRD-----HRLYR--S---------T--HK--T-FEEYCK-------DRF-GF-ERRHPYRLI--EAAVVVDNL-MQ--------------------------------------------------------MCP-NGTQIEANSNDEQKQSIG-TQIEIESSEQQMRPNGTQ------------I-------LP-T-------------SERQV---------------RPL----------TE--L--EP-SQ-Q---QE------VW--Q-TAV-QEAGG--------K-VPTGRIVKDVVQ -_Crocosphaera_watsonii_494519775 L-S--EA-EQSEKKR---LEGVV-S---EAV-------WNAGKALRELRD-----KKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--AGANIYHNL-----------------------------------------------------------TTN-R----------------------------------CE------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KSV-EIANG--------K-VPTHRIVKQVVR DA73_0214905_Tolypothrix_bouteillei_VB521301_744450902 L-T--PE-ELRERLQ---LERKV-E---RAF-------YEAGKALMELRN-----QRLYR--S---------T--HK--T-FEEYCR-------DRF-GH-TRQKSNYLI--AAADVFENL-----------------------------------------------------------TTS-G-----CQ-----------------------------------------I-------LP-T-------------SERQI---------------RPL----------TK--L--EP-VK-Q---PE------AW--Q-LSI-EAADG--------K-SPPSRIVNDIVE -_Crocosphaera_watsonii_757158775 L-T--EV-ELAEKQR---LEAVI-I---GAV-------WSAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRSLSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-VPTHRIVKQVVR N44_02315_Microcystis_aeruginosa_NIES-44_718251661 L-T--PE-EERQRLF---WERKV-E---RAF-------YEAGTALKELRD-----RRLYR--S---------T--HK--T-FEEYVQ-------DRF-GM-KRAHSYRLI--DAAAVVDNL-F---------------------------------------PLCLQIGDNLSEMSPQE-MSP-N-----WRQNSTGE---K-LTN---------------------------P-------VP-T-------------NESQC---------------RPL----------TQ--L--EP-DQ-Q---RE------VW--Q-QAV-EAAGG--------K-VPSGRIVKDIVQ -_Mastigocladopsis_repens_703170672 
L-T--DG-ELRLRLE---LERQV-E---SAF-------YEAGKALRELRD-----KRLYR--S---------T--HK--T-FEEYCK-------DRF-GF-ERRHPYRLI--DGADIVDNL-IQ--------------------------------------------------------MCP-N---------------------------------GTQ------------I-------LP-T-------------SERQV---------------RPL----------TK--L--ER-EE-Q---RQ------AW--Q-MAV-EQAGG--------K-VPTGNIVKDIVQ -_Crocosphaera_watsonii_546222413 L-T--EV-ELAEKQR---LEAVI-I---GAV-------WSAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-VPTHRIVKQVVR -_Microcystis_aeruginosa_763118968 L-S--EE-EVRDRER---LERTV-E---RAF-------YQAGSALQELRD-----RRLYR--D---------G--YD--S-FEDYCR-------GRF-GH-SRQKANYLI--TGAAIYRTL-----------------------------------------------------------SAA-N----------------------------------CP------------L-------LP-S-------------SEYQV---------------RPL----------AV--L--AP-QQ-Q---PT------VW--N-EAV-AEAGG--------R-TPDHRVVRETVG Sta7437_4607_Stanieria_cyanosphaera_PCC_7437_428272125 L-S--LE-DERDKLK---LEREV-E---RAF-------YRAGCALKELRD-----RRLYR--S---------T--HK--T-FKEYCQ-------DRF-GF-TRRRSDYLI--GAAEVVDNL---------------------------------------------------------------S-----GEPKPKRE-------P------------LVL------------I-------LP-T-------------SERQC---------------RPL----------TK--L--EP-EQ-Q---RE------IW--R-EAV-ESSKG--------K-VPSGKVVADLVA Cyan7822_6833_Cyanothece_sp_PCC_7822_306986606 L-S--AD-EEKELLR---LERVV-E---RSF-------YEAGSALRKIRA-----LRLYR--A---------R--FN--S-FEEYTQ-------ERF-GF-TRRQPYYLI--EAANVVDNL-----------------------------------------------------KS----ECE-P--LV-------------------------------H------------I-------LP-S-------------SERQV---------------RPL----------TK--L--NA-TE-Q---RS------VW--N-DAV-SRAQG--------K-VPSGRIVTEALE -_Crocosphaera_watsonii_546206668 
L-T--EV-ELAEKQR---LEAVI-I---GAV-------WSAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRSLSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-VPTHRIVKQVVR CwatDRAFT_0109_Crocosphaera_watsonii_WH_8501_67852287 L-T--EV-ELAEKQR---LEAVI-I---GAV-------WSAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRSLSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-VPTHRIVKQVVR -_Crocosphaera_watsonii_494523801 L-T--EV-ELAEKQR---LEAVI-I---GAV-------WSAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-VPTHRIVKQVVR -_Microcystis_aeruginosa_779871805 L-T--PE-EERQRLF---WERKV-E---RAF-------YEAGTALKELRD-----RRLYR--S---------T--HK--T-FEEYVQ-------DRF-GM-KRAHSYRLI--DAAAVVDNL-F---------------------------------------PLCLQIGDNLSEMSPQE-MSP-N-----WRQNSTGE---K-LTN---------------------------P-------VP-T-------------NESQC---------------RPL----------TQ--L--EP-DQ-Q---RE------VW--Q-QAV-EAAGG--------K-VPSGRIVKDIVQ DA73_0203765_Tolypothrix_bouteillei_VB521301_744452929 L-K--EE-ELRLRLH---LERKV-E---RSF-------YEAGKALMELRD-----KRLYR--S---------T--HK--T-FEEYCR-------DRF-SH-SRQKSNYLI--AAADVFENL-----------------------------------------------------------TTI-R-----CQNSSSED--------------------DLQ------------I-------LP-S-------------SEYQI---------------RPL----------TK--L--EP-EQ-Q---LQ------AW--Q-ISV-EEAGG--------V-APAARIVKDVVQ -_Cylindrospermum_stagnale_505141386 
E-A--SA-IALELDR---LEGRI-EKGLRAF-------WEIGQSLGQIRD-----KQLYR--Q---------T--YK--T-FEEYCL-------NRW-EM-SRRSAYRLI--EAASVYENV----T------------------------------------------------------HGS-Q-IPE-NV----------------------------------THGSHFKI-------LP-A-------------NERQV---------------RPL----------AT--L--TP-EQ-Q---RQ------AW--A-KAV-STAPG--------G-KVTSGHVAQVAR -_Cyanothece_497232044 L-T--DS-EQKERLR---LERQV-E---RAF-------YVAGCALAKLKT-----DKLYR--S---------T--HS--T-FEDYCQ-------DRF-SF-TRRHVNYLI--AAAGVVDNL--------------------------------------------------K----------------M-GTNCSQNN---------------EDA--ENL------------I-------LP-T-------------TASQC---------------RPL----------TA--L--EP-LK-Q---VE------AW--S-EAI-TQAGG--------K-VPPARIVQEVVQ -_Calothrix_sp_PCC_7103_518327692 L-T--IS-EQEERDY---LEKLV-E---RAF-------YSAGKALQTLRD-----KKLYR--S---------T--HK--S-FESYCL-------DRF-NY-NSSRSYQLM--DAADVVDNL-K---------------------------------------------------------KVP-Q---------------------------------IVE------------L-------LP-T-------------AEGQV---------------RPL----------VK--L--DF-DT-R---RE------AW--K-MAV-EEVNG--------K-VPSGRVVKDIVN MICAK_2860002_Microcystis_aeruginosa_PCC_9701_389882556 L-T--PE-EERQRLF---WERKV-E---RAF-------YEAGTALKELRD-----RRLYR--S---------T--HK--T-FEEYCR-------DRF-GY-SRRKMDYLI--SGSEVFENL-Q-----------------------------------------TRTIGSQSDRDETRT-IGS-Q-----SDRDETRT---I-GSQ---------------------------I-------LP-I-------------SERQV---------------RPL----------TQ--L--EP-EQ-Q---RE------VW--Q-QAV-EAAGG--------K-VPSGRIVKDIVQ -_Calothrix_sp_PCC_7103_737187200 L-T--YD-EQRERER---LERLV-E---RAF-------YQAGLALKELRD-----KRLYR--N---------T--HT--S-FDKYCK-------DRF-AY-HRSRYYQLI--NAATIVDNL-Q---------------------------------------------------------PCL-Q---------------------------------IVD------------I-------LP-T-------------AESQV---------------RPL----------VL--L--DP-DE-Q---RL------AW--T-QAV-KAASG--------K-VPSAKVVKDIVD MC7420_4124_Coleofasciculus_chthonoplastes_PCC_7420_196179143 
L-T--DA-EIVEFRS---LEATV-EKGLRAF-------WQIGQALRQIRD-----KRLYR--Q---------D--YG--T-FEDYCL-------TRW-EI-SRRSAYQLI--EAASVVENV----R------------------------------------------------------HGA-Q------------------------------------------------I-------IP-A-------------NERQA---------------RPL----------TA--L--KP-EQ-Q---QA------AW--A-KAV-STAPR--------G-KVTAAHVAQVAQ -_Crocosphaera_watsonii_494514224 L-T--EV-ELAEKQR---LEAVV-I---GAV-------WAAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--VGANIYENL-----------------------------------------------------------TTN-R----------------------------------CE------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--ET-KD-Q---VV------AW--G-KAV-EIANG--------K-VPTHRIVKQVVR -_Cyanothece_sp_CCY0110_737832178 L-S--EA-EEVEKQR---LEAVV-S---GAV-------WAAGKALQKLRD-----KKLYR--D---------S--HP--S-FAVYCQ-------ETF-GH-SRQKSDYLI--VAATIYENL-----------------------------------------------------------EAS-G----------------------------------CE------------V-------LP-Q-------------SEFQV---------------RPL----------GV--L--KK-PPLQ---VE------AW--N-KAV-DICNG--------K-VPSHRIVKQVVR -_Crocosphaera_watsonii_737862397 L-S--EV-ELAEKQR---LEAVV-I---GAV-------WSAGFALQQLRD-----QKLYR--D---------T--HS--S-FERYCR-------EQF-GH-SRQKSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-IPTHRIVKQVVR -_Synechocystis_sp_PCC_7509_655839534 L-S--ED-EEKERHR---LELKV-E---RAF-------VEAGTALRKLRD-----RRLYR--S---------T--HK--T-FEEYCS-------DRF-GF-SRRHPYRLI--DAANVVENL-EK--------------------------------------------------------FCV-Q---------------------------------FGH------------I-------LP-A-------------KEFVC---------------RPL----------TI--L--RP-DQ-Q---RE------VW--Q-EIL-QETEG--------K-HPTGKEVKSIVE -_Desulfococcus_multivorans_527022036 
M-------TADRLSE---LEAII-DRNRRSF-------YVIGKALYEIRE-----NRLYR--L---------LG-FK--T-FEAYVK-------DRW-SM-GKSHAHRFI--EAYRVIENL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-E-------------NESQV---------------RPL----------VP--L--TP-LE-Q---RN------IW--R-QFL---ASG--------M-ALTAKNICRLVS -_Anabaena_sp_PCC_7108_515520582 R-G--EA-ITVELGR---LEDRI-EKGLRAF-------WDIGQSLGQIRD-----KQLYR--Q---------S--YK--T-FDEYCL-------NRW-EM-SRRSAYRLI--QAALVYENV----T------------------------------------------------------RGS-Q-SFV-NV----------------------------------THGSQNQI-------LP-T-------------NERQI---------------RPL----------VT--L--PP-EK-Q---RE------AW--A-KAV-STAPN--------G-KVTADHVAQVAR -_Cyanothece_sp_PCC_7424_501601085 L-T--PL-ELSEKEQ---LERQV-E---EAF-------FIAGEALRSLRD-----RRLYR--D---------T--HR--T-FEQYCQ-------DRF-GH-TRQKINYLI--AGAAIYSNL-----------------------------------------------------------TTA-R----------------------------------CQ------------V-------LP-A-------------GEYQV---------------RPL----------SV--L--ES-EL-Q---PE------AW--N-KAV-SLADG--------K-VPTSRIVREVVE -_Cyanothece_sp_PCC_7424_752567372 L-T--PP-ELSEKEQ---LEQQV-E---EAF-------FIAGEALRSLRD-----RRLYR--D---------T--HR--S-FEQYCQ-------DRF-GH-TRQKINYLI--AGAAIYSNL-----------------------------------------------------------TTA-R----------------------------------CQ------------V-------LP-A-------------GEYQV---------------RPL----------SV--L--ES-EL-Q---PE------AW--N-KAV-SLADG--------K-VPTSRIVREVVE -_Scytonema_millei_748137603 L-N--EV-EERDRHR---LELRV-E---RAF-------YEAGKAIKELRD-----RRLYR--S---------T--HN--NDFVGYCR-------DRF-GK-TKQAVNYLI--AAAEVYENL-T--------------------------------------------------------------------------------TTN------------CCR------------V-------LP-T-------------SEGQV---------------RSL----------SG--L--KL-EK-Q---VE------VW--Q-QAI-DLAEG--------K-VPSARIVKGIVE Sta7437_4542_Stanieria_cyanosphaera_PCC_7437_428272064 
L-T--AS-QTKELLR---LEKTI-E---TSF-------YLAGLALRQIQS-----KRLYR--E---------N--YR--T-FEAYCR-------NRF-DF-TRASAYYLI--KAASVVDNL-----------------------------------------------------------KCQ-Q---------------------------------FVD------------I-------LP-T-------------KESQC---------------RPL----------MS--L--PP-EK-Q---TQ------VW--L-EAI-SQAKG--------K-VPSARLVKNIVA -_Crocosphaera_watsonii_546220971 L-T--EV-ELAEKQR---LEAVV-I---GAV-------WAAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--VGANIYENL-----------------------------------------------------------TTN-R----------------------------------CE------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--ET-KD-Q---VV------AW--G-KAV-EIANG--------K-VPTHRIVKQVVR -_Nostoc_sp_PCC_7120_499309017 M-T--EE-EQRDRLN---LERKV-E---RAF-------VEAGKALMELRD-----RRLYR--N---------T--HK--T-FEEYCR-------DRF-GY-SRDAAYLKM--SATNVYENI---------------QKH-----------------------------------------LPT-N-----GRQIP----------------------------------------------MP-T-------------NERQL---------------RSL----------AKAEL--EP-KV-Q---AK------VW--R-QAV-QEAKG--------K-TPSGRIVQDVID PCC7424_5542_Cyanothece_sp_PCC_7424_218175378 L-T--PP-ELSEKEQ---LEQQV-E---EAF-------FIAGEALRSLRD-----RRLYR--D---------T--HR--S-FEQYCQ-------DRF-GH-TRQKINYLI--AGAAIYSNL-----------------------------------------------------------TTA-R----------------------------------CQ------------V-------LP-A-------------GEYQV---------------RPL----------SV--L--ES-EL-Q---PE------AW--N-KAV-SLADG--------K-VPTSRIVREVVE -_Anabaena_variabilis_499635567 M-T--EE-EQRDRLN---LERKV-E---RAF-------VEAGKALMELRD-----RRLYR--N---------T--HK--T-FEEYCR-------DRF-GY-SRDAAYLKM--SATNVYENI---------------QKH-----------------------------------------LPT-N-----GRQIP----------------------------------------------MP-T-------------NERQL---------------RSL----------AKAEL--EP-KV-Q---AK------VW--R-QAV-QEAKG--------K-TPSGRIVQDVID -_Stanieria_cyanosphaera_753864885 
L-S--LE-DERDKLK---LEREV-E---RAF-------YRAGCALKELRD-----RRLYR--S---------T--HK--T-FKEYCQ-------DRF-GF-TRRRSDYLI--GAAEVVDNL---------------------------------------------------------------S-----GEPKPKRE-------P------------LVL------------I-------LP-T-------------SERQC---------------RPL----------TK--L--EP-EQ-Q---RE------IW--R-EAV-ESSKG--------K-VPSGKVVADLVA -_Chroococcales_cyanobacterium_CENA595_769922127 L-T--ED-EEKERHR---LELKV-E---RAF-------YEAGSALRELRD-----RRLYR--S---------T--HK--T-FEAYSQ-------ERF-GM-TPRPAYYLI--AAAGVVENL-E---------------------------------------------------------MRT-N---------------------------------GSQ------------I-------LP-T-------------TERQV---------------RPL----------AN--L--EP-EE-Q---RQ------IW--Q-QAV-QEAGN--------K-VPSGRIVKDIVQ CWATWH0401_4234_Crocosphaera_watsonii_WH_0401_543428839 L-T--EV-ELAEKQR---LEAIV-I---GAV-------WAAGKALQQLRD-----QKLYR--D---------T--HT--S-FERYCR-------EQF-GH-SRQKSDYLI--VGANIYENL-----------------------------------------------------------TTN-R----------------------------------CE------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--ET-KD-Q---VV------AW--G-KAV-EIANG--------K-VPTHRIVKQVVR Sta7437_4575_Stanieria_cyanosphaera_PCC_7437_428272094 L-S--FE-EERDRLR---LERQV-E---RAF-------YQAGIALKELRD-----RRLYR--S---------T--HE--T-FEKYCQ-------DRF-GM-QRRHPYRLI--DAAAVVDNI------LQMCPI-----------------------------------------------RTQ-N-----GSDTSDAN-------K------------TLE------------I-------IP-T-------------SEWQI---------------RSL----------TK--L--EP-RQ-Q---RE------IW--A-RAI-ELAGN--------K-VPSGKIVSELVS UH38_20050_Chroococcales_cyanobacterium_CENA595_768384071 L-T--ED-EEKERHR---LELKV-E---RAF-------YEAGSALRELRD-----RRLYR--S---------T--HK--T-FEAYSQ-------ERF-GM-TPRPAYYLI--AAAGVVENL-E---------------------------------------------------------MRT-N---------------------------------GSQ------------I-------LP-T-------------TERQV---------------RPL----------AN--L--EP-EE-Q---RQ------IW--Q-QAV-QEAGN--------K-VPSGRIVKDIVQ 
-_Chroococcales_cyanobacterium_CENA595_769920071 L-T--PE-EQRDRQR---LELGV-E---QAF-------YQAGKALAQLRE-----RRLYR--T---------T--HK--T-FEAYCQ-------DRF-GF-TRRHSDYLI--NGAKVVENL-----------------LSI---------------------------------------RTI-S-----PPNYAQGN---L-RTI------------PAQ------------I-------LP-T-------------KLEQV---------------KPL----------TS--L--EP-DQ-W---RL------AW--N-KAV-EKAHG--------K-VPSGQIVRAVVE -_Cyanothece_sp_PCC_8802_752568031 L-T--LA-EQAEKQH---LESIV-T---GAV-------WSAGLALRELRD-----LRLYR--D---------T--HA--N-FAEYCR-------ERF-GH-SRQKSDYLI--VAAKIYENL-----------------------------------------------------------SEN-H----------------------------------CQ------------V-------LP-T-------------TEFQV---------------RPL----------GG--L--EP-DL-Q---VQ------AW--Q-EAV-AIASDTSSRNAHPK-VPSNQIVKQVVR -_Microcystis_aeruginosa_763118064 L-T--PE-EERQRLF---WERKV-E---RAF-------YEAGTALKELRD-----RRLYR--S---------T--HK--T-FEEYCR-------DRF-GY-SRRKMDYLI--SGSEVFENL-Q-----------------------------------------TRTIGSQSDRDETRT-IGS-Q-----SDRDETRT---I-GSQ---------------------------I-------LP-I-------------SERQV---------------RPL----------TQ--L--EP-EQ-Q---RE------VW--Q-QAV-EAAGG--------K-VPSGRIVKDIVQ -_Cyanothece_sp_PCC_7424_752567338 P-T--PQ-EEEDLQR---LEKIV-E---CSF-------LDAGLALQEINT-----RKLYR--F---------S--HK--T-FEDYCR-------DRF-GYLNRRHPYRLI--EAALVVENL------L------K---------------------------------------------KCD-Q----------------I-GHK------------KIP------------M--------P-N-------------NEAQV---------------RPL----------TQ--L--DE-EQ-Q---WE------AW--E-NAV-TESKT--------K-VPSAAFVKKSVE -_Nostoc_punctiforme_501381405 L-T--DQ-EQSLRLQ---LERQV-E---RAF-------LSAGQALMELRD-----RRLYR--S---------T--HR--T-FEEYCR-------ERF-NY-SRDAAYLKI--SATVVYENL---------------QKF-----------------------------------------LPT-I-----GRQIP----------------------------------------------MP-T-------------NERQL---------------RFL----------AKAEL--EP-AV-Q---AD------VW--Q-QAV-EQAGN--------K-IPSGRIVKDVVD 
Anacy_5838_Anabaena_cylindrica_PCC_7122_428682367 R-G--EA-ITVELGR---LEDRI-EKGLRAF-------WDIGQSLGQIRD-----KQLYR--Q---------S--YK--T-FDEYCL-------NRW-EM-SRRSAYRLI--QAALVYENV----T------------------------------------------------------RGS-Q-SFV-NVIHGSQN------------PETLTC--GSRSFVNVTHGSQNQI-------LP-T-------------NERQI---------------RPL----------VT--L--PP-EK-Q---RE------AW--A-KAV-STAPN--------S-KVTAAHVAQVAR -_Anabaena_cylindrica_755115685 R-G--EA-ITVELGR---LEDRI-EKGLRAF-------WDIGQSLGQIRD-----KQLYR--Q---------S--YK--T-FDEYCL-------NRW-EM-SRRSAYRLI--QAALVYENV----T------------------------------------------------------RGS-Q-SFV-NVIHGSQN------------PETLTC--GSRSFVNVTHGSQNQI-------LP-T-------------NERQI---------------RPL----------VT--L--PP-EK-Q---RE------AW--A-KAV-STAPN--------S-KVTAAHVAQVAR -_Cyanothece_sp_PCC_7822_754536191 L-S--AD-EEKELLR---LERVV-E---RSF-------YEAGSALRKIRA-----LRLYR--A---------R--FN--S-FEEYTQ-------ERF-GF-TRRQPYYLI--EAANVVDNL-----------------------------------------------------KS----ECE-P--LV-------------------------------H------------I-------LP-S-------------SERQV---------------RPL----------TK--L--NA-TE-Q---RS------VW--N-DAV-SRAQG--------K-VPSGRIVTEALE PCC7424_5430_Cyanothece_sp_PCC_7424_218175274 P-T--PQ-EEEDLQR---LEKIV-E---CSF-------LDAGLALQEINT-----RKLYR--F---------S--HK--T-FEDYCR-------DRF-GYLNRRHPYRLI--EAALVVENL------L------K---------------------------------------------KCD-Q----------------I-GHK------------KIP------------M--------P-N-------------NEAQV---------------RPL----------TQ--L--DE-EQ-Q---WE------AW--E-NAV-TESKT--------K-VPSAAFVKKSVE -_Microcystis_aeruginosa_763120073 L-T--PE-EERQRLF---WERKV-E---RAF-------YEAGTALKELRD-----RRLYR--S---------T--HK--T-FEEYVQ-------DRF-GM-KRAHSYRLI--EATGVVDNL-LA-----------------------------------KVPPMVELLGDSSDKVPP--------------------------LVE---------------------------V-------LP-T-------------NERQV---------------RPL----------IQ--L--EP-DQ-Q---RE------VW--Q-QAV-EAAGG--------K-VPSGRIVKDI-- 
Xen7305DRAFT_00000510_Xenococcus_sp_PCC_7305_442790849 L-T--IE-EESIRFS---LEKKV-E---RAF-------YEAGKALRELRN-----RRLYR--S---------T--HV--T-FEEYCR-------DRF-DF-TRRRPYQLI--EAAQIYDNL-ID--------------------------------------------------------KCE-P---------------------------------IVP------------V-------LP-T-------------KEGQV---------------RPL----------SE--L--TI-DE-Q---PI------AW--E-TAV-EQAGG--------K-VPTGRIVKEVVK -_Xenococcus_sp_PCC_7305_750617827 L-T--IE-EESIRFS---LEKKV-E---RAF-------YEAGKALRELRN-----RRLYR--S---------T--HV--T-FEEYCR-------DRF-DF-TRRRPYQLI--EAAQIYDNL-ID--------------------------------------------------------KCE-P---------------------------------IVP------------V-------LP-T-------------KEGQV---------------RPL----------SE--L--TI-DE-Q---PI------AW--E-TAV-EQAGG--------K-VPTGRIVKEVVK CY0110_32445_Cyanothece_sp_CCY0110_126620031 L-S--EA-EEVEKQR---LEAVV-S---GAV-------WAAGKALQKLRD-----KKLYR--D---------S--HP--S-FAVYCQ-------ETF-GH-SRQKSDYLI--VAATIYENL-----------------------------------------------------------EAS-G----------------------------------CE------------V-------LP-Q-------------SEFQV---------------RPL----------GV--L--KK-PPLQ---VE------AW--N-KAV-DICNG--------K-VPSHRIVKQVVR Cyan8802_4571_Cyanothece_sp_PCC_8802_256592473 L-T--LA-EQAEKQH---LESIV-T---GAV-------WSAGLALRELRD-----LRLYR--D---------T--HA--N-FAEYCR-------ERF-GH-SRQKSDYLI--VAAKIYENL-----------------------------------------------------------SEN-H----------------------------------CQ------------V-------LP-T-------------TEFQV---------------RPL----------GG--L--EP-DL-Q---VQ------AW--Q-EAV-AIASDTSSRNAHPK-VPSNQIVKQVVR Pse7367_3831_Pseudanabaena_sp_PCC_7367_427992361 
L-S--VA-ERQRLHK---YEQMI-R---QNI-------IEIGLALLDIQE-----SRLYR--E---------T--HA--N-FEAYAF-------EQF-GI-SKTYAYGKI--AAAKVIKNL----T------------------------------------------------------GVA-P------------------------------------------------M-------LP-Q-------------NERQC---------------RPL----------AG--L--DA-QQ-Q---RL------AW--Q-EVL---ATG--------D-RITGKLVKEIVA -_Crocosphaera_watsonii_737859558 L-T--ED-EEKEKLR---LERKV-E---RSF-------YEAGIALKLLRD-----GRYYR--N---------T--HP--S-FESYCQ-------DRF-GYRNRRHPYRLI--EAAVTIENL------L------E---------------------------------------------NCD-Q----------------F-GHI------------SSP------------I-------IP-V-------------NESQA---------------RPL----------TS--L--DDPSQ-Q---VK------AW--T-QAI-EKAGG--------K-VPPARIVKEV-- Glo7428_4930_Gloeocapsa_sp_PCC_7428_428267400 L-S--DS-EERERYR---LEFKV-D---RGI-------AQAWLALKELRD-----RRLYR--S---------T--HK--T-FEEYAK-------ERF-GY-NRAHAYRLI--EAAQVLENL------SPNWRQNE---------------------------------------------LQD-E----------------M-SPI------------WRQ------------K-------FP-N-------------SESQC---------------REL----------AK--L--PP-HF-Q---PI------AW--E-KVL-EASGN--------K-APTAKLIKGIVE -_Desulfobacterium_autotrophicum_506384528 --------EQDRLTR---LENLI-ARNQSHF-------HEIGKALKEIKD-----TRLYK--L---------NL-FS--S-FETYAR-------VRW-DM-GRAQAYRLI--ESYKVINNL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-N-------------NESQV---------------RPL----------AP--L--DP-IE-Q---RK------IW--K-AFL---KTA--------M-EITAPNIKQFID -_Cyanothece_sp_PCC_7822_754535993 L-T--SE-EEQELLQ---LEGCI-E---RSF-------YQAGQALKAIRD-----KRLYR--F---------L--YA--T-FEDYCR-------ERF-GF-ARRHSYQLI--DAAVVMDNL---------------LAIEAQCESGAQTENISDNLC---------------ANGA----QTE-T--II-SGETPVAK---------------NIT--PRQ------------I-------LP-T-------------SERQV---------------RPL----------TS--L--NP-SQ-Q---RE------AW--A-KAV-HLAKG--------K-VPSNRIVTRVAE -_Desulfobacterium_autotrophicum_501881616 
--------EQDRLTR---LENLI-ARNQSHF-------HEIGKALKEIKD-----TRLYK--L---------NL-FS--S-FETYAR-------VRW-DM-GRAQAYRLI--ESYKVINNL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-N-------------NESQV---------------RPL----------AP--L--DP-IE-Q---RK------IW--K-AFL---KTA--------M-EITAPNIKQFID -_Gloeocapsa_sp_PCC_7428_754508876 L-S--DS-EERERYR---LEFKV-D---RGI-------AQAWLALKELRD-----RRLYR--S---------T--HK--T-FEEYAK-------ERF-GY-NRAHAYRLI--EAAQVLENL------SPNWRQNE---------------------------------------------LQD-E----------------M-SPI------------WRQ------------K-------FP-N-------------SESQC---------------REL----------AK--L--PP-HF-Q---PI------AW--E-KVL-EASGN--------K-APTAKLIKGIVE Cyan7822_6546_Cyanothece_sp_PCC_7822_306986431 L-T--SE-EEQELLQ---LEGCI-E---RSF-------YQAGQALKAIRD-----KRLYR--F---------L--YA--T-FEDYCR-------ERF-GF-ARRHSYQLI--DAAVVMDNL---------------LAIEAQCESGAQTENISDNLC---------------ANGA----QTE-T--II-SGETPVAK---------------NIT--PRQ------------I-------LP-T-------------SERQV---------------RPL----------TS--L--NP-SQ-Q---RE------AW--A-KAV-HLAKG--------K-VPSNRIVTRVAE Syn6312_1142_Synechococcus_sp_PCC_6312_427376379 L-S--LI-ERSDLER---LEQTI-RAGLNTF-------VEVGQALQKIRE-----QRLYR--E---------T--HQ--T-FEAYCE-------DKF-DL-RRNYADKTI--AASSFVERI----S------------------------------------------------------TIG-V------------------------------------------------I-------LP-T-------------NESQV---------------REI----------LT--L--PE-DR-Q---VE------AW--R-EVA-EAAAS----E---G-KLTADLVKTVVK -_Calothrix_sp_PCC_7103_737187623 L-T--TA-EAEEFRY---LETRV-EECLKSF-------WEIGRALARIRD-----ERLYR--E---------N--YK--T-FEEYCM-------TRW-EM-SRRSAYQLI--DAAVIYRNI----S------------------------------------------------------ENI-I-----------DD------------DVSVAY--GRQKIQ---------I-------LP-A-------------NERQI---------------RPL----------VA--L--SP-KQ-Q---QE------AW--N-QVV-STAPN--------G-KVTAVHVACVVN -_Desulfobacterium_autotrophicum_506384753 
--------EQDRLTR---LENLI-ARNQGRF-------HEIGKALKEIKD-----TRLYK--L---------NL-FS--S-FETYAR-------VRW-DM-GRAQAYRLI--ESYKVISNL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-S-------------NESQV---------------RPL----------AP--L--GP-IE-Q---RK------IW--K-AFL---KTA--------M-EITAPNIKQFID -_Desulfobacterium_autotrophicum_501880589 --------EQDRLTR---LENLI-ARNQSRF-------HEIGKALKEIKD-----TRLYK--L---------NL-FS--S-FETYAR-------VRW-DM-GRAQAYRLI--ESYKVINNL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-N-------------NESQV---------------RPL----------AP--L--DP-IE-Q---RK------IW--K-AFL---KTA--------M-EITAPNIKQFID BegalDRAFT_1454_Beggiatoa_alba_B18LD_386428514 M-D--ER-LDELEQV---IEKEL-----SAF-------YRVGNALAEIKE-----SRLYR--S---------KG-YE--N-FEAYCV-------EVW-GM-HRQHAHRLI--NASAVVRNL-----------------------------------------------------------SSV-G------------------------------------------------D-------MP-K-------------NEAQV---------------RPL----------VG--L--PP-EK-Q---RE------VW--E-TVC---QSG--------K-VTAERVQAVLAV -_delta_proteobacterium_PSCGC_5296_654515559 --------SNQRLVH---LESVI-KKYRQDF-------YSVGKALTEIRD-----GRYYL--K---------LS-FK--S-FESYLK-------HRW-DM-GRSQAYRLI--QAAYVIDNL----S------------------------------------------------------PIG-D------------------------------------------------V-------LP-Q-------------NEAQA---------------RAL----------NK--L--DL-FS-Q---RK------VW--R-NFL---KTQ--------K-PLSALNISKFVS -_delta_proteobacterium_PSCGC_5451_654517946 --------SNQRLVH---LESVI-KKYRQDF-------YSVGKALTEIRD-----GRYYL--K---------LS-FK--S-FESYLK-------HRW-DM-GRSQAYRLI--QAAYVIDNL----S------------------------------------------------------PIG-D------------------------------------------------V-------LP-Q-------------NEAQA---------------RAL----------NK--L--DL-FS-Q---RK------VW--R-NFL---KTQ--------K-PLSALNISKFVS -_Cyanothece_497232068 
L-S--KA-EQDEKQR---LEAVI-S---GAV-------WAAGKALKELRD-----KKLYR--D---------T--HP--S-FAVYCQ-------ETF-GH-SRQKSDYLI--VAATIYENL-----------------------------------------------------------KAG-G----------------------------------CE------------V-------LP-Q-------------SEFQV---------------RPL----------GV--L--KK-PPLQ---VE------AW--D-KAV-AIESG--------K-VPRHHIVKKVVR -_Stanieria_cyanosphaera_753864865 L-T--AS-QTKELLR---LEKTI-E---TSF-------YLAGLALRQIQS-----KRLYR--E---------N--YR--T-FEAYCR-------NRF-DF-TRASAYYLI--KAASVVDNL-----------------------------------------------------------KCQ-Q---------------------------------FVD------------I-------LP-T-------------KESQC---------------RPL----------MS--L--PP-EK-Q---TQ------VW--L-EAI-SQAKG--------K-VPSARLVKNIVA BegalDRAFT_1483_Beggiatoa_alba_B18LD_386428542 M-D--ER-LEELEQV---IEKEL-----SAF-------YRVGNALVEIRD-----KRLYR--L---------KG-YE--N-FEAYCV-------EVW-KM-HRQHAHRLI--NASAVVRNL-----------------------------------------------------------SSV-G------------------------------------------------D-------MP-K-------------NEAQV---------------RPL----------VG--L--PP-EK-Q---RE------VW--E-TVC---QSG--------K-VTAERVQAVLAV -_Tolypothrix_sp_PCC_7601_797212629 L-T--TE-EWSDRIF---LERQV-E---RAF-------YAAAKALKALRD-----RRLYR--S---------T--HA--T-FEDYCR-------SRF-GF-THRHVNYLI--AGSLVVDNL------------------------------MGTNG---SQIENSDKTGTNGSQVENLDEMGT-N-----GSQIENSD--ET-GTN------------GSQ------------I-------LP-T-------------SERQV---------------RPL----------VP--L--EP-EQ-Q---RQ------AW--Q-KAV-ELAGG--------K-IPSGRIVQDIVD -_Beggiatoa_alba_749816531 M-D--ER-LDELEQV---IEKEL-----SAF-------YRVGNALAEIKE-----SRLYR--S---------KG-YE--N-FEAYCV-------EVW-GM-HRQHAHRLI--NASAVVRNL-----------------------------------------------------------SSV-G------------------------------------------------D-------MP-K-------------NEAQV---------------RPL----------VG--L--PP-EK-Q---RE------VW--E-TVC---QSG--------K-VTAERVQ----- -_Beggiatoa_alba_749816534 
M-D--ER-LEELEQV---IEKEL-----SAF-------YRVGNALVEIRD-----KRLYR--L---------KG-YE--N-FEAYCV-------EVW-KM-HRQHAHRLI--NASAVVRNL-----------------------------------------------------------SSV-G------------------------------------------------D-------MP-K-------------NEAQV---------------RPL----------VG--L--PP-EK-Q---RE------VW--E-TVC---QSG--------K-VTAERVQAVLA- -_Coleofasciculus_chthonoplastes_763350225 L-T--DA-EIVEFRS---LEATV-EKGLRAF-------WQIGQALRQIRD-----KRLYR--Q---------D--YG--T-FEDYCL-------TRW-EI-SRRSAYQLI--EAASVVENV----R------------------------------------------------------HGA-Q------------------------------------------------I-------IP-A-------------NERQA---------------RPL----------TA--L--KP-EQ-Q---QA------AW--A-KAV-STAPR--------G-KVTAAHVAQVAQ FDUTEX481_04300_Tolypothrix_sp_PCC_7601_407266570 L-T--TE-EWSDRIF---LERQV-E---RAF-------YAAAKALKALRD-----RRLYR--S---------T--HA--T-FEDYCR-------SRF-GF-THRHVNYLI--AGSLVVDNL------------------------------MGTNG---SQIENSDKTGTNGSQVENLDEMGT-N-----GSQIENSD--ET-GTN------------GSQ------------I-------LP-T-------------SERQV---------------RPL----------VP--L--EP-EQ-Q---RQ------AW--Q-KAV-ELAGG--------K-IPSGRIVQDIVD -_Stanieria_cyanosphaera_753864872 L-S--FE-EERDRLR---LERQV-E---RAF-------YQAGIALKELRD-----RRLYR--S---------T--HE--T-FEKYCQ-------DRF-GM-QRRHPYRLI--DAAAVVDNI------LQMCPI-----------------------------------------------RTQ-N-----GSDTSDAN-------K------------TLE------------I-------IP-T-------------SEWQI---------------RSL----------TK--L--EP-RQ-Q---RE------IW--A-RAI-ELAGN--------K-VPSGKIVSELVS CWATWH0003_2674t1_Crocosphaera_watsonii_WH_0003_357263645 L-T--ED-EEKEKLR---LERKV-E---RSF-------YEAGIALKLLRD-----GRYYR--N---------T--HP--S-FESYCQ-------DRF-GYRNRRHPYRLI--EAAVTIENL------L------E---------------------------------------------NCD-Q----------------F-GHI------------SSP------------I-------IP-V-------------NESQA---------------RPL----------TS--L--DDPSQ-Q---VK------AW--T-QAI-EKAGG--------K-VPPARIVKEV-- -_Myxosarcina_sp_GI1_738538560 
L-T--ES-ERQERNN---LEITV-Q---QAF-------FVAGQALKLLRD-----KRLYR--E---------T--HA--T-FEAYVR-------DRF-DY-TRRAVDYLI--LAAEVVENL-----------------------------------------------------------KRE-Q--IV------------------L----------KTN------------V-------LP-T-------------KESQC---------------RPL----------AK--L--SP-EQ-Q---RE------VW--L-TAV-EKTGG--------K-VPSARIVKEVVN -_Cyanothece_sp_CCY0110_495554039 L-S--EA-ELAQKQE---LESIV-S---SAV-------WSAGRALRELRD-----KKLYR--D---------T--HQ--S-FAVYCQ-------ETF-GH-SRQKSDYLI--VAANIYENL-----------------------------------------------------------KDS-G----------------------------------CE------------V-------LP-K-------------SEFQV---------------RPL----------GV--L--KK-PPLQ---VE------AW--D-KAV-DICNG--------R-VPKHQIVKQVVR -_Crocosphaera_watsonii_494523812 ------------------MEAVV-I---GAV-------WSAGFALQQLRD-----QKLYR--D---------T--HS--S-FERYCR-------EQF-GH-SRQKSDYLI--AGANIYENL-----------------------------------------------------------TTN-R----------------------------------CQ------------I-------LP-T-------------TEFQV---------------RPL----------GV--L--EN-SD-Q---VV------AW--E-KAV-EIANG--------K-IPTHRIVKQVVR -_Synechocystis_sp_PCC_7509_740179430 L-T--TD-EEQERHR---LELKV-E---LGF-------QEAVKALKQLRD-----KKLYR--S---------T--HQ--T-FEDYVV-------ERF-GM-QRAHAYRLI--NAAVVIENL------S----PIG----------------------------------------------D---------------------------------------------------I-------LP-I-------------TESLC---------------REV----------AK--LP-NC-AQ-Q---QK------AW--R-QTL-VGTGG--------K-MPTIKQVRGIVE -_Desulfobacter_postgatei_748757961 MTS--VDSGHKQLAH---LESLI-SSNQEDF-------CQAGRALKEIRD-----NRLYK--L---------AL-FD--T-FEAYTK-------ARW-DI-SRAHAYRLI--KYCEVIHNL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-V-------------NESQV---------------RHL----------AP--L--MP-ME-Q---RR------VW--K-DFL---AGG--------S-ELTAQNIKRFIT -_Scytonema_millei_748135960 
L-S--DE-EKGRLFE---LERQV-E---ESF-------YRAGIALKEIRD-----SRLYR--I---------T--HP--T-FEEYCR-------ERF-GF-ERRYPYQLI--DAAIVADNI---------------------------------RQC---------------------------------------------------------VR--DAH------------I-------FP-T-------------NEYQL---------------RPL----------AK--LKGDP-AK-Q---AE------VW--L-RAV-ERAQG--------K-QPTYEAVKETVQ DespoDRAFT_03587_Desulfobacter_postgatei_2ac9_389403119 VTS--VDSGHKQLAH---LESLI-SSNQEDF-------CQAGRALKEIRD-----NRLYK--L---------AL-FD--T-FEAYTK-------ARW-DI-SRAHAYRLI--KYCEVIHNL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-V-------------NESQV---------------RHL----------AP--L--MP-ME-Q---RR------VW--K-DFL---AGG--------S-ELTAQNIKRFIT -_Streptomyces_roseochromogenus_559034988 L-T--PE-EEKTFEA---CKAGM-DNLHKAF-------WIAGKSLETMAT-----GNLHR--N---------SG-HP--N-FADFVW-------VHW-EI-SESQTYRLM--DEWRIGEAL----S------------------------------------------------------QMG-W---------------------------------------------------------H-P-------------RESQV---------------RKL----------VD--I--KN-AA-G---NT-AAVA-VY--D-AVA---RTG--------K-RVTASLLEDVAR -_Fischerella_sp_PCC_9339_737126426 Q-S--SK-ELEQKIC---LLRNK-E---AKF-------YELGKVLRELRD-----KKLYA--A---------T--HK--T-FKDYCK-------S-F-GL-GNRYVYLLI--AAADVVDNL--------------------------------------AQ-------------------RCP-P----------------------------------GS------------P-------LP-T-------------SERQI---------------RPL----------LR--L--PL-EQ-Q---CM------VW--Q-EAI-ALASG--------Q-VPTCRIVEEVVQ -_Tolypothrix_campylonemoides_751570983 L-T--EE-EVADRHR---LELKI-E---RAF-------YEAGCALRELRE-----RRLYR--S---------T--HK--T-FEEYCR-------ARF-NY-SRDTAYLKI--AAAVVCDNI---------------QKF-----------------------------------------LPT-N-----GRQIP----------------------------------------------MP-T-------------NERQL---------------RDL----------AKANF--EP-EI-Q---AA------AW--L-QGV-EEAGG--------K-VPSGRIVKGIVE -_Scytonema_millei_748136747 
L-S--PQ-EERERHR---LELRV-E---RAF-------YEAGVALRELRD-----KKLYR--S---------T--HR--T-FEAYCR-------DRF-NY-SRDTAYLKI--AAAVVYENI---------------QKF-----------------------------------------LPT-N-----CRQIP----------------------------------------------MP-M-------------NEYQL---------------RAI----------AKAEL--EP-EI-Q---AS------MW--L-QGV-EEAGG--------K-SPSGRIVKGIVE -_Scytonema_millei_748136457 L-T--PE-EERERHR---LELKV-E---RAF-------IESALALRELRD-----RRLYR--D---------T--HP--NDFVGYCR-------DRF-GK-TKQAVNYLI--AALEVYENL-T--------------------------------------------------------------------------------TTI------------GCR------------I-------LP-T-------------NERQC---------------REL----------AK--L--PN-EL-Q---PQ------VW--D-AAV-EQNNG--------K-VPTSSIVKNAVE -_Chroococcidiopsis_thermalis_752825464 L-S--PD-EERERHR---LEIRV-D---RAL-------GEGWSALKQLRD-----LRLYR--S---------T--HK--T-FEEYAK-------DRF-GY-NRAHAYRLI--NAAAVLENL------S----HTD---------------------------------------------RKE-E----------------M-SPN------------WRQ------------K-------MP-S-------------SESQC---------------REL----------AK--L--PA-NK-Q---PK------AW--E-KVL-SVSGD--------K-APTAQIVKTVVE -_Fischerella_sp_PCC_9339_515877940 L-N--EE-EKADRHR---LELKI-E---RAF-------FEAGSALRELRE-----RRLYR--S---------T--HR--T-FEEYCR-------DRF-NY-SRDTAYLKI--AAAVVYDNI---------------QNF-----------------------------------------LPT-N-----GRQTP----------------------------------------------MP-T-------------NERQL---------------RDL----------AKGNL--EP-EL-Q---AA------VW--L-QGV-EEAGG--------K-VPSGRIIKGIVE -_Pseudanabaena_sp_PCC_7367_754053711 L-S--VA-ERQRLHK---YEQMI-R---QNI-------IEIGLALLDIQE-----SRLYR--E---------T--HA--N-FEAYAF-------EQF-GI-SKTYAYGKI--AAAKVIKNL----T------------------------------------------------------GVA-P------------------------------------------------M-------LP-Q-------------NERQC---------------RPL----------AG--L--DA-QQ-Q---RL------AW--Q-EVL---ATG--------D-RITGKLVKEIVA Chro_5819_Chroococcidiopsis_thermalis_PCC_7203_428013042 
L-S--PD-EERERHR---LEIRV-D---RAL-------GEGWSALKQLRD-----LRLYR--S---------T--HK--T-FEEYAK-------DRF-GY-NRAHAYRLI--NAAAVLENL------S----HTD---------------------------------------------RKE-E----------------M-SPN------------WRQ------------K-------MP-S-------------SESQC---------------REL----------AK--L--PA-NK-Q---PK------AW--E-KVL-SVSGD--------K-APTAQIVKTVVE -_Cyanothece_497231939 L-S--AA-ELSEKQR---LEAIV-V---GAV-------WAAGKALRELRD-----KKLYR--D---------T--HP--S-FAVYCQ-------ETF-GH-SRQKSDYLI--VAATIYENL-----------------------------------------------------------EAS-G----------------------------------CE------------V-------LP-K-------------SEFQV---------------RPL----------GV--L--KK-PPLQ---VE------AW--D-KAV-AISDG--------K-VPRHHIVKKVVR -_Nocardia_jiangxiensis_750537552 L-S--DG-EQNQLSA---CESSI-STLRMAF-------WAAGRALQIVRD-----GRLYR--N---------A--YP--S-FDDYVE-------QRW-DM-QRSYAHKLI--RAWPLAAKL----H------------------------------------------------------PLA----------------------------------------------------------PG-I-------------NEGQI---------------REL----------LP--V--AA-EY-G---ED-AAVT-VY----ATL-VAG-D--------V-KITAGKLREAVA -_[Scytonema_hofmanni]_UTEX_B_1581_657929542 L-T--QD-EADDRHR---LELKI-E---RAF-------YEAGCALRELRE-----RRLYR--S---------T--HS--N-FEEYCR-------DRF-NY-SRDTAYLKI--AAAVVYDNI---------------QKF-----------------------------------------LPT-N-----GRQIP----------------------------------------------MP-T-------------NERQL---------------RDL----------AKANF--EP-EV-Q---AD------AW--M-QGV-EEAGG--------K-APSGRIIKGIVE -_Fischerella_sp_PCC_9339_648361686 --------EQNVKKA---LEKNT-P---LYF-------YDAGQALSELFQ-----QKLYR--S---------S--HS--S-FDEYCL-------ERF-RL-GRSQVYRYI--YAASTYDNL----K------------F-----------------------------------------SHG-S---------------------------------NQQ-----A------L-------LP-T-------------TERQI---------------RDL----------YN--L--EP-TL-Q---RE------VW--Q-TAI-DLAVG------C---SPSSRMVKEALL -_Streptacidiphilus_melanogenes_755026932 
L-S--EV-ELHDLGT---CERAV-ENLATAT-------WLAGKALQTIRD-----GKLYR--Q---------T--HR--T-FEEYVT-------ERW-EI-GERTAYQMI--EEWPLAERL----N------------------------------------------------------QAY-G--------------------------------------------------------KP-V-------------TASHI---------------RAL----------LP--V--TT-RF-G---LD-DAVE-LYQQL-RAR-AQADG--------V-RLTAQITGQIAK Ple7327_4170_Pleurocapsa_sp_PCC_7327_427981701 L-N--DT-EKMRLQE---IEAIV-AQGLQTF-------YEVGQALIEIRD-----RKLYR--E---------T--HK--T-FEAYCK-------EKW-SL-TRPSAYRLL--KAAEVIKNL----S------------------------------------------------------PMG-D------------------------------------------------K-------FP-T-------------NERQV---------------RPP----------TK--L--PP-AQ-Q---LE------IW--Q-KAV-EESPN--------G-TPTAKIVERLVK -_Nocardia_otitidiscaviarum_748262016 L-S--EC-EHAQLAA---CESSI-DTLRQAF-------WSAGRALQIVRD-----GRLYR--T---------G--YA--T-FDDYVE-------QRW-DM-RRSYAHKLI--RAWPLAARL----H------------------------------------------------------RHA----------------------------------------------------------PA-I-------------NEGQI---------------REL----------LP--V--AA-EH-G---DD-AAVT-VY----TTL-AAE-N--------V-KITAATLREAVA -_Nocardia_otitidiscaviarum_759915784 L-S--EC-EHAQLAA---CESSI-DTLRQAF-------WSAGRALQIVRD-----GRLYR--N---------G--YA--T-FDDYVE-------QRW-DM-RRSYAHKLI--RAWPLAARL----H------------------------------------------------------RHA----------------------------------------------------------PA-I-------------NEGQI---------------REL----------LP--V--AA-EH-G---DD-AAVT-VY----TTL-AAE-N--------V-KITAATLREAVA -_Oscillatoria_sp_PCC_10802_516328499 L-N--GS-ERARLEQ---LESLI-DQEVHLF-------SQVGKALDEICD-----KRLYR--E---------T--HN--T-FQGYCQ-------DKW-GI-ARRRAYQLI--DAAQIVENL----S------------------------------------------------------ALG-A------------------------------------------------Q-------IP-T-------------SERQV---------------RPL----------TG--L--PK-DA-Q---VE------IW--Q-KAV-ALASN--------G-IPTGTAVQRLVD -_Oscillatoria_sp_PCC_10802_763312164 
L-T--QP-EQIELEN---LEAQV-QRGIKAF-------WEMGEALRQIRD-----KRLYR--Q---------N--YS--S-FEKYCP-------ARW-QI-SWRSAYQLI--EAAVLMENL----R------------------------------------------------------HGA-G-------------------------------------IE---------T-------LP-A-------------NERQA---------------RPL----------TA--L--PA-EK-Q---RE------AW--V-KAV-TTAPS--------G-RITYHHVVKIAK -_Desulfatibacillum_aliphaticivorans_654862385 L-T--EF-EQDRRDA---LEGII-KRNMAGF-------IAVGLALKEMLE-----SRLYR--S---------T--HP--T-WEAYIR-------DFF-EI-SRSYALRLI--DAADTVRLI----S---------------------------------NEGIDPVDFDDGRQ-------NVA-N---------------------------------WQH-----P--------------VP-A-------------NEAQV---------------RPL----------SK--L--PV-ED-R---PG------AW--F-EAL-KTAPE--------G-KITARHVSDTVK OMM_03956_Candidatus_Magnetoglobus_multicellularis_str_Araruama_571788483 --D--QK-RIKQLHS---FEAVI-KKQQSNF-------HVLGKTLSKIKD-----LSLYK--H---------IG-FK--S-FEDYTI-------KRL-DI-KKSQAYRMI--NASKVIENL----S------------------------------------------------------PIG-D------------------------------------------------I-------LP-Q-------------NEAQA---------------RLL----------TK--F--DT-FT-Q---QQ------LW--Q-KFL---ETG--------M-ALTTSNIRKSII -_Fischerella_sp_PCC_9605_737153646 L-S--AE-ERERLLA---LEREV-V---ESF-------LTAARALREIRD-----RRLYR--E---------S--YP--N-FEEYCE-------ARF-GY-GKRLAYYYI--DAANVADNL-----------------------------------------------------EG----S-E-Q--IV-------------------------------H------------V-------LP-T-------------SESQV---------------RPL----------KG--L--AP-DA-Q---RL------VW--S-KAV-EKAQG--------K-APSINVVRETLK -_Desulfobacterium_autotrophicum_506386923 ---------MDRLIE---LETLI-ARNQERF-------CQIGRALKAIRD-----GRLYR--Q---------AL-FD--T-FEAYAR-------TRW-DM-GRSQAYRLI--KSYEVIHNL----S------------------------------------------------------PIG-D------------------------------------------------R-------MP-A-------------NESQV---------------RPL----------AQ--L--AP-DE-Q---RK------TW--K-DFI---NSG--------V-ESSALNIRRFID 
C789_3692_Microcystis_aeruginosa_DIANCHI905_443331859 --------ETPSLED---LERII-DRGQKAF-------YVVGTALKSIRD-----ARLYQ--H---------QQNYP--D-FDSYCR-------ERW-DM-SRRKADNFI--RASVFIDNL-----------------------------------------------------------RRN-N---------------------------------CSE--------------------LP-S-------------NESQI---------------RPL----------LS--I--KS-EE-E-Q-IE------IW--L-DII----------Q---S-APLGKITAKYVQ -_Streptacidiphilus_neutrinimicus_755021339 L-S--EV-ELHDLGV---CERAV-ENLATAT-------WLAGKALQTIRD-----GKLYR--H---------T--HA--R-FEDYIT-------ERW-DI-SERAAYQMI--EEWPLAERL----N------------------------------------------------------QAY-G--------------------------------------------------------KP-V-------------TASHI---------------RAL----------LP--V--TT-RF-G---LD-AATE-LYQQL-RTR-ADADG--------V-RLTAQITGQIAK -_Oscillatoria_nigro-viridis_504989405 L-D--VT-ERARLEE---LESIV-EKGLQTF-------YEVGKALDEIRE-----QKLYR--E---------S--HK--T-FDAYCR-------EKW-GI-AKQTANRFI--AAAQVIENL----T------------------------------------------------------PMG-V------------------------------------------------K-------IP-A-------------NERQV---------------RPL----------TG--L--SP-EL-Q---LE------IW--Q-EAL-ESSPN--------G-IPSGAAVQRLVE -_Pleurocapsa_minor_752746526 L-N--DT-EKMRLQE---IEAIV-AQGLQTF-------YEVGQALIEIRD-----RKLYR--E---------T--HK--T-FEAYCK-------EKW-SL-TRPSAYRLL--KAAEVIKNL----S------------------------------------------------------PMG-D------------------------------------------------K-------FP-T-------------NERQV---------------RPP----------TK--L--PP-AQ-Q---LE------IW--Q-KAV-EESPN--------G-TPTAKIVERLVK -_Chlorogloeopsis_fritschii_515385753 L-T--EE-EKADRHR---LELKI-E---RAF-------YEAGCALRELRE-----RRLYR--S---------T--HR--T-FEEYCR-------DRF-NY-SRDTAYLKI--AAAVVYDNI---------------QKF-----------------------------------------LPT-I-----GRQTP----------------------------------------------MP-T-------------NERQL---------------RDL----------AKANF--EP-EL-Q---AA------AW--L-QGI-EEAGG--------K-VPSGRIIKGIVE 
-_Pleurocapsa_sp_PCC_7319_738911651 L-S--SP-ELDLRSQ---LENQV-R---SAF-------YTAGMALTQLKE-----LRLYR--S---------T--HL--S-FEEFCQ-------DVF-GY-SRDYAYLKM--TAAQVYQNL------LDN--------------------------------------------------LPT-N-----GRQVP----------------------------------------------LP-T-------------RQRQL---------------RPI----------IKAKL--KD-DV-Q---VQ------VW--Q-EAV-DLAHN--------Q-VPTSSIVAQAVR IPF_3218_Microcystis_aeruginosa_PCC_7806_159026604 --------ETPSLED---LERII-DRGQKAF-------YVVGTALKSIRD-----ARLYQ--H---------QQNYP--D-FDSYCR-------ERW-DM-SRRKADNFI--RASVFIDNL-----------------------------------------------------------RRN-N---------------------------------CSE--------------------LP-S-------------NESQI---------------RPL----------LS--I--KS-EE-E-Q-IE------IW--L-DII----------Q---S-APLGKITAKYVQ -_Fischerella_sp_PCC_9605_652339044 L-N--SS-IKETFTA---IDRFE---------------WQAIDEILQMRE-----EQIYR--E---------VG-YK--T-FEEYCQ-REL---YAW-G--GYRRINQLL--GAKKVIDAV---------------------------------------G-------------------ELG-E----------------------H-------------------------I----------K-------------NERQA---------------RPL----------LH--L-----VK-E---PE------KL--K-TAV-AIALK-EN-----P-SPSESDFAAAAQ Emtol_0315_Emticicia_oligotrophica_DSM_17448_387857486 L-S--NE-ENERLTI---CEEVI-DKGLKTF-------IEVGNALFEIRN-----NKLYR--G---------S--FT--T-FEAYCK-------ERW-QL-KRQRAYELM--GAAEVVNQL----S----------------------ENNLS---------------------------EIS-D---------------------------------KSN------------L-------LP-T-------------KESHA---------------NAL----------TQ--I--PV-TL-R---FQ------VW--R-AVV-EESLT-TK-----K-PITAKMIVEQTE -_Streptomyces_sp_CNQ865_654253752 L-S--AQ-EQQDREA---CEAGV-TNLATAF-------WVAGKSLETLEQ-----AKLYR--E---------T--HP--N-FAEYVW-------ERW-EI-SESHLHRLK--AEWRIGEKL----S------------------------------------------------------EFG-Y---------------------------------------------------------R-P-------------REAQV---------------REL----------LP--V--AE-QH-G---PD-AAIR-IY--D-TVA---RQA--------P-RVTAKLLQQAAA 
-_Synechococcus_sp_PCC_6312_752791755 L-S--LI-ERSDLER---LEQTI-RAGLNTF-------VEVGQALQKIRE-----QRLYR--E---------T--HQ--T-FEAYCE-------DKF-DL-RRNYADKTI--AASSFVERI----S------------------------------------------------------TIG-V------------------------------------------------I-------LP-T-------------NESQV---------------REI----------LT--L--PE-DR-Q---VE------AW--R-EVA-EAAAS----E---G-KLTADLVKTVVK -_Nocardia_seriolae_696559281 L-S--RS-ESEQLEV---CESSI-DALRVAF-------WTAGRALQIVRD-----GRLYR--A---------D--HA--T-FDEYVE-------KRW-DM-QRSYAHKLI--RAWPLAARL----H------------------------------------------------------PVA----------------------------------------------------------PS-I-------------NEGQI---------------REL----------LP--V--VA-AH-G---EE-AAVT-VY----TTL-AA--D--------V-KVTAGKLREAVA -_Streptosporangium_roseum_740087909 M-D--DG-EQADLAA---CEAAI-DTLRIAF-------WAAGKALQVIRD-----GRLYR--A---------T--HS--T-FEEYTI-------DRW-EM-SRTQADRLI--RAWPLAERL----A------------------------------------------------------PIG----------------------------------------------------------VKII-------------NESQV---------------REL----------VP--L--AE-QH-G---QD-AAAV-VY----QTI-VEADG--------V-RVTA-------- -_Nocardia_asiatica_760034517 L-S--ER-ERAQLTA---CESSI-DTLRIAF-------WAAGRALQIVRD-----GRLYR--D---------S--HE--T-FDEYVE-------QRW-DM-QRSYAHKLI--RAWPLAARL----H------------------------------------------------------PVA----------------------------------------------------------PA-I-------------NEGQV---------------REL----------LP--V--AA-VY-G---ED-AAVT-VY----TTV-AAGAE--------V-KVTAGKLRQAIA -_Nocardia_sp_CNY236_738617228 L-S--DR-ERAQLTA---CESSI-DTLRIAF-------WAAGRALQIVRD-----GRLYR--D---------T--HE--T-FDQYVE-------QRW-DM-QRSYAHKLI--RAWPLADRL----H------------------------------------------------------PMA----------------------------------------------------------PA-I-------------NEGQI---------------REL----------LP--V--AV-EH-G---DD-AAVT-VY----TTI-ADGIG--------T-RVTAARLREAVA -_Streptacidiphilus_melanogenes_755027016 
L-S--EV-ELHDLGV---CERAV-ENLATAT-------WLAGKALQSIRD-----GKLYR--H---------T--HA--R-FEDYVT-------ERW-EI-SERAAYQMI--EEWPLAERL----N------------------------------------------------------QAY-G--------------------------------------------------------KP-V-------------TASHI---------------RAL----------LP--V--TT-RF-G---LD-AATE-LYQQL-RTR-AQADG--------V-RLTAQITGQIAK -_Myxosarcina_sp_GI1_738538439 L-S--AE-ELELRAT---LEHQV-T---SSF-------HTSGMALAKLNE-----LRLYR--N---------T--HS--N-FEEFCL-------DVF-GY-SSDYAYLKM--AAARIYQNL------SDN--------------------------------------------------LPT-N-----GRHFP----------------------------------------------LP-T-------------RQRQL---------------RPI----------VKAKL--DK-DA-Q---LE------VW--L-DAI-ALAEG--------K-IPSYAIVAEAVR I546_4173_Mycobacterium_kansasii_732_576415619 M-N--PA-EARALTQ---HETVI-ERGIKTF-------IAVGTALAAIRD-----QRLYR--E---------R--YA--T-FENYCH-------MRW-GL-SRSRAYRLI--DAANVVDSM----S------------------------------------------------------PIG-D------------------------------------------------T-------VP-A-------------TESQA---------------REL----------MG--L--TP-TQ---A-AT------VM--R-VAH-EQTSG--------K-ITAAAIRAARSR GM3708_3465_Geminocystis_sp_NIES-3708_770470161 --------KYNQLEQ---ITNSI-KYNKISY-------IKLGMQLYQVKY-----YRLYK--N---------N--YK--S-FKDYCE-------KAV-YY-PVWRANQVI--ESSSVAIQL----I------------------------------------------------------KAG----FN--------------------------------------------I-------IP-Q-------------NEAQA---------------RLL----------IK--L--NE-EE---L-IR------KW--Q-EVL-DTYEV------H-K-ITANRIENIVFG -_Nocardia_concava_750531062 L-S--RS-ESEQLEV---CESSI-DALRVAF-------WTAGRALQIVRD-----GRLYR--A---------D--HA--T-FDDYVE-------KRW-DM-QRSYAHKLI--RAWPLAARL----H------------------------------------------------------PVA----------------------------------------------------------PS-I-------------NEGQI---------------REL----------LP--V--AA-AH-G---EE-AAVT-VY----TTL-AA--D--------V-KVTAGKLREAVA -_Opitutaceae_bacterium_TAV1_494604192 
L-T--SD-ERRVLRC---CKKAV-QAAMNNL-------MEAAPLLREVKE-----KRLYR--E---------G--YA--S-FEEFCR-------AEF-SM-DRTHAYRLI--AAANVVEAI----D------------------------------------------------------EAR-S-----KARYGQPD---T-GED-------VAL--GDT------------V-LP----LP-T-------------NERQA---------------RPL----------AQ--L--PA-AD-Q---PK------AW--K-KAV-NMAGG--------K-QPTGKQVAAAVK -_Myxosarcina_sp_GI1_738540713 L-S--ES-EKAERDN---LERTV-Q---QAS-------FSAWNALKILRD-----KRLYR--E---------T--HA--T-FESYVR-------DRF-GF-TRRSADYFI--SAAKIVENL----K---------------------------------AENSFPFPQNKKSEL------KRE-Q--FV------------------L----------KTN------------V-------LP-T-------------KESQC---------------RSL----------AK--L--SP-EE-Q---RQ------AW--G-RAV-ELAGN--------K-VPSSRLVKEAVR -_Fischerella_sp_PCC_9339_737126563 L-S--KE-EKKLLER---LEQQV-K---DSF-------LAAAHALREINE-----KRLYR--E---------T--HK--T-FDSYCE-------ERF-GF-KRRQAYHYI--EGAKVTDAL-Q---------------------------------------------------------QSA-R---------------------------------TVH------------I-------LP-A-------------NEYQI---------------RPL----------AS--LK-EP-EK-Q---IE------AW--E-RAV-EHAGG--------K-LPTHELVKKTVQ -_Streptomyces_aurantiacus_514922043 L-T--AE-EREALDA---CKAGL-NNLHNAF-------WIAGKSLETMQT-----GNLHR--N---------EG-IG--S-FAEYVW-------INW-EI-SESQMHRLI--GEWRIGEQL----A------------------------------------------------------QLG-H---------------------------------------------------------R-P-------------RESQV---------------REL----------AD--I--KQ-AA-G---DR-AAVA-VY--D-AVV---RAG--------Q-RVTARLLKDVSR -_Nocardia_abscessus_760001072 L-S--DR-ERVQLTA---CESSI-DTLRIAF-------WAAGRALQIVRD-----GRLYR--D---------S--HE--T-FDEYVE-------QRW-DM-QRSYAHKLI--RAWPLAARL----H------------------------------------------------------PVA----------------------------------------------------------PA-I-------------NEGQV---------------REL----------LP--I--AA-EY-G---ED-AAVT-VY----TTI-AAGAD--------V-KVTAGKLRQAIA -_Streptomyces_canus_703383829 
L-S--EL-ERETLEA---CKAGM-NNLHNAF-------WVAGKSLETMAV-----GNLHR--N---------EG-FA--N-FAEFVW-------TNW-EI-SESQVYRLM--DGWRIGESL----S------------------------------------------------------QLG-H---------------------------------------------------------R-P-------------RESQV---------------REL----------TD--I--KR-TA-G---DE-AAVA-VY--D-AVA---RSD--------K-RVTARLLARVAR -_Streptomyces_sp_NRRL_F-5123_759461293 L-S--DQ-EQQDLAA---CKAGV-DNLRNAF-------WIAGKSLETLRT-----AELHR--G---------E--NP--N-FAEWVW-------DTW-EI-SETQLYRLM--DEWRVGEAL----A------------------------------------------------------NLG-H---------------------------------------------------------K-P-------------LEGQV---------------RKL----------TE--V--RR-QT-N---DK-IAIT-VY--D-TIA---RCT--------E-RVTGKLVETVVD -_Nocardia_niigatensis_750579664 L-S--RT-ETDQLEL---CESSI-DALRVAF-------WTAGRALQIVRD-----GRLYR--A---------D--HA--T-FDEYVE-------KRW-DM-QRSYAHKLI--RAWPLAARL----H------------------------------------------------------PLA----------------------------------------------------------PA-I-------------NEGQI---------------REL----------LP--V--AA-AY-G---ED-AAVT-VY----TTL-AA--D--------V-KVTAGKLREAVA Dalk_3579_Desulfatibacillum_alkenivorans_AK-01_218762801 L-S--KK-EQDRRRE---LEGLV-MKNMAAF-------LEMGAALAEIQR-----DRLYR--S---------T--HR--N-FEAYVR-------DVF-EI-GKSYAHRQI--AGYQVVENI----R---------------------------------SA-------------------MAP-D-----GKNVA------------N----------WRQ------------I-------LP-A-------------NEAQV---------------RPL----------TL--L--DD-PE-E-Q-VE------AW--K-HAV-KIGED-SK-R---G-KVTARHVAQAVG -_Myxosarcina_sp_GI1_738539870 L-E--QL-ERTIEKG---LKVLR-----QTF-------FEVGLALAEVKK-----RELYL--A---------KG-YS--S-YTSYCA-------GEW-KI-NKTYAYDLL--KAAEVVVNL-I--P------------------------------------------------------QLE-S----------------F-PQS-------FSA--IAE-----K------L-DYP---LP-R-------------NESQC---------------REI----------AK--L--KT-AE-L-Q-RQ------AW--Q-EIL-ESDEN----------KITAKQIRHTVA -_Prauserella_rugosa_738995333 
L-T--VA-EADQLAD---LEAVI-AQGLQTF-------VRVGQALLTIRD-----NRLYR--K---------T--HE--T-FEEYCR-------ERW-EM-TKDSANRVI--RAAEVVEVM----S------------------------------------------------------PIG-L------------------------------------------------T--------P-A-------------TESQA---------------REL----------AP--LKDDP-DA-M---RA------VW--E-TAN-E-RTD--------G-KPTAKVIRECRE -_Deinococcus_sp_2009_760136477 L-A--PH-EVQRLHS---LEATV-RDGLRDF-------QRTGQALSEIRD-----NELFR--A---------T--HD--T-FEAYLE-------ERW-GF-TPTQADRII--EANEVTKVL----E------------------------------------------------------PLG--------------------------------------------------I-------AP-I-------------SERQA---------------RAF----------KG-----AA----K-----------IL----TEL--------------E-PEQRRLVARLAQ -_Deinococcus_ficus_760094872 L-A--PH-EVQRLHN---LEATV-RDGLRDF-------QRTGQALSEIRD-----NELFR--A---------T--HD--T-FEAYLE-------ERW-DF-TPSQADRII--EANEVTKVL----E------------------------------------------------------PLG--------------------------------------------------I-------AP-I-------------SERQA---------------RAF----------KG-----AA----K-----------IL----TEL--------------E-PEQRRLVARLAQ -_Myxosarcina_sp_GI1_738540774 L-S--PQ-EQQLRDK---LEQQV----LTGF-------VLRGQALRTIKR-----LRLYR--D---------S--YD--N-FESYCE-------DVF-GF-SMLYIERCM--RAAETYYQI----V----------EYL-----------------------------------------KTQ-G------------------------------------------------L-KEA---LP-N-------------KQKQL---------------RPI----------FQAHL--SP-IE-A---GE------VW--V-MAV-DIALG--------K-VPSYSMVKTAVK -_Deinococcus_radiodurans_499190814 L-A--PH-EQQRLDD---LEQTV-EGGLRDF-------QRTGQALSEIRD-----NELYR--A---------T--HD--S-FEAYLQ-------DRW-GF-GVRQADRLI--DAAQVAKQL----E------------------------------------------------------PLG--------------------------------------------------I-------SP-R-------------HEAQA---------------RSF----------RP-----AA----R-----------IV----EEL--------------E-PEQQRLVARLVE NS07_v2contig00189-0005_Nocardia_seriolae_749286507 
L-S--RS-ESEQLEV---CESSI-DALRVAF-------WTAGRALQIVRD-----GRLYR--A---------D--HA--T-FDEYVE-------KRW-DM-QRSYAHKLI--RAWPLAARL----H------------------------------------------------------PVA----------------------------------------------------------PS-I-------------NEGQS---------------REL----------LP--V--VA-AH-G---EE-AAVT-VY----TTL-AA--D--------V-KVTAGKLREAVA -_Streptomyces_vietnamensis_751920075 L-S--PQ-EEADFEA---CKAGV-RNLQNAF-------WVAGKSLETIKT-----GNLQR--R---------V--HA--N-FATFVW-------EEF-EI-SEPQMHRLV--EEWRVGQAL----S------------------------------------------------------QLG-W---------------------------------------------------------K-P-------------KESQV---------------REL----------TG--I--TK-EA-G---DQ-TAVT-VY--D-TIA---RNV--------K-RVTAQVIRDVVA -_Streptacidiphilus_melanogenes_755027075 L-S--TV-ELHDLGV---CERAV-DNLATAT-------WLAGKALQSIRD-----GKLYR--E---------T--HR--T-FEEYVT-------ERW-EI-GERTAYQMI--EEWPLAERL----N------------------------------------------------------QAL-G--------------------------------------------------------KP-A-------------TASHT---------------RAL----------LP--V--VA-RF-GADGLD-AAAG-LYEEL-RDR-AQAEG--------V-RVTAALTGQIVK -_Cyanothece_sp_PCC_7822_503099618 L-------ELQIQEG---LRLSM-----QGF-------YLIGSALRQLKA-----LKLHR--N---------S--HL--R-FDEYAK-------ERF-KI-SKRYQHYLI--NAVEVIDVF-----------------------------------------------------------SND-K----------------------Q----------YIA-L----------I-NGQ---IP-E-------------REFHC---------------RQL----------LK--L--GN----Q---PD-IWKK-AW--Y-KSV-MLASG--------E-TPTAKIVGEVVK -_Deinococcus_516480931 L-A--PH-EEQRFQA---LEQTV-EGGLRDF-------QRTGQALAEIRD-----NHLFR--E---------T--HA--D-FETYLR-------DRW-GF-NLRQADRII--DAAVVARQL----E------------------------------------------------------PLG--------------------------------------------------I-------EP-R-------------HERQA---------------STF----------KP-----AV----K-----------II----GAL--------------E-PEQQRLISRLVE -_Deinococcus_radiodurans_736351733 
L-A--PH-EQQRLDD---LEQTV-EGGLRDF-------QRTGQALSEIRD-----NELYR--A---------T--HD--S-FEAYLQ-------DRW-GF-GVRQADRLI--DAAQVAKQL----E------------------------------------------------------PLG--------------------------------------------------I-------SP-R-------------HEAQA---------------RSF----------RP-----AA----R-----------IV----EEL--------------E-PEQQRLVARLVE -_Crocosphaera_watsonii_494523440 L-T--HE-EARDRGN---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYDNL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDESPISLTTNRAQNPEVE-MTT-N-----GLQTEMAK---M-TTN------------GTQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--Q-KAV-EQAGG--------K-VPSGRIVKSIVD -_Crocosphaera_watsonii_737861903 L-T--HE-EARDRGN---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYDNL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDESPISLTTNRAQNPEVE-MTT-N-----GLQTEMAK---M-TTN------------GTQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--Q-KAV-EQAGG--------K-VPSGRIVKSIVD -_Crocosphaera_watsonii_494523440 L-T--HE-EARDRGN---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYDNL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDESPISLTTNRAQNPEVE-MTT-N-----GLQTEMAK---M-TTN------------GTQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--Q-KAV-EQAGG--------K-VPSGRIVKSIVD -_Crocosphaera_watsonii_737861903 L-T--HE-EARDRGN---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYDNL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDESPISLTTNRAQNPEVE-MTT-N-----GLQTEMAK---M-TTN------------GTQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--Q-KAV-EQAGG--------K-VPSGRIVKSIVD CWATWH0003_2673b1_Crocosphaera_watsonii_WH_0003_357263649 
L-T--YE-EERDRLH---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYENL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDELPTSLTTNGTQNPEVE-MTT-N-----GRQTEMAK---M-TTN------------GRQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--A-LAV-EQAGG--------K-VPSGRIVKSIVS -_Crocosphaera_watsonii_546230520 L-T--YE-EERDRLH---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYENL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDELPTSLTTNGTQNPEVE-MTT-N-----GRQTEMAK---M-TTN------------GRQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--A-LAV-EQAGG--------K-VPSGRIVKSIVS -_Crocosphaera_watsonii_737859551 L-T--YE-EERDRLH---LERKV-E---RAF-------YEAGKALQELRE-----RRLYR--N---------T--HK--T-FESYCQ-------ERF-GH-SRQKANFLI--AGAKAYENL------TTNCCQTDPENLTTNGTQTETETQMTTNGCQ-NQDELPTSLTTNGTQNPEVE-MTT-N-----GRQTEMAK---M-TTN------------GRQ------------I-------LP-T-------------SEGQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--A-LAV-EQAGG--------K-VPSGRIVKSIVS CWATWH0402_1321_Crocosphaera_watsonii_WH_0402_543538779 L-T--HE-EERDRLH---LEGQV-E---RAF-------FSAGIALQELRD-----RRLYR--S---------T--HK--T-FEDYCQ-------ERF-GY-SRRKMDYLI--AGSEVYQNLLLPSEMRTNCSQTD---------LPDDQSQMRTNGSQITESDDNLQMRTNCSQNADDE-MRT-N-----CSQNADGK---T-RTN------------CSQ------------I-------LP-T-------------REAQV---------------RPL----------TK--L--EP-DQ-Q---RE------AW--Q-KAV-EQAGG--------K-VPSGRIVKSIVD -_Streptomyces_sp_CT34_759552136 L-S--AE-EEDLLHL---CMRGI-EQFQNAW-------WVMAKSLANINA-----RRLYR--K---------T--HA--N-FEDFCW-------DNF-KK-SRPTAYEEM--TAYAMGELL----S-A----------------------------------------------------RAD-K----------------------P---------------------------------FD-E-------------NSNEV-----------SA--RAD--------T-PA--I-----GK-K-V-AS------AY----NPI-TKDYG----------AEVSVAVHETIE -_uncultured_Mediterranean_phage_uvMED_787066260 
L-------ETEISAA---YADRL-----------HQD-LAIGKALTQIFR-----RRLYR--G-----KDG----GR--D-WETWLT-ECS---AKF----TQGRGPLTK--KPALYLRGF---------------YQF-----------------------------------------RCE----VL------------L-KGS-G----------RSP-----D------I-----P-LP-A-------------SPYQV---------------RPL--LA----Q-LE--T--HP-EA-A---VD------MW--K-SAC-ADAAR-EK-V-G-K-VPSYEQVQRAAL -_Borrelia_bissettii_503783569 L-------KDKLKTL---TTDDI-----------YNK-IETAKVLNTINQ-----KKLYI--------LDG----YK--N-FYSFLA--------DF-KI-AKSQAYKYI-KIVSGVEKGI-I--D----------YNF-----------------------------------------IAN-N-----GIEKTIKQ---L---E------------SNN------------V-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNG----------------KFTGFLLEELLE -_Borrelia_burgdorferi_695263564 L-------KDKLKTL---TTDDI-----------YNK-IETAKILNTINQ-----KKLYI--------LDG----YK--N-FYSFLA--------DF-KI-AKSQAYKYI-KIVSGVEKGI-I--D----------YNF-----------------------------------------IAN-N-----GIEKTIKQ---L---E------------SNN------------V-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNG----------------KFTGFLLEELLE -_Borrelia_garinii_696415789 L-------KDKLKTL---TTDDI-----------YNK-IETAKILNTINQ-----KKLYI--------LDG----YK--N-FYSFLA--------NF-KI-AKSQAYKYI-KIVSGVEQGI-I--D----------YNF-----------------------------------------IAN-N-----GIEKAIKQ---L---E------------GSN------------I-------IK-K-----------S-NQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-SNA----------------KFTGFLLDKLFS OY14_04355_Borrelia_chilensis_741043351 L-------KEKLKVL---IKEES-----------YNK-IETARILKEIND-----NKYYI--------VDG----YK--N-FSHFLK--------DY-NM-AKTSVYRYI-KIAVGIDSGK-I--D----------YEL-----------------------------------------ILK-K-----GIYYAMQI---L---E------------NNN------------I-------TI-N-----------P-KAILN---------------RSF--------K-LK--I--EE-EE-I---FN------FY--K-SNT----------------SFVSFLLKELYH -_Borrelia_miyamotoi_764988637 
L-------KEKLKIL---IKEES-----------YNK-IETARILKEINE-----SKYYA--------LDG----YK--S-FTAFIK--------SY-KI-AKTSVYRYI-KLVSGIDSGK-I--D----------YDL-----------------------------------------ILN-R-----GIDYAIKI---L---E------------SNN------------I-------IS-K-----------S-NVNPL---------------RPL--------R-FQ--L--DD-EE-C---FH------FY--K-SNT----------------KFASFLLKEIFK BHO_0016000_Borrelia_hermsii_YBT_576092807 L-------KEKLKIL---IKEES-----------YNK-IETARILKEINE-----SKYYA--------LDG----YK--S-FTAFIK--------SY-KI-AKTSIYRYI-KLVTGIDSGK-I--D----------YDL-----------------------------------------ILS-R-----GVDYAIKV---L---E------------NNS------------I-------IS-K-----------S-NVNPL---------------RPL--------R-FQ--L--DD-EE-S---FH------FY--K-SNT----------------KFASFLLKEIYK I871_B18_Borrelia_miyamotoi_LB-2001_736012165 L-------KEKLKIL---IKEES-----------YNK-IETARILKEINE-----SKYYA--------LDG----YK--S-FTAFIK--------SY-KI-AKTSVYRYI-KLVSGIDSGK-I--D----------YDL-----------------------------------------ILN-R-----GIDYAIKI---L---E------------SNN------------I-------IS-K-----------S-NVNPL---------------RPL--------R-FQ--L--DD-EE-C---FH------FY--K-SNT----------------KFASFLLKEIFK -_Borrelia_duttonii_740581845 L-------KDRLKSL---VVDDI-----------YNK-IETAKILSLINE-----KKLYI--------FDG----YK--S-FYGFLA--------DF-KI-AKSQAYKYI-KIASGMEQGV-I--D----------YDF-----------------------------------------IIN-N-----GIENTIKK---L---G------------SKN------------I-------VK-K-----------S-KHNLT---------------KQL--------C-FE--F--KS-QD-S---YD------FY--K-RDT----------------KFMCFVLDELFI -_Borrelia_crocidurae_504496248 L-------KDRLKSL---VVDDI-----------YNK-IETAKILSLINE-----KKLYI--------FDG----YK--S-FYGFLA--------DF-KI-AKSQAYKYI-KIVSGMAQGI-I--D----------YDF-----------------------------------------IIN-N-----GIENTIKK---L---G------------SKN------------I-------VK-K-----------S-KHNLT---------------KQL--------C-FQ--F--KS-QD-S---YD------FY--K-RDT----------------KFMCFVLDELFI -_Borrelia_hispanica_639482667 
L-------KDRLKSL---VVDDI-----------YNK-IETAKILSLINE-----KKLYI--------FDG----YK--S-FYGFLA--------DF-KI-AKSQAYKYI-KIASGMAQGI-I--D----------YDF-----------------------------------------IIN-N-----GIENTIKK---L---G------------SKN------------S-------IK-K-----------S-KHNLT---------------KQL--------C-FQ--F--KN-QD-S---YD------FY--K-SDT----------------KFVCFVLDELFI -_Borrelia_persica_639480863 C-------KHSLRKS---MVNDI-----------ENK-IQIMEILYNVRK-----KKLYR--------FDH----HA--T-FDAFIK--------AF-GI-GKTQAYLYL-KIYEQILKGT-L--T----------VKE-----------------------------------------IRE-K-----GLIEIYRN---I-KLK------------EIS------------A-------KK-S-------------RQNLI---------------KPL--------R-FQ--L--KD-HK-S---YD------FY--K-KRS----------------KFAAFILDKLFL AM1_B0079_Acaryochloris_marina_MBIC11017_158310190 M-S-----VNDLSEC---VEAGL-NAGAYAI-------AQAGLALREIQR-----RGEYP--T---------T--------FEAFVK-------DKF-AL-TRARAYQLM--YAADIIADL----A----------SVF-----------------------------------------ESN-K--------------------------------------------------------LP-R-------------SESAV---------------RPM----------IG--L--TK-QQ-R---IE------VW--R-RAL-KGKQR----------SPGYGTVKAIVE -_Acaryochloris_marina_753958401 M-S-----VNDLSEC---VEAGL-NAGAYAI-------AQAGLALREIQR-----RGEYP--T---------T--------FEAFVK-------DKF-AL-TRARAYQLM--YAADIIADL----A----------SVF-----------------------------------------ESN-K--------------------------------------------------------LP-R-------------SESAV---------------RPM----------IG--L--TK-QQ-R---IE------VW--R-RAL-KGKQR----------SPGYGTVKAIVE -_Scytonema_millei_748136693 L-T--PE-EERERHR---LELRV-E---RAF-------FEAGKALRELRE-----RRLYR--S---------T--HK--S-WEAYCQ-------ERF-GF-GRDSADIKI--SASRVVEEI---------------REY-----------------------------------------LPT-N-----RRQI-----------------------------------------------LP-T-------------TLEQV---------------RPL----------LKLKA--SS-E--R---IE------AW--L-KAI-DTNHG--------R-IPNGRIVKGIVK Strvi_0238_Streptomyces_violaceusniger_Tu_4113_344043288 
L-S--PD-EAEDLRQ---CERAF-ANADEAE-------WMRGKAAHAVRD-----RRLYR---------------PR--T-WPDYCE-------EVL-GE-SESEVNRMI--QEWPIGAMI----T----------QIW-----------------------------------------VTP-R------------------------------------------------P-------TP-A-------------SHRRA-----L---------LPL--VDL-----YG--L--EA-TA-R---GY-VLLR-TW--------AAENN--------E-RVTATVLTAMVD -_Streptomyces_violaceusniger_759522371 L-S--PD-EAEDLRQ---CERAF-ANADEAE-------WMRGKAAHAVRD-----RRLYR---------------PR--T-WPDYCE-------EVL-GE-SESEVNRMI--QEWPIGAMI----T----------QIW-----------------------------------------VTP-R------------------------------------------------P-------TP-A-------------SHRRA-----L---------LPL--VDL-----YG--L--EA-TA-R---GY-VLLR-TW--------AAENN--------E-RVTATVLTAMVD -_Reyranella_massiliensis_522187926 V-------TEAEAHR---LAAEI-A---EAS-D-FDA-FRLGGLLARIHR-----ERWYR--------GAG----YP--D-FRSYVE-------ARH-GF-KLRKALYLA-----AIYESV-I--D----------LGL-----------------------------------------TWQ-E----------------L---------------------------------------RP-V-------------GWSKL---------------KEL----------VG--V--VD-RD-N---AR------DW----LAI-AAAEG-----------MTVLKLHALVQ -_Cyanothece_sp_PCC_7822_754535969 L-T--SE-EEQELLQ---LEGCI-E---RSF-------YQAGLALKTIRD-----KRLYR--F---------L--YA--T-FEEYCR-------ERF-GF-ARRHSYQLI--DAAVVMDNL---------------LAIEPQYEPGAQTENPFDHLCA-IGAQIENTFDHLSANGT----QTQ-T--II-SDETPAAK---------------SLA--SRQ------------I-------LP-T-------------SERQV---------------RPL----------IS--L--NP-SQ-Q---RE------AW--V-KAV-NLAQG--------K-VPSNRIVSLVAD Cyan7822_6496_Cyanothece_sp_PCC_7822_306986392 L-T--SE-EEQELLQ---LEGCI-E---RSF-------YQAGLALKTIRD-----KRLYR--F---------L--YA--T-FEEYCR-------ERF-GF-ARRHSYQLI--DAAVVMDNL---------------LAIEPQYEPGAQTENPFDHLCA-IGAQIENTFDHLSANGT----QTQ-T--II-SDETPAAK---------------SLA--SRQ------------I-------LP-T-------------SERQV---------------RPL----------IS--L--NP-SQ-Q---RE------AW--V-KAV-NLAQG--------K-VPSNRIVSLVAD -_Lachnospiraceae_bacterium_10-1_510895729 
I----EI-IKDESFR---VQKSF---------------VKIGWYLKHIRD-----NELFK--------EDG----YA--S-IWECAA-------DQL-GY-SQATASRFI-----NICEKF----S----------KNH---------------------------------------------N---------------------------------SPE------------L-----D-VK----YAGF-------DKSQM---------------IEM----------LP--M--EP-EQ-----LE------------KVV--------------P-EMTVKQIRDIKT BDCR2A_01333_Borrelia_duttonii_CR2A_576313683 L-------KKQLKSN---FKNEV-----------YNR-VETMKILKEIKD-----NEYYK--------MDG----YK--T-FDSFIK--------DY-KL-AKSQVYDYL-KVATAIEKGI-V--E----------EVY-----------------------------------------LLE-N-----GFKNTVIL---L-RNS------------DSN------------L-------MK-K-----------S-RRNPI---------------KPL--------R-FQ--L--KR-QD-S---YD------FY--K-KNL----------------KLASFVLDELFV -_Borrelia_valaisiana_501894927 L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIQEGI-L--E----------EVY-----------------------------------------VIE-N-----GVSKAIAV---L-R-E------------SPS------------G-------LK-K-----------S-KQNSI---------------KPL--------R-FQ--L--KT-KE-S---YD------FY--K-SNV----------------KFTGFMMHEIFE -_Borrelia_duttonii_740581624 L-------KKQLKSN---FKNEV-----------YNR-VETMKILKEIKD-----NEYYK--------MDG----YK--T-FDSFIK--------DY-KL-AKSQVYDYL-KVATAIEKGI-V--E----------EVY-----------------------------------------LLE-N-----GFKNTVIL---L-RNS------------DSN------------L-------MK-K-----------S-RRNPI---------------KPL--------R-FQ--L--KR-QD-S---YD------FY--K-KNL----------------KLASFVLDELFV -_Borrelia_hispanica_639482723 L-------KKQLKSN---FKNEV-----------YNR-IETMKILKEIKD-----NEYYK--------MDG----YK--T-FDSFIK--------DY-KL-AKSQVYDYL-KVATAIEKGI-V--E----------EVY-----------------------------------------LLE-N-----GFKNTIIL---L-RNS------------DSN------------L-------VK-K-----------S-RRNPI---------------KPL--------R-FQ--L--KR-QD-S---YD------FY--K-KNS----------------KLASFILDELFA BCD_1877_Borrelia_crocidurae_DOU_576102765 
L-------KKQLKSN---FKNEV-----------YNR-VETMKILKEIKD-----NEYYK--------MDG----YK--T-FDSFIK--------DY-KL-AKSQVYDYL-KVATAIEKGI-V--E----------EVY-----------------------------------------LLE-N-----GFKNTVIL---L-RNS------------DSN------------L-------MK-K-----------S-RRNPI---------------KPL--------R-FQ--L--KR-QD-S---YD------FY--K-KNL----------------KLASFVLDELFV -_Borrelia_crocidurae_644981026 L-------KKQLKSN---FKNEV-----------YNR-VETMKILKEIKD-----NEYYK--------MDG----YK--T-FDSFIK--------DY-KL-AKSQVYDYL-KVATAIEKGI-V--E----------EVY-----------------------------------------LLE-N-----GFKNTVIL---L-RNS------------DSN------------L-------MK-K-----------S-RRNPI---------------KPL--------R-FQ--L--KR-QD-S---YD------FY--K-KNL----------------KLASFVLDELFV -_Borrelia_bissettii_503783476 L-------KNKLVNN---FKKEI-----------FHK-IEVIKILKEIKD-----NEYYK--------LDG----YT--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAMEEGL-L--E----------ETY-----------------------------------------IIE-N-----GLTMSLLS---I-RDK------------ESS------------S-------FK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YA------FY--K-SNA----------------KFTGFLLDELFE -_Borrelia_finlandensis_501928340 L-------KNKLVNN---FKKEI-----------FHK-IEVIKILKEIKD-----NEYYK--------LDG----YT--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAMEEGL-L--E----------ETY-----------------------------------------IIE-N-----GLTMSLLS---I-RDK------------ESS------------S-------FK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE -_Borrelia_persica_639480210 L-------KQKLKFN---FQQEV-----------YYK-IESIKVLKEIKD-----NEYYK--------FDG----YR--T-FEDFIK--------EY-KL-ARSQVYDYL-KIATAIENGI-L--E----------ESY-----------------------------------------VVE-N-----GITRTIAF---L-R-T------------TTS------------K-------LK-K-----------S-KRNLI---------------KPL--------R-FQ--L--KN-QK-S---YD------YY--K-KNA----------------KLTGFILDRLFL -_Borrelia_hermsii_645010627 
L-------KQKLKPN---FQQEI-----------YYK-MEAIKILKEIKD-----NEYYK--------LDG----YR--I-LEDFIK--------DY-KL-ARSQAYDYL-KIATALENGI-L--D----------ESY-----------------------------------------VVE-N-----GITQAIAF---L-R-T------------TSN------------K-------LK-K-----------S-KRNLI---------------KPL--------R-FQ--L--KS-QE-S---YN------FY--K-KNA----------------RFTGFILDILFS -_Borrelia_bissettii_503783755 L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIKDGI-L--E----------ESY-----------------------------------------VIE-N-----GVTKTLEF---L-R-K------------SPN------------V-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-SNA----------------KFTGFMLDKLFS -_Borrelia_hermsii_644979647 L-------KQKLKSN---FQQEI-----------YYK-MEAIKILKEIKD-----NEYYK--------LDG----YR--I-FEDFIK--------DY-KL-ARSQAYDYL-KIATALENGI-L--D----------ESY-----------------------------------------VVE-N-----GITQAIAF---L-R-T------------TSN------------K-------LK-K-----------S-KRNLI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-KNA----------------RFTGFILDILFS -_Borrelia_duttonii_501531293 L-------KQKLKSH---CQQEI-----------YYK-MEIIKILKEIKD-----NEYYK--------LDN----YK--T-FEDFIR--------DY-KL-ARSQVYDYL-KIANAIENGI-L--Q----------ESY-----------------------------------------VVE-N-----GITHTIAF---L-R-S------------DSG------------L-------FK-K-----------KLRRNAL---------------KPL--------K-FQ--L--KK-QE-S---YN------FY--K-KNV----------------KFAEFLLDTLFL -_Borrelia_crocidurae_504499910 L-------KQKLKSH---CQQEI-----------YYK-MEIIKILKEVKD-----NEYYK--------LDN----YK--T-FEDFIR--------DY-KL-ARSQVYDYL-KIANAIENGI-L--Q----------ESY-----------------------------------------VVE-N-----GITHTIAF---L-R-S------------DSR------------L-------FK-K-----------KLRRDSL---------------KPL--------K-FQ--L--KK-HE-S---YN------FY--K-KNV----------------KFAEFLLDTLFL -_Borrelia_hermsii_644979702 
L-------KSKLIIN---FKSEI-----------CSR-IETMKVLKEIKD-----NEYYK--------LDG----YK--N-FEDFTK--------DY-KL-AKSQAYDYL-KVAGAIEEGI-I--E----------ESF-----------------------------------------LIE-N-----GFRQTLYV---L-RNS------------DSN------------T-------LN-K-----------S-RVNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-KNA----------------KFTSFLMDNLFE -_Borrelia_hermsii_645063163 L-------KSKLIIN---FKSEI-----------CSR-IETMKVLKEIKD-----NEYYK--------LDG----YK--N-FEDFTK--------DY-KL-AKSQAYDYL-KVAGAIEEGI-I--E----------ESF-----------------------------------------LIE-N-----GFRQTLYV---L-RNS------------DSN------------T-------LN-K-----------S-RVNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-KNA----------------KFTSFLMDNLFE -_Borrelia_hermsii_645010701 L-------KDKLVNN---FKKEI-----------FHK-IEVIKILKEIKD-----NEYYK--------LDG----YN--S-FNSFAK--------NY-KI-ARTQVYDYL-RLANAMEEGL-L--E----------ERF-----------------------------------------IIE-N-----GLTISLLS---L-RDK------------EGV------------N-------IK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-QN-S---YA------FY--K-SNA----------------KFTSFLMDELFE -_Borrelia_hermsii_644979468 L-------KDKLVNN---FKKEI-----------FHK-IEVIKILKEIKD-----NEYYK--------LDG----YN--S-FNSFAK--------NY-KI-ARTQVYDYL-RLANAMEEGL-L--E----------ERF-----------------------------------------IIE-N-----GLTISLLS---L-RDK------------EGV------------N-------IK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-QN-S---YA------FY--K-SNA----------------KFTSFLMDELFE -_Borrelia_duttonii_752506021 L-------KLKLKSN---LKENI-----------YNK-LEAMKILKEIKD-----NDYYK--------LDG----YK--R-FSEFLG--------SY-KV-AKSQAYNYL-KIATAIEKGL-L--E----------EQY-----------------------------------------VLE-N-----GFREVLSL---I-RIK------------EGV------------K-------IK-K-----------S-RQSGL---------------KSL--------R-FH--F--KS-QE-S---YD------FY--R-QNV----------------KFAGFLMDALFK -_Borrelia_hermsii_645010853 
L-------KEKLKQN---ARKEI-----------YYK-VESIRILKEIKD-----NGYYK--------LDG----HK--N-FDSFIK--------SY-RM-AKTQVYAYL-RLANAIEEGM-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---IKKNK------------ESI------------A-------IK-K-----------S-RQKAI---------------NPL--------R-FQ--L--KS-QD-S---YD------FY--K-QNS----------------KFTSFVLDTLFL -_Borrelia_hispanica_639481672 L-------KEKLKQN---ARKEI-----------YYK-VENIRILKEIKD-----NEYYK--------LDG----HK--N-FDSFIK--------DY-RM-AKTQVYAYL-RLANAIEAGI-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---I-KNK------------KTV------------K-------TK-K-----------V-KQDSI---------------KLL--------S-FK--L--KN-QA-S---YD------FY--K-NNV----------------KFMAFMLDTIFL -_Borrelia_burgdorferi_group_496158399 L-------KKKLYVN---LREGV-----------SNR-VECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-DV-AKTQAYNYL-KIANAIEAGV-I--E----------EQY-----------------------------------------VLD-N-----GFRLILSV---L-KDK------------ESP------------V-------LK-K-----------S-KQNSI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-SNA----------------KFTSFMMQEIFE -_Borrelia_coriaceae_645024139 L-------KKKLYIN---LREGI-----------YNR-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------NY-DV-AKTQAYNYL-KIATALEEGL-L--E----------EQY-----------------------------------------VLE-N-----GFRQILGL---L-KDK------------ESE------------K-------LK-K-----------S-RVNPI---------------KPL--------R-FQ--L--KS-QA-S---YD------FY--K-QNA----------------KFTSFLMDRLFA -_Borrelia_miyamotoi_763123871 L-------KEKLKQN---ARKEI-----------YYK-VENIRILKEIKD-----NEYYK--------LDG----HK--N-FDNFIK--------SY-RM-AKTQVYAYL-RLANAIEDGL-L--A----------EQY-----------------------------------------IIE-N-----GINESLAM---I-KNK------------ESV------------K-------IK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-QD-S---YD------FY--K-EHS----------------KFTAFILDTLFS BCD_1474_Borrelia_crocidurae_DOU_576102339 
L-------KLKLKSN---LKENI-----------YNK-LEAMKILKEIKD-----NDYYK--------LDG----YK--R-FSEFLG--------SY-KV-AKSQAYNYL-KIATAIEKGL-L--E----------EQY-----------------------------------------VLE-N-----GFREVLSL---I-RIK------------EGV------------K-------IK-K-----------S-RQSGL---------------KSL--------K-FH--F--KS-QE-S---YD------FY--R-QNV----------------KFASFLMDTLFK -_Borrelia_burgdorferi_695262165 L-------KKKLYVN---LREGV-----------SNR-VECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-DV-AKTQAYNYL-KIANAIEAGV-I--E----------EQY-----------------------------------------VLD-N-----GFRLILSV---L-KDK------------ESP------------V-------LK-K-----------S-KQNSI---------------KPL--------R-FQ--L--KA-QE-S---YD------FY--K-SNA----------------KFTSFMMQEIFE -_Borrelia_garinii_657235060 L-------KKKLYVN---LREGV-----------SNR-IACMKILKEIKD-----NEYYK--------IDG----YK--S-FDAFIK--------DY-DV-AKTQAYNYL-KIANAIESGV-I--E----------EQY-----------------------------------------VLD-N-----GFRSILSV---L-KDK------------ESP------------A-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YE------FY--K-SNA----------------KFTGFLLDKLFS -_Borrelia_hermsii_695262844 L-------KKKLYIN---LREGI-----------YNR-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------NY-DV-AKTQAYNYL-KIATALEEGL-L--E----------EQY-----------------------------------------VLE-N-----GFRQILSL---L-KDK------------ESA------------T-------IK-K-----------S-KVNPI---------------KPL--------R-FQ--L--KS-QE-S---YG------FY--K-SNA----------------KFTSFLMDELFE -_Borrelia_coriaceae_654876319 L-------KEKLKQN---ARKEI-----------YYK-VENIRILKEIKD-----NEYYK--------FDG----HK--N-FDSFIK--------SY-RM-AKTQVYAYL-RLANAIAEGM-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---I-KDK------------VSQ------------T-------VR-K-----------S-KQNSI---------------KPL--------R-FQ--L--KS-QE-S---YN------FY--K-ENS----------------KFTAFVLDTLFS -_Borrelia_bissettii_503783548 
L-------KKKLYVN---LREGV-----------SNR-VECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-DV-AKTQAYNYL-KIANAIEAGV-I--E----------EQY-----------------------------------------VLD-N-----GFRLILSV---L-KDK------------ESP------------V-------LK-K-----------S-KQNSI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-SNA----------------KFTSFMMQEIFE -_Borrelia_persica_740577787 L-------MAKMKQN---SKKEI-----------YYK-VENIRILKEIKD-----NEYYK--------LEG----HK--S-FDKFIE--------SY-RM-AKTQVYAYL-RLANAMEKGI-L--A----------EQY-----------------------------------------IIE-H-----GINESLAL---I-KER------------KLL------------K-------LK-R-----------S-TQDLV---------------KPL--------R-FQ--L--ET-YE-S---YD------FY--K-KNS----------------KFISFLLEKLFA -_Borrelia_duttonii_501533328 L-------KEKLKQN---ARKEI-----------YYK-IENIRILKEIKD-----NEYYK--------LDG----HK--H-FDSFIK--------DY-RM-AKTQVYAYL-RLANAMEKGI-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---I-KNK------------KTI------------K-------TK-K-----------I-RRSSI---------------NSL--------S-FK--L--KN-QA-S---YD------FY--K-NNV----------------KFIAFMLDTIFL -_Borrelia_hermsii_645063171 L-------KKKLYIN---LREGI-----------YNR-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------NY-DV-AKTQAYNYL-KIATALEEGL-L--E----------EQY-----------------------------------------VLE-N-----GFRQILSL---L-KDK------------ESA------------T-------IK-K-----------S-KVNPI---------------KPL--------R-FQ--L--KS-QE-S---YG------FY--K-SNA----------------KFTSFLMDELFE -_Borrelia_crocidurae_644980725 L-------KEKLKQN---ARKEI-----------YYK-VENIRILKEIKD-----NEYYK--------LDG----HK--N-FDSFIK--------DY-RM-AKTQVYVYL-RLANAMEKGI-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---I-KNK------------KTI------------K-------TK-K-----------I-RRDSI---------------NSL--------S-FK--L--KN-QA-S---YD------FY--K-NNV----------------KFIAFMLDTIFL BDU_7025_Borrelia_duttonii_Ly_201084505 
L-------KLKLKSN---LKENI-----------YNK-LEAMKILKEIKD-----NDYYK--------LDG----YK--R-FSEFLG--------SY-KV-AKSQAYNYL-KIATAIEKGL-L--E----------EQY-----------------------------------------VLE-N-----GFREVLSL---I-RIK------------EGV------------K-------IK-K-----------S-RQSGL---------------KSL--------R-FH--F--KS-QE-S---YD------FY--R-QNV----------------KFAGFLMDALFK -_Borrelia_recurrentis_501533114 L-------KEKLKQN---ARKEI-----------YYK-VENIRILKEIKD-----NEYYK--------LDG----HK--H-FDSFIK--------DY-RM-AKTQVYAYL-RLANAMEKGI-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---I-KNK------------KTI------------K-------IK-K-----------I-RRGSI---------------NSL--------S-FK--L--KN-QA-S---YD------FY--K-NNV----------------KFIAFMLDTIFL BHO_0006701_Borrelia_hermsii_YBT_576093010 L-------KEKLKQN---ARKEI-----------YYK-VESIRILKEIKD-----NGYYK--------LDG----HK--N-FDSFIK--------SY-RM-AKTQVYAYL-RLANAIEEGM-L--A----------EQY-----------------------------------------IIE-N-----GINESLAI---IKKNK------------ESI------------A-------IK-K-----------S-RQKAI---------------NPL--------R-FQ--L--KS-QD-S---YD------FY--K-QNS----------------KFTSFVLDTLFL -_Borrelia_crocidurae_749307948 L-------KLKLKSN---LKENI-----------YNK-LEAMKILKEIKD-----NDYYK--------LDG----YK--R-FSEFLG--------SY-KV-AKSQAYNYL-KIATAIEKGL-L--E----------EQY-----------------------------------------VLE-N-----GFREVLSL---I-RIK------------EGV------------K-------IK-K-----------S-RQSGL---------------KSL--------K-FH--F--KS-QE-S---YD------FY--R-QNV----------------KFASFLMDTLFK -_Borrelia_valaisiana_506379547 L-------KKKLYVN---LREGI-----------SNR-IECMKILKEIKD-----NKYYK--------LDG----YK--S-FDAFIK--------DY-DV-AKTQAYNYL-KIANAIESGV-I--E----------EQY-----------------------------------------VLD-N-----GFRLILSV---F-KNK------------ESP------------T-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-SNA----------------KFTGFLLDKLFS BCD_1485_Borrelia_crocidurae_DOU_576102351 
L-------KERIVNN---FKKEI-----------FHK-IEVIKALKEIKD-----NKYYE--------LDG----YK--S-FNSFAK--------NF-RI-ARTQVYEYL-RIGDAFEEGL-L--E----------EQF-----------------------------------------VIE-N-----GINETISF---L-RNK------------EGI------------S-------IK-R-----------S-RQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNS----------------KFTGFLLDTLFD -_Borrelia_crocidurae_644980346 L-------KKKLYIN---LREGI-----------YNK-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------KY-DV-AKTQAYNYL-KIAAALEEGL-L--E----------EQF-----------------------------------------VLE-N-----GFRQILSL---L-RDK------------QGK------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNA----------------KFTSFLLDKLFQ -_Borrelia_duttonii_501533271 L-------KKKLYIN---LREGI-----------YNK-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------NY-DV-AKTQAYNYL-KIAAALEEGL-L--E----------EQF-----------------------------------------VLE-N-----GFRQILSL---L-RDK------------QGK------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KR-QE-S---YD------FY--K-KNA----------------KFTSFLLDKLFQ -_Borrelia_hispanica_639481710 L-------KKKLYIN---LREGI-----------YNK-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------NY-DV-AKTQAYNYL-KIAAALEEGL-L--E----------EQF-----------------------------------------VLE-N-----GFRQILSL---L-RDK------------QGK------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KR-QE-S---YD------FY--K-KNA----------------KFTSFLLDKLFQ -_Borrelia_duttonii_740582129 L-------KKKLYIN---LREGI-----------YNK-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------NY-DV-AKTQAYNYL-KIAAALEEGL-L--E----------EQF-----------------------------------------VLE-N-----GFRQILSL---L-RDK------------QGK------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KR-QE-S---YD------FY--K-KNA----------------KFTSFLLDKLFQ BDCR2A_01875_Borrelia_duttonii_CR2A_576313055 
L-------KERIVNN---FKKEI-----------FHK-IEVIKALKEIKD-----NKYYE--------LDG----YK--S-FNSFAK--------NF-RI-ARTQVYEYL-RIGDAFEEGL-L--E----------EQF-----------------------------------------VIE-N-----GINETISF---L-RNK------------EGI------------S-------IK-R-----------S-RQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNS----------------KFTGFLLDTLFD -_Borrelia_crocidurae_504509673 L-------KERIVNN---FKKEI-----------FHK-IEVIKALKEIKD-----NKYYE--------LDG----YK--S-FNSFAK--------NF-RI-ARTQVYEYL-RIGDAFEEGL-L--E----------EQF-----------------------------------------VIE-N-----GINETISF---L-RNK------------EGI------------S-------IK-R-----------S-RQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNS----------------KFTGFLLDTLFD -_Borrelia_duttonii_740582639 L-------KERIVNN---FKKEI-----------FHK-IEVIKALKEIKD-----NKYYE--------LDG----YK--S-FNSFAK--------NF-RI-ARTQVYEYL-RIGDAFEEGL-L--E----------EQF-----------------------------------------VIE-N-----GINETISF---L-RNK------------EGI------------S-------IK-R-----------S-RQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KNS----------------KFTGFLLDTLFD -_Borrelia_hispanica_639482094 L-------KERIVNN---FKKEI-----------FHK-IEVIKALKEIKD-----NKYYE--------LDG----YK--S-FNSFAK--------NF-RI-ARTQVYEYL-RIGDAFEEGL-L--E----------EQF-----------------------------------------VIE-N-----GINETISF---L-RNK------------EGI------------S-------IK-R-----------S-RQNPI---------------KPL--------R-FQ--L--KK-QA-S---YD------FY--K-KNS----------------KFTGFLLDTLFD -_Borrelia_persica_740577610 L-------KEKIINN---FKKEI-----------FHK-IETIKALKEIKD-----NKYYK--------LDG----HN--S-FNSFSK--------NF-RL-ARSQVYEYL-RIGDAFEEGL-L--E----------EQF-----------------------------------------VIE-Y-----GIKYTISF---L-RNK------------EGI------------S-------LK-K-----------S-KVNPI---------------KPL--------R-FQ--L--KC-QE-S---YD------YY--K-KDS----------------KFTSFVMDTLFR BOM_0964_Borrelia_miyamotoi_FR64b_576103756 
L-------KDKLVFN---FQKEI-----------LNK-IESMKILKEIKD-----NQYYK--------LDG----YN--N-FEEFTR--------HY-KI-AKTQAYEYL-KIANAMEEGL-I--Q----------EQD-----------------------------------------IIK-N-----GIHNIILS---L-RDK------------EGT------------N-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------YY--K-KNA----------------KFTSFLMDELFS -_Borrelia_hispanica_639482644 L-------KERLVFN---FQKEI-----------LNK-IESMKILKEIKD-----NAYYK--------LDG----YK--N-FEEFTR--------HY-RI-AKTQAYEYL-KIANAIQEGL-V--E----------EQD-----------------------------------------IIE-N-----GIHDIILS---L-RNK------------AGF------------N-------IK-K-----------S-RQNVI---------------KPL--------K-FR--L--KR-QE-S---YD------FY--K-KNP----------------KFTGFILDEIFF -_Borrelia_miyamotoi_763123770 L-------KDKLVFN---FQKEI-----------LNK-IESMKILKEIKD-----NQYYK--------LDG----YN--N-FEEFTR--------HY-KI-AKTQAYEYL-KIANAMEEGL-I--Q----------EQD-----------------------------------------IIK-N-----GIHNIILS---L-RDK------------EGT------------N-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------YY--K-KNA----------------KFTSFLMDELFS -_Borrelia_crocidurae_504509579 L-------KDKLKEN---FKREI-----------YYK-VESMKILKEIKD-----NEYYK--------LDN----YK--S-FEGFIK--------DY-KV-AKTQAYAYL-RLANALHDGI-I--E----------ENY-----------------------------------------IIE-N-----GIHNALDL---I-GHE------------GSK------------A-------VK-K-----------S-KQNKI---------------KPL--------R-FQ--L--KK-QA-S---YD------FY--K-KNS----------------KFTGYLLDMLFE -_Borrelia_persica_740577602 L-------KLKLKSN---FKEGI-----------YNK-LEAMKILKEIKD-----NHYYR--------YDG----YK--K-FSDFLG--------SY-DV-AKSQAYNYL-KIATAIEQGI-I--E----------ENY-----------------------------------------VLE-N-----GFREVLHL---I-RSK------------GCE------------K-------FK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KN-QA-S---YD------FY--K-KNA----------------KFTSFLMDKLFL -_Borrelia_duttonii_499985609 
L-------KDKLKEN---FKREI-----------YYK-VESMKILKEIKD-----NEYYK--------LDN----YK--S-FEGFIK--------DY-KV-AKTQAYAYL-RLANALHDGI-I--E----------ENY-----------------------------------------IIE-N-----GIHNALDL---I-GHE------------GSK------------T-------VR-K-----------S-KQNKI---------------KPL--------R-FQ--L--KK-QA-S---YD------FY--K-KNS----------------KFTGYLLDMLFE -_Borrelia_garinii_657235493 L-------KKKLKDS---FRSEI-----------YYK-MEVIKILKEIKD-----NKYYK--------LDG----YR--I-FEDFIK--------DY-DL-ARTQVYGYL-KIANAIQEGL-L--K----------ENY-----------------------------------------VIQ-N-----GVTKTIAF---L-K-K------------SID------------V-------SK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNA----------------KFTGYLLDKLFN -_Borrelia_hispanica_740573754 L-------KDKLKEN---FKREI-----------YYK-VESMKILKEIKD-----NEYYK--------LDN----YK--S-FEGFIK--------DY-KV-AKTQAYAYL-RLANALHDGI-I--E----------ENY-----------------------------------------IIE-N-----GIHNALDL---I-GHE------------GSK------------T-------VR-K-----------S-KQNKI---------------KPL--------R-FQ--L--KK-QA-S---YD------FY--K-KNS----------------KFTGYLLDMLFE -_Borrelia_crocidurae_504496143 L-------KERLKAN---FRKEI-----------FHK-IDSIRILKEIKD-----NKYYK--------LDG----YK--S-FDSFIK--------SY-RL-ARSQVYVYL-KIANAIEEGL-I--E----------ENY-----------------------------------------IIE-N-----GIHDTFNL---I-QNT------------GNK------------T-------IN-K-----------S-KKESI---------------QSL--------S-FH--L--KH-QE-S---YD------FY--K-QNI----------------KIISFILDELVV -_Borrelia_burgdorferi_499985629 L-------KEKLKTN---FKKEI-----------FHK-VENIRILKEIKD-----NEYYK--------FDG----YK--N-FLDFVK--------NF-NV-AKSQAYKYL-KLATALQDGV-L--N----------ENY-----------------------------------------VIE-N-----GIHNSFNY---I-KDK------------ESP------------S-------LK-K-----------S-KENPI---------------KPL--------R-LK--L--KT-QE-S---YD------FY--K-SKA----------------KFTSFMMNEIFE -_Borrelia_coriaceae_645023888 
L-------KQKLKSN---FQQEI-----------YYK-MEAIKILKEIKD-----NEYYK--------LDG----YR--I-FEDFIK--------DY-KL-ARSQAYDYL-KIATALANGT-L--E----------ENY-----------------------------------------VIE-N-----GITQTIAF---L-R-T------------TSS------------K-------LK-K-----------S-KYNLI---------------KPL--------H-LQ--L--KS-QE-S---YD------FY--K-KNA----------------KFTGFILDILFS BCD_1669_Borrelia_crocidurae_DOU_576102548 L-------KDKLKEN---FKREI-----------YYK-VESMKILKEIKD-----NEYYK--------LDN----YK--S-FEGFIK--------DY-KV-AKTQAYAYL-RLANALHDGI-I--E----------ENY-----------------------------------------IIE-N-----GIHNALDL---I-GHE------------GSK------------A-------VK-K-----------S-KQNKI---------------KPL--------R-FQ--L--KK-QA-S---YD------FY--K-KNS----------------KFTGYLLDMLFE -_Borrelia_garinii_657235558 L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NKYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------ENY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------KTS------------I-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDKLFN -_Borrelia_burgdorferi_497942842 L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKIND-----KRLYL--------EGG----YK--S-FKDFLS--------DF-KL-AKTQSYEYI-KLAAAIEAGI-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KNQ------------ANG------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-SNS----------------RFVSFMMDEIFK -_Borrelia_burgdorferi_671550272 L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKIND-----KRLYL--------EGG----YK--S-FKDFLS--------DF-KL-AKTQSYEYI-KLAAAIEAGI-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KNQ------------ANG------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-SNS----------------RFVSFMMDEIFK -_Borrelia_garinii_657248004 
L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK -_Borrelia_garinii_696413767 L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGI-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK -_Borrelia_garinii_671520434 L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGI-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK -_Borrelia_garinii_657234804 L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SHT----------------RFTSFMMDEIFK -_Borrelia_garinii_671556237 L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK -_Borrelia_garinii_501710213 
L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK BGP219_Borrelia_garinii_PBi_52696733 L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAVETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK -_Borrelia_afzelii_501669395 L-------ITQLRNN---IKSEI-----------YNI-IDTMKILKKIND-----KKLYV--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-RLATAVETGL-L--E----------ENF-----------------------------------------ITS-N-----GIRASIRY---V-KNK------------TSG------------I-------IK-K-----------S-KQNSI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNA----------------NFTNFMMNEIFE -_Borrelia_spielmanii_501898261 L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAVETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YT------FY--K-SNT----------------RFTSFMMDEIFK -_Borrelia_valaisiana_506379500 L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK -_Borrelia_finlandensis_501928245 
L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKIND-----KKLYL--------EGG----YK--S-FKDFLS--------DF-KL-AKTQSYEYI-KLAAAIEAGI-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRF---I-KNK------------TNG------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FK--L--KN-QE-S---YD------FY--K-SNS----------------RFVSFMMDEIFK -_Borrelia_burgdorferi_740592163 L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKIND-----KRLYL--------EGG----YK--S-FKDFLS--------DF-KL-AKTQSYEYI-KLAAAIEAGI-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KNQ------------ANG------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-SNS----------------RFVSFMMDEIFK -_Borrelia_garinii_696412166 L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKISD-----KKLYL--------EGG----YK--S-FKNFLS--------DF-KL-AKTQSYEYI-KLANAIETGL-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KSK------------TNG------------A-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNT----------------RFTSFMMDEIFK -_Borrelia_bissettii_503789140 L-------ITQLKNN---IKSEI-----------YNI-IDTMKILKKIND-----KKLYL--------EGG----YK--S-FKDFLS--------DF-KL-AKTQSYEYI-KLATAVEEGV-L--E----------ENF-----------------------------------------ITN-N-----GIRASIRY---I-KNK------------ANG------------T-------MK-K-----------S-KQNLI---------------KPL--------K-FQ--L--KN-QE-S---YA------FY--K-SNS----------------KFASFMMDEIFK -_Borrelia_hispanica_639481943 L-------KEQLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NAYYK--------LDG----YK--S-FDAFIK--------DY-KL-AKSQTYEYL-KIASAIESGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RKN------------NAD------------S-------VK-K-----------S-RINPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-KNS----------------KLTSFILDELFE -_Borrelia_duttonii_740582201 
L-------KEQLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NAYYK--------LDG----YK--S-FDAFIK--------DY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNN------------NAD------------S-------IK-K-----------S-RINPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-KNS----------------KLTSFILDELFE -_Borrelia_crocidurae_504509606 L-------KEQLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NAYYK--------LDG----YK--S-FDAFIK--------DY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNN------------NAD------------S-------IK-K-----------S-RINPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-KNS----------------KLTSFILDELFE -_Borrelia_burgdorferi_671563339 L-------KDRLRAN---FRKEI-----------FHK-VDNIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNS------------NSD------------T-------VK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE -_Borrelia_garinii_696419050 L-------KERLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NEYYK--------LDG----YK--T-FDAFIK--------NY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNS------------NSD------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE -_Borrelia_afzelii_501574765 L-------KERLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NEYYK--------LDG----YK--T-FDAFIK--------NY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNS------------NSD------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE -_Borrelia_burgdorferi_695263537 
L-------KERLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNS------------NSD------------T-------VK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE -_Borrelia_burgdorferi_499186196 L-------KERLKSN---FQKEI-----------YNK-IESMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-KL-AKSQTYEYL-KIASAIENGV-I--E----------ELF-----------------------------------------LLE-N-----GIKETIIF---L-RNS------------NSD------------T-------VK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE -_Borrelia_turicatae_519700232 L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------N-------IK-K-----------S-KQNPL---------------KPL--------R-FQ--L--KC-PD-A---YA------FY--K-GNA----------------KLTSFLLERVFS BAN_0003100_Borrelia_anserina_BA2_576100681 L-------KERLKVN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-KNR------------EGV------------G-------IK-R-----------S-KQNPL---------------SPL--------R-FQ--L--KC-PE-A---YA------FY--K-RNA----------------KLTSFLLEKVFS BHO_0003100_Borrelia_hermsii_YBT_576092650 L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----KEYYK--------IDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------H-------IK-K-----------S-KQNPL---------------RPL--------R-FQ--L--KC-SD-A---YA------FY--K-QNA----------------KLTSFLLEKVFS BCO_0003100_Borrelia_coriaceae_Co53_576094173 
L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--T-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINNTMFL---L-RNK------------EGV------------N-------IK-K-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-GNA----------------KLTSFLLEKVFS BHY_1114_Borrelia_hermsii_YOR_576105484 L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------H-------IK-K-----------S-KQNPL---------------KPL--------R-FQ--L--KC-SD-A---YA------FY--K-QNS----------------KLTSFLLEKIFS -_Borrelia_miyamotoi_645073074 L-------KEKLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----KEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINNTMFL---I-RNK------------EGV------------S-------IK-K-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-RNA----------------KLTSFLLEKVFL -_Borrelia_hermsii_644979602 L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------H-------IK-K-----------S-KQNPL---------------KPL--------R-FQ--L--KC-SD-A---YA------FY--K-QNS----------------KLTSFLLEKIFS -_Borrelia_anserina_645048715 L-------KERLKVN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-KNR------------EGV------------G-------IK-R-----------S-KQNPL---------------SPL--------R-FQ--L--KC-PE-A---YA------FY--K-RNA----------------KLTSFLLEKVFS BDU_1115_Borrelia_duttonii_Ly_201084318 
L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------N-------IK-R-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-KNA----------------KLTSFLLERVFS -_Borrelia_hispanica_639481996 L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------T-------IK-R-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-KNA----------------KLTSFLLERVFS BHW_0003100_Borrelia_hermsii_MTW_576091528 L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------H-------IK-K-----------S-KQNPL---------------KPL--------R-FQ--L--KC-SD-A---YA------FY--K-QNS----------------KLTSFLLEKIFS -_Borrelia_parkeri_644922901 L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--N-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------N-------IK-K-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-GNS----------------KLTSFLLEKVFS -_Borrelia_hermsii_645062976 L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------H-------IK-K-----------S-KQNPL---------------KPL--------R-FQ--L--KC-SD-A---YA------FY--K-QNS----------------KLTSFLLEKIFS -_Borrelia_crocidurae_504496098 
L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------N-------IK-R-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-KNA----------------KLTSFLLERVFS -_Borrelia_644980614 L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--S-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINGTMFL---L-RNK------------EGV------------N-------IK-R-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-KNA----------------KLTSFLLERVFS -_Borrelia_lonestari_145652250 L-------KEKLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----KEYYK--------LDH----YS--S-FDDFAR--------DY-RL-ARTQTYKYL-KIATAIEEGI-I--E----------EKY-----------------------------------------VIN-N-----GINSTMFL---L-RNK------------EGV------------S-------IK-K-----------S-RQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-KNA----------------KLTSFLLEKVFL -_Borrelia_coriaceae_645023282 L-------KERLKIN---FQKEI-----------FCK-IEAMKVLKEIKD-----NEYYK--------LDN----YA--T-FDDFAK--------DY-RL-ARTQTYKYL-KIATAIEEGI-V--E----------EKY-----------------------------------------VIN-N-----GINNTMFL---L-RNK------------EGV------------N-------IK-K-----------S-KQNPL---------------RPL--------R-FQ--L--KC-PD-A---YA------FY--K-GNA----------------KLTSFLLEKVFS -_Borrelia_persica_639480295 L-------KESLVNN---FKKEI-----------FHK-IEVIKILKEIKD-----NEYYK--------LDG----YN--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAIAEGL-L--E----------EKF-----------------------------------------IIE-N-----GLTMSLLS---I-RDK------------HGT------------T-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-YE-S---YD------FY--K-KNA----------------KFTGFVLDKLFW -_Borrelia_recurrentis_501533150 
L-------KKKLYIN---LREGI-----------YNK-IECMKILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------NY-NV-AKTQAYNYL-KIAAALEEGL-L--E----------EKF-----------------------------------------VLE-N-----GFRQILSL---L-RDK------------QGK------------T-------IK-R-----------S-KQNPI---------------KPL--------R-FQ--L--KR-QE-S---YD------FY--K-KNA----------------KFTSFLLDKLFQ -_Borrelia_duttonii_501533243 L-------KERLVFN---FQKEI-----------LNK-IESMKILKEIKD-----NEYYK--------LDG----YK--N-FEEFTR--------HY-RI-AKTQAYEYL-KIANAIQEGL-V--E----------EKD-----------------------------------------IIE-N-----GIHDIILS---L-RNK------------AGF------------N-------TK-K-----------S-KQNVI---------------KPL--------K-FQ--L--KR-QE-S---YD------FY--K-KNP----------------KFTSFILDEIFF -_Borrelia_crocidurae_504496216 L-------KERLVFN---FQKEI-----------LNK-IESMKILKEIKD-----NEYYK--------LDG----YK--N-FEEFTR--------HY-RI-AKTQAYEYL-KIANAIQEGL-V--E----------EKD-----------------------------------------IIE-N-----GIHDIILS---L-RNK------------TGF------------S-------IK-K-----------S-RQNMI---------------KPL--------K-FQ--L--KR-QE-S---YD------FY--K-KNP----------------KFASFILDEIFF -_Borrelia_crocidurae_644980530 L-------KERLVFN---FQKEI-----------LNK-IESMKILKEIKD-----NEYYK--------LDG----YK--N-FEEFTR--------HY-RI-AKTQAYEYL-KIANAIQEGL-V--E----------EKD-----------------------------------------IIE-N-----GIHDIILS---L-RNK------------AGF------------N-------TK-K-----------S-RQNVI---------------KPL--------K-FQ--L--KR-QE-S---YD------FY--K-KNP----------------KFTSFILDEIFF -_Borrelia_garinii_696422229 L-------KNELKNR---IEDDI-----------RNK-INTMKILLKIRN-----GKLYI--------LDG----YK--R-FEDFIF--------DF-KI-ARTQAYKYI-KIAKLIFEGK-L--K----------EIN-----------------------------------------IIE-D-----GIDKTLFN---L---M------------KDR------------K-------V--K-----------S-RANLV---------------KPL--------R-IR--L--ET-QE-A---YD------FY--K-RNP----------------KFTNHILE---- -_Borrelia_burgdorferi_499192746 
L-------KNELKNR---IEDDI-----------RNK-INTMKILLEIRN-----RKLYI--------LDG----YK--K-FEDFIF--------DF-KI-ARTQAYKYI-KIAKLIFEGK-L--E----------EID-----------------------------------------IIE-N-----GIDKTLFN---L---M------------KDK------------K-------I--N-----------S-KANLI---------------TPL--------R-VR--L--ET-QE-A---CD------FY--K-MNP----------------KFANYILEDFYQ -_Borrelia_coriaceae_645024479 L-------KEKLKEN---FKREI-----------HYK-VEAIKILKEIKD-----NEYYK--------LDN----YN--S-FESFVK--------QY-KV-AKTQAYAYL-KLANALQNGI-L--E----------EGY-----------------------------------------IIE-N-----GIHNSLVL---I-ENK------------KNK------------T-------MK-K-----------S-RQKPI---------------RSL--------R-FQ--F--EN-QE-S---YD------FY--K-KNA----------------KFTSFLMDVLFR -_Borrelia_501532751 L-------KSKLKDN---IKDDI-----------YNK-IEAMHILREIKD-----KEYYK--------LDG----YK--S-FSRFIK--------DY-KL-AKSQAYSYL-RIASAIQDGI-L--K----------EEY-----------------------------------------LIE-N-----GFRQSLSF---L-MEK------------ESK------------N-------LK-K-----------S-KINPV---------------KPL--------R-FQ--L--KS-QD-S---YN------YY--K-KNA----------------KLTGFILDKLFL -_Borrelia_hispanica_639481645 L-------KSKLKDN---IKDDI-----------YNK-IEAMHILREIKD-----KEYYK--------LDG----YK--S-FSRFIK--------DY-KL-AKSQAYSYL-RIASAIQDGI-L--K----------EEY-----------------------------------------LIE-N-----GFRQSLSF---L-MEK------------ESK------------N-------LK-K-----------S-KINPV---------------KPL--------R-FQ--L--KS-QD-S---YN------YY--K-KNA----------------KLTGFILDKLFL -_Borrelia_hermsii_644979506 L-------KRKLMIN---LKDEI-----------HAK-IITMKILKEIND-----KELYV--------QEG----YK--T-FSDFIS--------EF-NL-ARTQVYGYI-RMAAAISEGV-L--S----------EEY-----------------------------------------IIQ-N-----GIQNSLLF---I-RST------------NSD------------T-------IK-K-----------S-RVNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-QNA----------------KFTSFLMDELFK -_Borrelia_crocidurae_644980942 
L-------KSKLKDN---IKDDI-----------YNK-IEAMHILREIKD-----KEYYR--------LDG----YK--S-FSRFIK--------DY-KL-AKSQAYSYL-RIASAIQDGI-L--K----------EEY-----------------------------------------LIE-N-----GFRQSLSF---L-MEK------------ESK------------N-------LK-K-----------S-KINPV---------------KPL--------R-FQ--L--KS-QD-S---YN------YY--K-KNA----------------KLTGFILDKLFL -_Borrelia_persica_639480216 L-------KCKLKDN---IKEDI-----------YNK-IEAMYILKEIKE-----KRYYK--------LDG----YK--S-FSQFIK--------NY-KL-GRSQAYSYL-RIASAIEYGI-L--K----------EEY-----------------------------------------LIE-N-----GVRQCLIF---L-TKS------------ENI------------K-------IK-K-----------S-RQNLI---------------KPL--------R-FQ--L--KC-QE-T---YD------YY--K-KNS----------------KFTSFLMEELFR BHY_1499_Borrelia_hermsii_YOR_576105904 L-------KENFINS---FKKEI-----------VYK-IECMKILKEIKD-----NQYYK--------LDG----FK--T-FDSFTK--------NF-KI-ARSQIYNYL-KLAGAMEDGL-I--S----------EEY-----------------------------------------LLE-N-----GINDSLDL---I-KNK------------ERA------------T-------LK-K-----------S-TQNSI---------------KPL--------R-FQ--L--KD-RK-V---MI------FT--K-S---------------------ILSLQHSF- -_Borrelia_garinii_671520608 L-------KSQLTIN---VKSEI-----------CSR-LETMKILKEIKD-----NEYYK--------LDG----YK--S-FEDFTK--------DY-KL-AKTQAYDYL-RIANAIEEGI-I--E----------EEF-----------------------------------------LVQ-N-----GFRQTLFV---L-RNK------------ESL------------T-------IK-K-----------S-KQNWI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KHA----------------KFTSFMMDEIFE -_Borrelia_burgdorferi_497943789 L-------KSQLTIN---VKSEI-----------CSR-LETMKILKEIKD-----NEYYK--------LDG----YK--S-FEDFTK--------DY-KL-AKTQAYDYL-RIANAIEEGI-I--E----------EEF-----------------------------------------LVQ-N-----GFRQTLFL---L-RNK------------ESV------------T-------IK-K-----------S-KQNWI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KHA----------------KFTSFMMDEIFE -_Borrelia_burgdorferi_group_493479353 
L-------KSQLTIN---VKSEI-----------CSR-LETMKILKEIKD-----NEYYK--------LDG----YK--S-FEDFTK--------DY-KL-AKTQAYDYL-RIANAIEEGI-I--E----------EEF-----------------------------------------LVQ-N-----GFRQTLFV---L-RNK------------ESI------------T-------IK-K-----------S-KQNWI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KHA----------------KFTSFMMDEIFE -_Borrelia_valaisiana_501894859 L-------KSQLTIN---VKSEI-----------CSR-LETMKILKEIKD-----NEYYK--------LDG----YK--S-FEDFTK--------DY-KL-AKTQAYDYL-RIANAIEEGI-I--E----------EEF-----------------------------------------LVQ-N-----GFRQTLFV---I-RNK------------KGL------------T-------IK-K-----------S-KQNWI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KHA----------------KFTSFMMDEIFE -_Borrelia_bissettii_503783725 L-------KSQLTIN---VKSEI-----------CSR-LETMKILKEIKD-----NEYYK--------LDG----YK--S-FEDFTK--------DY-KL-AKTQAYDYL-RIANAIEEGI-I--E----------EEF-----------------------------------------LVQ-N-----GFRQTLFV---L-RNK------------ESV------------T-------IK-K-----------S-KQNWI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KHA----------------KFTSFMMDEIFE -_Borrelia_garinii_657248267 L-------KSQLTIN---VKSEI-----------CSR-LETMKILKEIKD-----NEYYK--------LDG----YK--S-FEDFTK--------DY-KL-AKTQAYDYL-RIANAIEEGI-I--E----------EEF-----------------------------------------LVQ-N-----GFRQTLFV---L-RNK------------ESL------------T-------IK-K-----------S-KQNWI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-KHA----------------KFTSFMMDEIFE -_Borrelia_burgdorferi_501588721 L-------KEKLKIN---SKKEI-----------YCK-LETLKILKEIKD-----NHYYR--------FDG----YK--S-FEAFSK--------DY-RL-ARAQVYNYL-KIANAIEDGI-I--Q----------EEF-----------------------------------------LIK-N-----GILETLIV---L-RNK------------ESK------------T-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KR-QE-S---YD------FY--K-SNA----------------KFTGFMLDKLFS -_Borrelia_garinii_657235047 
L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------KTS------------I-------IK-K-----------T-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-KNA----------------KFTGFLLDKLFN -_Borrelia_spielmanii_493479385 L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------ETA------------I-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-SNA----------------KFTGFLLDKLFS -_Borrelia_burgdorferi_671580297 L-------KNRLKTN---IKRKF-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDF---I-EGQ------------ETS------------I-------VK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-KE-S---YD------FY--K-SNA----------------KFTGFLLDKLFS -_Borrelia_garinii_501710973 L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------ETS------------I-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDKLFN -_Borrelia_burgdorferi_497945190 L-------KEKLNDN---FKKEI-----------FHR-VENIKILKEIKD-----NQYYK--------FDG----YK--T-FLDFIK--------DF-DV-AKTQAYKYL-RLATALQEGL-I--K----------EDY-----------------------------------------LIE-N-----GIKNSYNF---I-KDK------------ESP------------A-------LK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-KKS----------------KFTSFMMHEIFE -_Borrelia_burgdorferi_488739923 
L-------KEKLNDN---FKKEI-----------FHR-VENIKILKEIKD-----NQYYK--------FDG----YK--T-FLDFIK--------DF-DV-AKTQAYKYL-RLATALQEGL-I--K----------EDY-----------------------------------------LIE-N-----GIKNSYNF---I-KDK------------ESP------------A-------LK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-KNA----------------KFTAFILEELLK -_Borrelia_coriaceae_752506999 L-------KEQLKTS---FSNEV-----------YNR-VQTMKILKEIRD-----NEYYK--------IDG----YK--T-FDAFIK--------DY-KL-AKTQVYDYL-RVAAAIEDGV-V--E----------EDY-----------------------------------------LLK-H-----GFKNTVML---L-RNK------------SSK------------T-------IR-R-----------S-KINPI---------------KPL--------R-FQ--L--KK-KD-S---YD------FY--K-KNA----------------RLTSFILDEIFQ BCO_0130002_Borrelia_coriaceae_Co53_576094619 L-------KEQLKTS---FSNEV-----------YNR-VQTMKILKEIRD-----NEYYK--------IDG----YK--T-FDAFIK--------DY-KL-AKTQVYDYL-RVAAAIEDGV-V--E----------EDY-----------------------------------------LLK-H-----GFKNTVML---L-RNK------------SSK------------T-------IR-R-----------S-KINPI---------------KPL--------R-FQ--L--KK-KD-S---YD------FY--K-KNA----------------RLTSFILDEIFQ -_Borrelia_burgdorferi_group_488735361 L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDF---I-EGQ------------ETS------------I-------VK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-KE-S---YD------FY--K-SNA----------------KFTGFLLDKLFS -_Borrelia_burgdorferi_497943336 L-------KDRLKAN---FRKEI-----------YHK-LDSIKILKEIKD-----NQYYK--------IDG----YK--K-FDYFIK--------DY-KI-ARSQAYNYL-KLATALQEGI-L--K----------EDY-----------------------------------------LIE-N-----GIHNSLDL---I-KDK------------ESP------------T-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-SNA----------------KFTGFLLDKLFM -_Leptospira_interrogans_446063157 
--K-----EQNFRAQ--FLHERI-Q---ANF-------IGLVFDLKEMRD-----KKLYS--R---L---G----FD--N-FKDYLK-STL---PKF--V-TLSFASNLM-----LLSDKM-S--E----------EDY-----------------------------------------KKT-N---P-NQVQILAK---I----------------ASN--------PD--V-FE----IS-H-----------K-VKGEI-----------HLS-NGM--V-------MD--L-----EE-----YE----T-TY--A-DEI----------------AQQTDVYREAIK -_Leptospira_santarosai_490596705 --N-----EQNFRAQ--FLHERI-Q---ANF-------IGLVFDLKEMRD-----RKLYA--R---L---G----FD--N-FKDYLK-SAL---PKF--V-TLSFASNLM-----LLSDKM-S--E----------EDY-----------------------------------------KKT-N---P-SKVQVLAK---I----------------ASN--------PD--V-FE----IS-H-----------K-VKGEI-----------HLS-NGT--V-------MD--L-----EE-----YE----T-IY--A-NEI----------------AQQTDAYREAIK -_Borrelia_hermsii_645063111 L-------KDRLKES---FKREI-----------HYK-VEAIKILKEIKD-----NEYYK--------LDN----YN--S-FESFVK--------EY-KV-AKTQAYAYL-KLASALQDGI-L--Q----------EDY-----------------------------------------IIE-H-----GIHNSLVL---I-GNE------------RNK------------T-------IR-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-HD-S---YN------FY--K-KNA----------------KFTSFLMDEL-- -_Borrelia_burgdorferi_497942632 L-------KDRLRAN---FRKEI-----------FHK-VDNIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIK--------DF-NI-ARSQAYNYL-KLAAALQEGI-L--K----------EDY-----------------------------------------VIE-N-----GIHNSLNL---I-QDK------------ESP------------T-------FK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KK-QE-S---YD------FY--K-SNA----------------KFTGFMLDKLFS -_Borrelia_coriaceae_654876378 L-------KEQLKTS---FSNEV-----------YNR-VQTMKILKEIRD-----NEYYK--------IDG----YK--T-FDAFIK--------DY-KL-AKTQVYDYL-RVAAAIEDGV-V--E----------EDY-----------------------------------------LLK-H-----GFKNTVML---L-RNK------------SSK------------T-------IR-R-----------S-KINPI---------------KPL--------R-FQ--L--KK-KD-S---YD------FY--K-KNA----------------RLTSFILDEIFQ -_Borrelia_hermsii_645010591 
L-------KDRLKES---FKREI-----------HYK-VEAIKILKEIKD-----NEYYK--------LDN----YN--S-FESFVK--------EY-KV-AKTQAYAYL-KLASALQDGI-L--Q----------EDY-----------------------------------------IIE-H-----GIHNSLVL---I-GNE------------RNK------------T-------IK-K-----------L-RQNPI---------------KPL--------R-FQ--L--KS-HD-S---YE------FY--K-KNA----------------KFTSFLMDELFR -_Borrelia_hermsii_644979720 L-------KDRLKES---FKREI-----------HYK-VEAIKILKEIKD-----NEYYK--------LDN----YN--S-FESFVK--------EY-KV-AKTQAYAYL-KLASALQDGI-L--Q----------EDY-----------------------------------------IIE-H-----GIHNSLVL---I-GNG------------RNK------------T-------IR-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-HD-S---YN------FY--K-KNA----------------KFTSFLMDELFR -_Borrelia_afzelii_500023248 L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------ETL------------I-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDKLFN BCO_0130005_Borrelia_coriaceae_Co53_576095359 L-------KEQLKTS---FSNEV-----------YNR-VQTMKILKEIRD-----NDYYK--------LDG----YK--T-FDAFIK--------DY-KL-AKTQVYDYL-RVATAIEDGV-V--E----------EDY-----------------------------------------LLK-H-----GFKNTVML---L-RNK------------SSK------------M-------MR-R-----------S-KINPI---------------KPL--------R-FQ--L--KK-KD-S---YD------FY--K-KNA----------------RLTSFILDEIFQ -_Borrelia_burgdorferi_497944835 L-------KEKLNDN---FKKEI-----------FHR-VENIKILKEIKD-----NQYYK--------FDG----YK--T-FLDFIK--------DF-DV-AKTQAYKYL-RLATALQEGL-I--K----------EDY-----------------------------------------LIE-N-----GIKNSYNF---I-KDK------------ESP------------A-------LK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-KKS----------------KFTSFMMHEIFE -_Borrelia_garinii_501704211 
L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------ETL------------I-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDKLFN -_Borrelia_burgdorferi_501930839 L-------KEKLNDN---FKKEI-----------FHR-VENIKILKEIKD-----NQYYK--------FDG----YK--T-FLDFIK--------DF-DV-AKTQAYKYL-RLATALQEGL-I--K----------EDY-----------------------------------------LIE-N-----GIKNSYNF---I-KDK------------ESP------------A-------LK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-KKS----------------KFTSFMMHEIFE -_Borrelia_garinii_671501759 L-------KNRLKTN---IKKKI-----------FYK-VESIRILKEIKD-----NEYYK--------LDG----YK--S-FDAFIR--------DY-KL-ARTQVYIYL-KLAKALQAGI-L--N----------EDY-----------------------------------------IIE-N-----GIYDSLDL---I-EGQ------------ETS------------I-------IK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------RFTGFLLDKLFN -_Borrelia_coriaceae_645024774 L-------KEQLKTS---FSNEV-----------YNR-VQTMKILKEIRD-----NDYYK--------LDG----YK--T-FDAFIK--------DY-KL-AKTQVYDYL-RVATAIEDGV-V--E----------EDY-----------------------------------------LLK-H-----GFKNTVML---L-RNK------------SSK------------M-------MR-R-----------S-KINPI---------------KPL--------R-FQ--L--KK-KD-S---YD------FY--K-KNA----------------RLTSFILDEIFQ -_Borrelia_afzelii_504299060 L-------KDRLKAN---FRKEI-----------FHK-VDNIRILKEIKD-----NEYYK--------LDG----YK--S-FFAFVK--------DY-NI-ARTQAYNYL-KLATALQEGF-I--K----------EDY-----------------------------------------IIE-N-----GIHNSLDL---I-QDK------------ESP------------T-------FK-K-----------S-KKNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNA----------------KFTGFLLDKLFN -_Borrelia_duttonii_740582299 
L-------VKQLKNN---IKSEI-----------YNV-IDTMKILKKIND-----KKLYI--------EGG----FK--S-FKDFLS--------EF-KL-AKTQSYEYI-KLATAIETGL-L--E----------EDF-----------------------------------------ITL-N-----GIRASIRY---I-KTK------------TNG------------I-------IK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KN-QE-S---YD------FY--K-KNA----------------KLTGFILDRLFL -_Borrelia_persica_639480329 L-------VKQLKNN---IKSEI-----------YNI-IDTMKILKRIND-----NKLYA--------EGG----FS--S-FKDFLS--------EF-KL-AKTQSYEYI-KLARAIETGL-L--E----------EDF-----------------------------------------ITL-H-----GIRASIRY---I-KTQ------------ANG------------I-------IK-K-----------S-KQNPV---------------KPL--------R-FQ--L--KH-QE-S---YN------FY--K-KNA----------------KLTGFILDNLFL -_Borrelia_burgdorferi_499186290 L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIKDGI-L--E----------EAY-----------------------------------------VIE-N-----GVTKTLEF---L-R-K------------SPN------------V-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNA----------------KFTGYLLDKLFN -_Borrelia_burgdorferi_group_501704326 L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIKDGI-L--E----------EAY-----------------------------------------VIE-N-----GVTKTLEF---L-R-K------------SPN------------V-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-QE-S---YD------FY--K-SNA----------------KFTGYLLDKLFN -_Borrelia_garinii_696415807 L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIQDGV-L--E----------EAY-----------------------------------------VIE-N-----GVTKAIAF---L-R-K------------SPG------------V-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KT-HE-S---YD------FY--K-SNA----------------KFTSFMMHELFE -_Borrelia_valaisiana_501894944 
L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIQEGI-L--E----------EAY-----------------------------------------VIE-N-----GISKAIAV---L-R-E------------SPS------------G-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KT-KE-S---YD------FY--K-SNA----------------KFTSFMMHEIFE -_Borrelia_garinii_671481046 L-------KQRLKSN---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIEEGV-L--E----------EAY-----------------------------------------VIE-N-----GVTKTIAF---L-R-K------------SPS------------I-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-SNA----------------KFTSFMMHEIFE -_Borrelia_burgdorferi_501704894 L-------KNRLVNN---FKKEI-----------FHK-IEFIKILKEIKD-----NEYYK--------LDG----YT--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAMEEGL-L--D----------EAY-----------------------------------------VIE-N-----GLTISLLS---L-RDK------------ESS------------S-------FK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE BBUBOL26_W05_Borrelia_burgdorferi_Bol26_226232418 L-------KNRLVNN---FKKEI-----------FHK-IEFIKILKEIKD-----NEYYK--------LDG----YT--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAMEEGL-L--D----------EAY-----------------------------------------IIE-N-----GLTISLLS---L-RDK------------ESS------------S-------FK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE -_Borrelia_burgdorferi_497943851 L-------KNRLVNN---FKKEI-----------FHK-IEFIKILKEIKD-----NEYYK--------LDG----YT--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAMEEGL-L--D----------EAY-----------------------------------------IIE-N-----GLTISLLS---L-RDK------------ESS------------S-------FK-K-----------S-RQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE -_Borrelia_afzelii_504299173 
L-------KQRLKSS---FQQEI-----------YYK-MEVIKILKEIKD-----NEYYK--------LDG----YR--T-FEDFIK--------DY-HL-ARSQAYDYL-KIANAIEEGV-L--E----------EAY-----------------------------------------VIE-N-----GVTKTIAF---L-R-K------------SPG------------V-------LK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KT-QE-S---YD------FY--K-SNA----------------KFTSFMMHEIFE -_Borrelia_afzelii_504299035 L-------KNKLVNN---FKKEI-----------FHK-IEVIKILKEIKD-----NEYYK--------LDG----YT--S-FNSFAK--------NY-RI-ARTQVYDYI-RIANAMEEGL-L--E----------EAF-----------------------------------------IIE-N-----GLTMSLLS---L-REK------------ESP------------T-------FK-K-----------S-KQNPI---------------KPL--------R-FQ--L--KS-KE-S---YD------FY--K-SNA----------------KFTGFLLDELFE -_Borrelia_garinii_671501078 L-------KEKLKTN---FKKEI-----------FHK-VENIRILKEIKD-----NQYYK--------FDG----YK--N-FLDFVK--------DF-NV-AKSQAYKYL-KLAAALQDGV-L--N----------DDY-----------------------------------------VIK-N-----GIHNSFNY---I-KDK------------EGP------------S-------LK-K-----------S-KQNPI---------------KPL--------R-LK--L--KT-QE-S---YD------FY--K-SKA----------------KFTSFMMNEIFE -_Planctomyces_brasiliensis_503393144 L-T--DA-QQAKLQD---CEIVI-EDGLKAF-------LKTCIAVVVIDD-----LELYK--P------------HK--S-LHAYCA-------FRF-DY-SDTETGRLR--NAGHVLVNL----S-----------GL-----------------------------------------SAD-D--L-------------F-SGK------------ESD------------I-V-----LP-A-------------NEGQC---------------REM----------AK--L--KK-GK-K-QDAD-LQRK-VW--A-EVI-KRAGQ--------D-KITARLIKEVVD -_uncultured_Mediterranean_phage_uvMED_787047096 M-T--EQ-EQKELIE---AETVI-K---SSFQGKMERDLAIGAGLLKIKR-----QKLYR-GV---------SG-GR--L-WIDYLK-EES-A-KLT-GN-AEPISDQLA--RNLRGFYEF----R------------------------------CE----------------------ILQ-D---------------------------------LYN-YI---------I-------LP-T-------------NKSQV---------------TPI--LGY-----LK-----NP-KE-A---VE------IW--K-AAC-SEAGS----N---K-VPTYHQVNKAYY -_Gloeocapsa_sp_PCC_7428_505002935 
I-Q--QH-TQELKER---LQRTA-----QDI-------WEIGQKLAEVRS------------R-----LK-----HG--Q-FDNWLK-------AEF-GW-SRRTAYNFI-----NVYETF------------------------------------N----------------------ERA-K----------------F--------------------------------------------------------AHFNI-AT------------SAL----------YL--L--AS-PS-T---PQ----D-IK--D-QFI-EVAQT----G---Q-KVTHKDIRKALE -_Nostoc_sp_PCC_7120_499306042 V-E--QR-TSEIREQ---LRRTA-----QDI-------WEIGQSLAEVRA------------Q-----LK-----HG--Q-FETWLK-------AEF-GW-SRRTAYNFI-----NVYETF------------------------------------G----------------------NRA-N----------------L--------------------------------------------------------AQIDI-AT------------SAL----------YL--L--AA-PS-T---PE----N-LR--E-QYI-EEAKA----G---K-RITHKELVQTIK -_Cyanothece_497233611 I-Q--QL-TQEIRDC---LRRSA-----QDI-------WEIGQKLADVRD------------R-----LK-----YG--Q-FDTWLK-------VEF-GW-SRRTAYNFI-----SVYQTF------------------------------------G----------------------ERA-N----------------L--------------------------------------------------------AQVDI-AT------------SAL----------YL--L--AA-PS-T---PQ----K-VR--E-EFL-QKAQA----G---Q-TITHKQLSEVIQ -_Crocosphaera_watsonii_494515216 I-Q--QL-TQEIRDC---LRRSA-----QDI-------WEIGQKLADVRD------------R-----LK-----YG--Q-FDTWLK-------TEF-GW-SRRTAYNFI-----SVYQTF------------------------------------G----------------------ERA-N----------------L--------------------------------------------------------AKVNI-AT------------SAL----------YL--L--SA-PS-T---SQ----K-VR--E-EFL-QKARS----G---E-TITYKQLSEVIQ -_Fischerella_sp_PCC_9339_515877202 I-Q--QR-TGEIKER---LRRSA-----QDI-------WEIGQKLADVRS------------Q-----LK-----HG--Q-FDTWLK-------AEF-GW-SRRTAYNFI-----NVYEAF------------------------------------D----------------------ECA-N----------------L--------------------------------------------------------AQIDI-AT------------TAL----------YL--L--AA-PS-T---PE----N-VR--E-EIL-QRAKG----G---E-TLTHKDIRQVIK -_Nostoc_punctiforme_501381574 
V-N--TS-IQETLTA---IDRFE---------------WQAIEELRLMRD-----NGYYS--D---------AG-YV--S-FEDYCE-KEL---TKH-G--GYRRVRDLL--SAKKVVDTL--------------------------------------PE-------------------ELR-E----------------------K-------------------------I----------T-------------KPSQT---------------RSL----------LR--L-----VK-T---PD------KL--H-EAV-AIAAK-EK-----P-FPTAADFAKAVQ -_Nostoc_punctiforme_501381627 V-N--TS-IQETFSA---INRFE---------------WQAIDELRLMRD-----NHYYK--D---------GG-YL--S-FEDYCE-KEL---IKH-G--GYRRVRDLL--SAKKVVDTL--------------------------------------PE-------------------ELK-D----------------------K-------------------------I----------T-------------KPSQT---------------RSL----------LR--L-----VK-T---PD------KL--E-QAV-AIAAK-EK-----P-FPTAADFTKAVQ -_Nostoc_punctiforme_501381520 L-N--NG-ILDTFTA---IDRFE---------------WQASDELRLMRD-----KGYYQ--D---------GG-HK--S-FEAYCE-SEL---TKH-G--GYRRVKDLF--AAKRVVDTL--------------------------------------PE-------------------ELR-P----------------------H-------------------------I----------T-------------KPSQT---------------RSL----------LR--L-----VK-T---PD------KL--E-QAV-AIAAK-EK-----P-FPTAADFAKAVQ -_Cyanothece_497232009 --------QIAKRAE---LENIV-IEHSQSF-------TIIGKALREIQE-----KKLHQIED---------P--NK--R-WDVYVD-------ETF-GI-VKSRAYQLI--AGTVLYEVL----E---------------------------------DN-------------------LKG-D--YS----------------------------------------------------LP-K-------------SDTQL---------------RPL----------YS--L------------VK------GW--R-KAT-AKNDV----------EEKERIEQLIVK -_Cyanothece_sp_CCY0110_495553174 --------QIAKRAE---LENIV-IEHSQSF-------TIIGKALREIQE-----KKLHQIED---------P--NK--R-WDVYVD-------ETF-GI-VKSRAYQLI--AGTVLYEVL----E---------------------------------DN-------------------LEG-D--YS----------------------------------------------------LP-K-------------SDTQL---------------RPL----------YS--L------------VK------GW--R-KAT-AKNDV----------EEKERIEQSIVK -_Anabaena_variabilis_499635690 
L-N--SS-IQETFTS---LDRFE---------------WQAVGELLQMRD-----QELYI--E---------AG-YA--D-FKEYCQ-REL---SAW-G--GYRRISQLL--GAKKVIDSV---------------------------------------G-------------------ELG-Q----------------------H-------------------------I----------K-------------NERQA---------------LPL----------LR--L-----VK-E---PQ------KL--R-EAV-AIAVQ-EN-----P-SPSESDFAAAAQ -_Nodularia_spumigena_493212749 L-N--TS-IKETFAA---IDRFE---------------WQAVDKILQMRE-----QQIYL--E---------GG-YK--N-FEEYCQ-REL---SAW-G--GYRRINQLL--GAKKVIEAA---------------------------------------G-------------------EFG-G----------------------H-------------------------I----------K-------------NERQS---------------RPL----------LR--L-----VK-E---PE------KL--K-QAL-AIALE-QN-----P-SPSESDFAAAAK -_Nostoc_sp_PCC_7120_499308918 L-N--SG-IKETFTA---IDRFE---------------WQAVGEILQMRD-----QELYL--E---------AG-YA--D-FKEYCQ-REL---SAW-G--GYRRITQLL--GAKRVIDTV---------------------------------------G-------------------ELG-Q----------------------H-------------------------I----------K-------------NERQA---------------RPL----------LR--L-----AK-E---PE------KL--K-QAV-TIALQ-EN-----P-SPSESDFAAAAQ -_Anabaena_variabilis_499635648 L-N--SS-IKETFTT---IDRFE---------------WQAVEEILQMRD-----QQLHR--E---------AG-YK--S-FEEYCQ-AEL---SAW-G--GYRRITQLL--GAKKVIDAV---------------------------------------G-------------------ELG-E----------------------H-------------------------I----------K-------------NERQA---------------RPL----------LR--L-----VK-E---PD------KL--K-EAV-AIALQ-EN-----P-NPSESDFAAAAR -_Kitasatospora_sp_MBT63_727525039 F-E--AD-KQTAHTR---FAQQA------------------GPALWEIHD-----RKLYR--S---------T--HS--T-WEEYLG-------ERW-GL-SRSYAHRLL--EMIPVQAAL-L--P------------------------------------------------NP----AFG-N-------------------------------------L----------V-------LR-E-------------SQARV-----L---------VPV----------LR-EY--GP-AQ-V---RE------VV--E-KAL---ADG--------A-RPTAKALTAART -_Anaeromyxobacter_dehalogenans_501750516 
L-A--AR-AEDLPEG-S-MRRKV-LEGAQRF-K-SAW-VELGRLLSEVRR-----KELWR--G---W---G----YP--S-FERYCT-------KEL-FI-RGATADKLT--ASYGFLERH----E----------------------------------------------------P-ELA-K------------------------------------------------A----------R-------------GETRA---------------PPF--------E-----V------------IE------VL----SRA----------------EATGRLSDSGWR PSR1_03440_Anaeromyxobacter_sp_PSR-1_775300647 L-A--AR-AEDLPEG-S-MRRKV-LEGAQRF-K-SAW-VELGRLLSEVRR-----KELWR--G---W---G----YP--S-FERYCT-------KEL-FI-RGATADKLT--ASYGFLERH----E----------------------------------------------------P-ELA-K------------------------------------------------A----------R-------------GETRA---------------PPF--------E-----V------------IE------VL----SRA----------------EATGRLSDSGWR -_Anaeromyxobacter_sp_K_501518403 L-A--AR-AEDLPEG-S-MRRKV-LEGAQRF-K-SAW-VELGRLLSEVRR-----KELWR--G---W---G----YP--S-FERYCT-------KEL-FI-RGATADKLT--ASYGFLERH----E----------------------------------------------------P-ELA-K------------------------------------------------A----------R-------------GETRA---------------PPF--------E-----V------------IE------VL----SRA----------------EAAGRLSDSGWR -_Streptomyces_sp_NRRL_F-5702_664543262 L-T--PK-EQQTLGR---VHAAR-DHHQAAK-------WMRGKALAVAFS-----RRLFR--G-----EDG----RR--T-RQEYLD-------DEWDGI-SESAAYREI--GEWPVAKAI----S----------------------------------------------------D-ACE----------------------------------------------------------RP-A-------------PDSHV---------------RAL----------VD--V--AK-QQ-G---AE--PVA-RW--Y-AEL-RRHGQ-QA-G---H-RVTADVVANLAD -_Streptomyces_sp_150FB_748778099 L-N--AR-EQQDLDR---VHAAR-DHHRAAK-------WMRGKALEAAFR-----RRLFR--G-----EDG----TR--S-RQQYLD-------EEWDGL-SESAAYREI--GEWRLAKEI----T----------------------------------------------------D-ACE----------------------------------------------------------RP-A-------------PDSHV---------------RAL----------LD--V--AG-AQ-G---HK--QVA-HW--Y-AEL-RRHGQ-QT-G---R-RVTADAVANLAD -_Streptomyces_sp_NRRL_F-6628_739996264 
L-N--AK-EQQQLER---IHSAR-DHHQAAK-------WMRGKALDSAFR-----RRLFR--G-----EDG----QR--T-RQQYLD-------AEWDGM-SESAAYLEI--REWPLAAQI----S----------------------------------------------------A-TFG----------------------------------------------------------RP-A-------------PDSHV---------------RAL----------VG--V--AE-NQ-G---HE--TVA-AW--Y-ADL-RRHGQ-EL-G---Q-RVTADVVANLAD -_Scytonema_millei_748141416 V-Q--QR-TKELKER---LQRTA-----QDI-------WEIGKKLVEVRA------------E-----LKG----HG--Y-FDAWLR-------AEF-GW-SRRTAYNFI-----YVYEAF-----------------------------------------------------------PYA-K----------------F--------------------------------------------------------AQMII-EP------------SAL----------YR--L--AS-PS-T---PD----A-IR--D-KFI-QQANA----G---S-KVSHKEVLKAVT -_Streptomyces_664512363 I-F--AV-QYAAKAN---HERAE-Q---QKL-------IGLGLRLQAIKD-----EELHK--H---------TG-FE--T-FGALTD-------ARF-GI-KKHQANNIL--RVLGVAQAL----E------------------------------------------------------DVT-T--------------------------------------------------------QE-L-------------KERPL---------------RVL----------VP--I--LD-TH-G---AD-AVRE-TW--A-EAA---RHG----------NVTDTALKEAAN -_Streptomyces_coelicolor_499350288 L-N--DQ-ERGYLDV---CEQAL-HGFRKSV-------VVAGKALEVINR-----GRLYR--E---------T--HE--T-FADYVT-------EVW-DM-KRAHAYRMI--EGWRPADLV----S------------------------------------------------------PIG-D-----------------------------------------------------------I-------------NEGQA---------------REL----------AP--V--LK-EY-G---PE-VTVT-LY----RGV-KELRG----D---R-RVTAADLSEARA -_Streptomyces_yeochonensis_740055334 --Q--KD-QTETVIR---TAHAA---GKAAV-------WVMGQGIAAAAK-----GKWFR--R---------T--HS--S-LEQYVV-----D-LIP-DV-VPRQARRWV--TGYPIALAI-T--S------------------------------------------------------RTG-E--------------------------------------------------------SP---------------VEGQV---------------REI----------AD--L--PE-SV-A---VE------LY----AAA-DTAAR-AA-G---G-RLTAKHLTDLRR -_[Clostridium]_clostridioforme_488660258 
--------DTRRLAN-I-AYKDI-K---NGF-------VGFGYYLKIIRD-----EKLWQ--------GQG----YD--S-FNEFLG-------DEY-GK-DKSWASRCI-----NLYDKF---------------------------------------------------------------G--------------------------------------------------------IP-I-------------EPGEL---------------PRL--------------------EE-Q---YE------VY----NVS-QLIEM----------LPMSEELREQVT -_Borrelia_duttonii_740582340 --------EEIRART---LEEAV------NK-------VELSKALYEIKK-----NKLYK--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYV--KIGAYLERE---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDI-MQ------------SSL----------NK--I------------IN----------D-IRV-KRDSS----------YCKPVVVKFNLK -_Crocosphaera_watsonii_494514846 L-T--RD-DKYGYED-A-LAQLK-S---HQF-G-W---VKFGLWAFQFKV-----KRFYK--Y---------H--HK--T-WKQFCE-------NVL-HR-NRYYVDKLI--KAVRVMKDL-----------------------------------------------------------ICA-G--FE--------------------------------------------V-------LP-Q-------------NEYQC---------------RFL----------TK--F--WG-EE---L-TE------NW--A-MIV-DAVPP--------H-LITGDLLKAQFS -_Borrelia_miyamotoi_645073449 --------EEIRART---LNEAI------NK-------VELAKALYEIKK-----NKLYR--F---------DG-YD--N-FYGFCL--------NY-KF-SRTMIYRYI--KIGAYLEKD---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDV-IH------------SSL----------NK--I------------IN----------D-IRV-KGSDS----------EYKPVII----- EUS_21090_[Eubacterium]_siraeum_70/3_291531542 V----HQ-DLMIQEQ-V-VAQSL---------------TQIAIDLKEIRD-----RRLYA--E---L---G----YS--D-FAEYCE-------NAT-KT-GKRQAYNLI-----SLVEQY-----------------------------------------------------------KID-D----------------L-------------------------------S-----R-LA----------YL---GSTKL---------------IAL----------KS--L--GK-EE-----RE------------ELI-ESGKA--------E-ELSVRELKEKIK -_Pedosphaera_parvula_750252315 
L-T--TE-ETTTLSE---CEAVI-EEGMKTF-------VEVGSAVLTISD-----RRLYR--A---------T--HS--T-FEDYIQ-------DKW-DM-TARRAYQLC--EAAEVVMKL----E------------------------------------------------------NVK-H---------------------------------ASQ------------I----------E-------------NARQA---------------EAL----------AK-----AP-EE-K---RD----E-VL--E-KAI----------Q---T-APEGKLTSKH-- NITHO_3110002_Nitrolancea_hollandica_Lb_390172790 --P--SA-SHHARHA---CPRCA-----ISS-------ISRNPTCRVLSSLLRPYRRLYR--E---------AG-YR--T-FEDYCQ-------KRW-GW-TRQRAHQLM--LAAEVSTTV---------------------------------------------------------------D------------------------------------------------I-------PP-A-------------NEAQA---------------REL----------SR--L--KS-TE-A-I-RE------TW----QEV-REERG--------E-SVTARDVRETGY -_Streptomyces_virginiae_664375955 I-E--AS-VGQYQQS---LRRLQ-A---RHR-------VEVGQLLDEINE-----SGLWE--L---------EG-HE--K-FGHYVK-------ARW-GW-DRSYGYRLI--DLALVHRAL----A------------------------------------------------------PLG-P----------------------A----------VLD-----T------V-------VE-S-------------HAREL---------------APV----------VK--V--NG-DE-G---AR------HF--V-EVL-RLESG----G---K-KVTAAVIAARRD -_Chamaesiphon_minutus_504973660 L----GG-GLGDFLS---LDELA-KDNLFGF-------VRVGIAADRIRK-----RKLWQ--C-AKI--------YK--D-WNDYCT-------KGL-GK-TAWYINRTI--DAANVVMTL----I------------------------------------------------------SAG----FK--------------------------------------------I-------LP-T-------------CEAQC---------------RPL--VKL--LG-VG--L--DA------I-AD------VW--T-KVV-SAFTP--------D-KITAGKILAIAE -_Kitasatospora_sp_MBT63_727520012 F-A--GA-EYAARAN---IQQST-Q---QRV-------IVQGQILLAMRE-----EELWK--A---------LG-WT--D-FDVLVK-------HRF-GI-GRNYANKII--RSMPVVRAL----E------------------------------------------------------HVT-S--------------------------------------------------------ME-M-------------AEKHL---------------RAL----------VP--V--QE-RH-G---DE-AVRR-TW--E-EAL---RKG----------KITEKSLKEAAR -_Ruminococcus_flavefaciens_497670003 
Y-T--EA-YNLNVRI-C-INAQM---AQQNL-------YEVCKGLKEMRD-----GKLYK--E------LG----YN--S-FEDYTE-------NEV-GL-SRFMAYKY---AAIADMKNV----E----------------------------------------------------------S------------------------------------------------I-------QQ-I-------------GVTKL---------------ALL----------AK--L-----DE-----PQ----------R-EEI-QQSVN------V-E-EVSVRELKAEID -_Streptomyces_sp_NRRL_F-5140_664445691 L-T--PE-EQEQLAE---CHRAV-DNARSAQ-------WMLGRALEIVRR-----RRLYR--G------DG----TR--T-WPQYLA-----A-EHD-GM-TERDARRLQ--EEWRLAKAV-----------------------------------------------------------QEA--------------------------------------------------L-----G-KP-A-------------PASHV---------------RAM----------LE--Y--AD-NT-S---IE-QAAI-DY----AML-RAAFD-SG-R-A-R-LVAHQITARVTR -_Crocosphaera_watsonii_494520212 L-T--RD-DKYGYED-A-LAQLK-S---HQF-G-W---VKFGLWAFSLRS-----KRFYK--Y---------H--HK--T-WKQFCE-------NVL-HR-NRYYVDKLI--KAVRVMKDL-----------------------------------------------------------ICA-G--FE--------------------------------------------V-------LP-Q-------------NEYQC---------------RFL----------TK--F--WG-EE---L-TE------NW--A-MIV-DAVPP--------H-LITGDLLKAQFS -_Streptosporangium_roseum_502659622 V-P-----TLEDCER-H-ITTIT-----TQW---L---LGVGRALAAIRD-----HELFQ--E------KG----YT--S-FTAYLR-TE----HPW----HPSYVSRVI--ANIPVVEAL----E------------------------------------------------------RHG-A---------------------------------DRD-----------------------L-------------NEGQA-TA--I---------RPV--------W-EQ-----HG-EE-A-L-FE------VW--------DATTG--------K-RSAAALVRVARA -_Kamptonema_formosum_518317229 L-D--YL-DVDVDRE---IQVGF-----FSY-------VKVGYFLDKMRY-----YKLYQ--K---------QG-FN--S-FKEYCL-------KVL-RK-SAHYCIKII--SAAEVCLRL----A------------------------------------------------------ALG-----------------------------------FEQ--------------------LP-N-------------CVAQA---------------LPL----------VK--F--NP-VF-G-L-DS-PLYD-KW--Q-DVL-DNTPP----G---Q-PITAKHIAEILD -_Streptomyces_sp_FR1_501453636 
I-G--KA-QDNAELT---VRQAK-D---RFT-------REAGPALALIHD-----DELWR--P---------E--YE--S-FVDYVK-------RRW-DY-SSTHGYLLV--ATAKVQKAL-----------------------------------------------------------PEG----------------------------------------------------------AS-A-------------NTGHV---------------QVL----------AP--V--LR-HN-G---LE-AVSE-AW----AKS-EKRNG----------QPTAATLKAAVD HMPREF1089_00435_[Clostridium]_bolteae_90B3_480702301 --------DTRRLAN-I-SYKDI-K---NGF-------VGFGYYLKIIRD-----EKLWQ--------GQG----YD--S-FNEFLG-------DEY-GK-DKSWASRCI-----NLYDKF---------------------------------------------------------------G--------------------------------------------------------IP-V-------------EPGEL---------------PRL--------------------ED-A---YE------SY----NVS-QLIEM----------IPMQEELQEQVT NITHO_3110002_Nitrolancea_hollandica_Lb_390172790 --P--SA-SHHARHA---CPRCA-----ISS-------ISRNPTCRVLSSLLRPYRRLYR--E---------AG-YR--T-FEDYCQ-------KRW-GW-TRQRAHQLM--LAAEVSTTV---------------------------------------------------------------D------------------------------------------------I-------PP-A-------------NEAQA---------------REL----------SR--L--KS-TE-A-I-RE------TW----QEV-REERG--------E-SVTARDVRETGY -_Streptomyces_sp_PAMC26508_505393262 L-D--DR-EREHLAV---CEQAL-TGFRKSV-------IVAGKALEVINR-----GRLYR--E---------T--HS--T-FVEYLD-------DVW-EI-RKSQAYRMI--EAWPVAAAV----S------------------------------------------------------PIG-D-----------------------------------------------------------I-------------NEGQA---------------RQL----------QP--V--FK-DY-G---HE-AALA-VY----REV-KALRG----D---R-KVTAADLAEARA -_Streptomyces_739808094 L-D--DQ-QRAHLLV---CEQAL-TGFRKSV-------IVAGKALEVISR-----GRLYR--E---------T--HA--T-FVEYLD-------DVW-EI-RKSQAYRMI--EAWPVAAAV----S------------------------------------------------------PIG-D-----------------------------------------------------------I-------------NEGQA---------------REL----------RP--V--FT-DY-G---QE-AAVA-LY----REV-KELRG----N---R-KVTAADLAEARA -_Acaryochloris_marina_501118686 
L-A--DL-EEIFLEP---TETGS-----DAL-------LSSGLALKVIQD-----NKLYL--P---------D--SK--G-FKVYVE-------ENL-GV-TYIHAFRCI--QAAELVLFL-Q--E------------------------------------------------------HFS--------------------------------------------------V-------LP-Q-------------SESAA---------------RPL----------VK--L--SR-AN-Q---LK------AW--G-EVL-RITAG-DK---W---APGKDRIKKTIA -_Streptomyces_sp_NRRL_B-24484_663245548 --------RQGAAET---IRAAK-A---RHD-------MQVGQALELIRD-----QKLYE--A---------TG-FG--S-FREYVE-------QRW-GY-SLSRAYQMM--DTILVMSAV----S------------------------------------------------------TIV-E------------------------------------------------T-------VP---------------PEGQQ---------------RVL----------AT--V--IR-QH-S---PE-AACM-LL----ESA-RTAPG----------KLTAKKLTELRD -_Streptomyces_sp_CNQ865_654253933 L-I--GE-EEEVFQR---CEAAV-ETLKFAF-------WAAGKGLQVIRD-----GRLYR--A---------T--HG--T-FDDYVQ-------DRW-GM-TRAQANKLI--RMWPIAEAL----F------------------------------------------------------ESQ-A----------------------Q----------ESN------------D-LARTRAKR-L-------------SQSVV---------------WEL----------VP--V--AE-RY-D---VD-AAQH-LY----STT-VEASG--------G-EVTAAVLKGAVA -_Desulfococcus_multivorans_750110637 I----ND-EDEDEFI---IEYEP-----PWF-------VQVGQALSHLKE-ALL-TEFPC--A---------E--QG--P-LPRWGE--TC---KEL-SI-SQSYANRLI--AAAEVYVAL----R------------------------------------------------------SAG--------------------------------------------------I-DEDD--LP-I-------------YERQV---------------RPL----------VR--F--KQ-DP-S-I-LK----Y-LW--E-EAL-VIAED-IE-F-N-S-LPRAGVVEYVVG -_[Kitasatospora]_papulosa_662754816 L-T--DA-DRADLEL---CEQAV-RSHHATF-------WMTGKALDAVAK-----RHLYR--A---------R--YA--N-FDALL--------EDW-DV-TLADSSRMR--RGWPLAARL-L--P------------------------------------------------------DVP-K--------------------------------------------------------LT-R-------------SHVEA-----L---------LPV--VER-----YG--V--DA-AA-T-L-HA------ML--R-EAL--------------P-KVTAKAITDVVR -_Acaryochloris_marina_501117833 
L-A--DL-EEIFLEP---PDTGS-----DAL-------LSSGLALKTIQD-----KKLYL--P---------D--SK--G-FKVYVE-------ENL-GV-TYIHAFRCI--QAAELVLFL-Q--E------------------------------------------------------HFS--------------------------------------------------V-------LP-Q-------------SESAA---------------RPL----------VK--L--SR-AN-Q---LK------AW--G-EVV-RITAG-DK---W---APGKDRIKKTIA -_Streptacidiphilus_carbonis_755052115 L-I--AQ-EQDMLIK---CESAI-ENLRFAF-------WAAGKALQVIRN-----ARLYR--E---------Q--FE--T-FDEYTQ-------SKW-DI-TPQYANKLI--RTWRVAEAL----L------------------------------------------------------Q--------------------------P----------RSG------------GVLETIVSTK-L-------------GYGHA---------------WAL----------VP--L--VE-QH-S---VQ-AAVY-LY----MGI-VKVKG--------A-GVTAALVQGAVE -_Streptomyces_varsoviensis_664363832 V-T-----EQVIHAA---LAAGD-----AAI-------WVIGKALTVAAK-----GKFHR--D---------Q--GM--T-FDEYAR-------AET-GK-SPAHARRWM--DGAPLALAV-A--A------------------------------------------------------ATS-S--------------------------------------------------------TP---------------PEGHV---------------RPL----------RK--I--EK-EI-G---TR-PAIE-LY----RSA-DKASG-EG-G---R-KVTGAVLVEIRK -_Acaryochloris_marina_501119208 L-A--EL-EEIFLEP---PETGS-----DAL-------LSSGLALKTIQD-----NKLYL--P---------D--SK--G-FKVYVE-------ANL-GV-TYIHAFRCI--QAAELVLFL-Q--Q------------------------------------------------------HFS--------------------------------------------------V-------LP-Q-------------SESAA---------------RPL----------VK--L--SR-AN-Q---LK------AW--G-EVV-RITAG-DK---W---APGKDRIKKTIA -_Borrelia_hermsii_645011182 --------EEINART---LEEAV------NR-------VELAKALYEIKK-----NKLYR--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYI--KIGAYLEKE---------------------------------------------------------------N-----------------------------------------------------------I-------------SEQDI-MQ------------SSL----------NK--I------------IN----------D-IRV-KKSSP----------ECKPVVIRFNLE -_Kitasatospora_sp_MBT66_759753188 
F-A--GA-EYAARAN---IQQST-Q---QRV-------IVQGQILLAMRE-----EELWK--A---------LG-WT--D-FDVLVK-------HRF-NI-GRNYANKII--RSMPVVRAL----E------------------------------------------------------HVT-S--------------------------------------------------------ME-M-------------AEKHL---------------RAL----------VP--V--QE-RH-G---DE-AVRR-TW--E-EAL---RKG----------KITEKSLKEAAR -_Borrelia_crocidurae_644980358 --------EEIRART---LEEAV------NK-------VELSKALYEIKK-----NKLYK--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYV--KIGAYLERE---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDI-MQ------------SSL----------NK--I------------IN----------D-IRV-KRDSS----------YCKPVVVKFNLK -_Streptomyces_griseofuscus_739811243 I-S--AA-QSAAETS---LIVAH-N---EFV-------MQAGPALIKIKE-----DDLWQ--A---------GG-YT--S-FKDYFE-------KKW-KY-TEQRAYQLI--RAVPVIAAL----K------------------------------------------------------GVA-T--------------------------------------------------------VK-I-------------NEGQC---------------REL----------AP--V--AR-DH-D---AA-TVQK-IW----RAA----------------ESKGKVTAKSLA -_Borrelia_coriaceae_645024919 --------EEIKART---LEEAV------NK-------LELAKALYEIKK-----NKLYR--F---------DG-YD--Y-FYEFCL--------DY-KF-SRTMIYKYI--RIGAYLEKE---------------------------------------------------------------D-----------------------------------------------------------V-------------KEQDI-IQ------------GSL----------NK--I------------IN----------D-IRV-KKSSS----------MCKPVIIKLNLE -_[Eubacterium]_siraeum_491496822 V----HQ-DLMIQEQ-V-AAQSL---------------TQIAIDLKEIRD-----RRLYA--E---L---G----YS--D-FAEYCE-------NAT-KT-GKRQAYNLI-----SLVEQY-----------------------------------------------------------KID-D----------------L-------------------------------S-----R-LA----------YL---GSTKL---------------IAL----------KS--L--GK-EE-----RE------------ELI-ESGKA--------E-ELSVRELKEKIK -_Xenococcus_sp_PCC_7305_493559029 
I-A--GD-FDLQYLT---FEFAY-NQ--LAF-------VRNGLLLAKIKF-----LKLYK--N---------YG-DG--T-FASFCR-------EKL-RK-QRWQINDTI--RAARVVLEL-----------------------------------------------------------MYA-G--FD--------------------------------------------V-------LP-T-------------NISQA---------------IAL----------AK--L--TG-EK---L-VE------TW--R-SII-NIIPL--------D-KITAKSIRNLLN -_Streptomyces_violaceorubidus_663148255 I-L--AV-QYAARAN---HERAE-Q---QKL-------IGLGLRLQAMKD-----EELHK--T---------AG-FN--T-FGELTD-------SRF-GI-KKHQANNIL--RVMPVAQAL----E------------------------------------------------------DIT-T--------------------------------------------------------QE-L-------------KERPL---------------RVL----------VP--V--LE-AH-G---RE-AVRE-TW--L-EAA---RHG----------NVTDKTLMQAAN -_Borrelia_crocidurae_504495970 --------EEIRART---LEEAV------NK-------VELSKALYEIKK-----NKLYK--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYV--KIGAYLERE---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDI-MQ------------SSL----------NK--I------------IN----------D-IRV-KRDSS----------YCKPVVVKFNLK -_Streptomyces_sp_CNH099_654240343 L-I--GE-EEEVFQR---CEAAV-ETLKFAF-------WAAGKGLQVIRD-----GRLYR--A---------T--HG--T-FDDYVQ-------DRW-GM-TRAQANKLI--RMWPIAEAL----F------------------------------------------------------ESE-A----------------------Q----------GSN------------D-LARTRAKR-L-------------SQSVV---------------WEL----------VP--V--AE-RY-D---VD-AAQH-LY----STT-VEASG--------G-EITAAVLKGAVA -_Deinococcus_peraridilitoris_505048307 L-T--PE-EKARLAA---LEQVV-GYGLRSF-------IEMAQALQEIQE-----RRLYR--E---------Q--YV--T-FEHYCM-------KVW-NF-SRTWAYQIM-QSKEAALLAL-----------------------------------------------------------DHG--------------------------------------------------V---P---VP---------------TERHA---------------RAL----------IG--V--SA-EN-----LE------IV--A-SVV-KAATG----K---E-NPTSADYQAVVE BCO_0118100_Borrelia_coriaceae_Co53_576095549 
--------EEIKART---LEEAV------NK-------LELAKALYEIKK-----NKLYR--F---------DG-YD--Y-FYEFCL--------DY-KF-SRTMIYKYI--RIGAYLEKE---------------------------------------------------------------D-----------------------------------------------------------V-------------KEQDI-IQ------------GSL----------NK--I------------IN----------D-IRV-KKSSS----------MCKPVIIKLNLE -_Chroococcidiopsis_thermalis_504967303 V-Q--QR-TKELKER---LQRTA-----QDI-------WDIGKKLVEVRA------------E-----LKG----HG--Y-FDAWLR-------AEF-GW-SRRTAYNFI-----YVYEAF-----------------------------------------------------------PYA-K----------------F--------------------------------------------------------AQMII-EP------------SAL----------YR--L--AS-PS-T---PD----A-IR--D-KFI-QQANA----G---S-KVTHKEVLKAVT -_Streptacidiphilus_anmyonensis_755076504 --Q--KE-QTETVIR---TAHAT---GKAAV-------WVIGQGIAIMTK-----GKLYR--R---------T--HS--T-LEDYVA-----E-LIP-DV-VPRQTRRWV--TGSKVALAI-A--T------------------------------------------------------RQG-E--------------------------------------------------------AP---------------VESQV---------------RKL----------TD--L--PE-EV-A---VE------LY----VAA-NDAAT-AA-G---Q-RLTAESLGQLAQ -_Streptomyces_sp_R1-NS-10_759540387 L-T--AD-EQERLAA---CVAGI-ELLSTAT-------WVAGKSLDTVAT-----GRLFR--VIPHKLEPERC--YK--T-IEEWSE-------TEY-GI-SRSRCSQLR--DGWELGEVL----T------------------------------------------------------VRG-H----------------------K---------------------------------AP----------------EGQV---------------REL----------VP--F--FK-QH-G---LK-AAVG-VY--E-MVV-QAAGA--------D-KITAKRLRETVK -_Streptomyces_rochei_690403288 I-Q--EA-DRRTELA---TEQIT-Q---QYL-------LWVGEPYRIVRD-----EELYR--V---------AG-YS--S-FDDWGR-------ALN-GR-SGDYMNKII--RVAPVVRAL----S------------------------------------------------------HIT-R--------------------------------------------------------RQ-L-------------KEQPL---------------RPL----------IA--V--QR-ER-G---DE-AVRR-CW--R-KAE---ASG----------DLTERGLRAAAV -_Cyanothece_sp_PCC_8802_506264213 
M-S-----ISWATNE---IKQHL-----LNW-------CRVGIVAQQVKR-----FCKWK--D---------LK-LT--S-FKEYCE-------TIL-GV-SCGYINQII--KCAKVTLDL----A------------------------------------------------------SMG----FE--------------------------------------------V-------LP-T-------------NPSQA---------------KHL----------LK--F--EG-ED---L-KA------AW--Q-QVL-DENPK--------H-LITAKAIEKTLN -_Nostoc_punctiforme_501377550 L-------EQSILEG---IEAGK-----KGF-------QQAAQALLRIDE-----LALWR----------G-E--AV--S-FDAYRQ--------KF-----------------KAVLEDL---------------------------------------------------------------D------------------------------------------------I------------------------TDRHL---------------NRL--------------L--AA-EK-C---VQ------ML----RPI----------------GLNICTHKEVIS -_Borrelia_recurrentis_501533313 --------EEIRART---LEEAV------NK-------VELSKALYEIKK-----NKLYK--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYV--KIGAYLERE---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDI-MQ------------SSL----------NK--I------------IN----------D-IRV-KRDSS----------YCKPVVVKFNLK -_Streptomyces_albidoflavus_663309700 L-N--PA-EVLRLNQ---AESQI-RAFGKAA-------AAAGEAFDMIKK-----DGLHH--H---------Y--GL--T-WAQYTL-------THW-GL-SASQADRLI--AAAPVMREL---------------------------------------------------------------S------------------------------------------------I----------A-------------NEGTA---------------REF--VSIY----RE--W--GA-TS-A---RA------IW----SGA-KEGAD--------G-KPTAKIANIAVK -_Streptomyces_sp_NRRL_S-1022_663349955 L-R--PD-ELEQLAE---CHRAI-DNARSAQ-------WMLGRALEIVRR-----RRLYR--G------DG----SR--T-WPQYLA-----A-EHD-GM-TERDARRLQ--EEWRLAKAV-----------------------------------------------------------QDA--------------------------------------------------L-----G-KP-A-------------PASHV---------------RAM----------LD--Y--AD-ST-S---DE-QAAH-AY----VML-RSAFE-AA-Q-V-R-LAAHQITARVAK -_Kitasatospora_sp_MBT66_759768668 
L-I--AQ-ERDLLVK---CEAAL-ENLRIAY-------WAAGKALEAIRG-----GRLYR--A---------T--HG--T-FEAYCL-------ELW-DI-SPQYAGKLI--RAWRVAEKV----F------------------------------------------------------ESL-G----------------------P----------KSN------------D-LETIVSKR-L-------------GYGQA---------------WEL----------VA--L--SE-EH-G---VD-AAAL-LY----VAL-IQAKG--------M-ALTAAMVAGAAK -_Streptomyces_sp_Tu_6176_740047622 --------QKEQTEA-V-ISTAL-AAGDAAV-------WVIAQGLERAAK-----GRWWR--R---------T--HT--S-LGSYVE-----A-----KI-GRSAVYGRQ--LRKNAPLAL----E------------------------------------------------------TAH-K---------------------------------TGT--------------------VP---------------KPSQV---------------KVT----------SK--T--EE-QY-G---RE-AAVT-LY----EVV-RDVSS-EL-G---A-HPTADSLMAVHK -_Deinococcus_marmoris_736389644 L-E--PH-QKARLTA---LETTV-RDGLRDF-------RRTGQALSEIRD-----NEFFR--A---------G--YD--S-FESYLQ-------DRW-GF-TPPQAGRLM--EAADVAKVL----D------------------------------------------------------PLG--------------------------------------------------I-------QP-K-------------NEAQA---------------RTF----------KA-----AA----K-----------LV----TEM--------------E-PEQQRVVARLVE -_Nonomuraea_candida_759952224 L----GA-QEAHDGA---VERAD-----RYL-----T-LTQGLALEAVKK-----DDLWR--------LLG----FK--S-FQEYVE-------QRL-NI-SRQHAYKMM--QAAPVHRDL-------------------------------------------------------------------------------------------------------------------------P-H-------------VERLT-----F---------RQI--AIL-----AR--L--KD-AA-T---RQ----K-VW--------SLAEK------W-E-DTSPPSLQKAVD dsmv_2585_Desulfococcus_multivorans_DSM_2059_523467872 I----ND-EDEDEFI---IEYEP-----PWF-------VQVGQALSHLKE-ALL-TEFPC--A---------E--QG--P-LPRWGE--TC---KEL-SI-SQSYANRLI--AAAEVYVAL----R------------------------------------------------------SAG--------------------------------------------------I-DEDD--LP-I-------------YERQV---------------RPL----------VR--F--KQ-DP-S-I-LK----Y-LW--E-EAL-VIAED-IE-F-N-S-LPRAGVVEYVVG Cflav_PD5941_Pedosphaera_parvula_Ellin514_223896866 
L-T--TE-ETTTLSE---CEAVI-EEGMKTF-------VEVGSAVLTISD-----RRLYR--A---------T--HS--T-FEDYIQ-------DKW-DM-TARRAYQLC--EAAEVVMKL----E------------------------------------------------------NVK-H---------------------------------ASQ------------I----------E-------------NARQA---------------EAL----------AK-----AP-EE-K---RD----E-VL--E-KAI----------Q---T-APEGKLTSKH-- -_Streptomyces_turgidiscabies_493426429 L-M--AA-EVTKSKS---IAWSK-----LRW-T-----VETGAALRVLIE-----EDLYK--------EDP-E--FT--S-LETYAD-------NRL-HL-SRGHVYELV-DDASRLLAVA-----------------------------------------------------------PLS-E----------------I----------------SDK------------P-------FN-A-------------SQAKV-----L---------APL----------ME--V--YA-ED-G---VE-GGRT-----K-AEL-VVADV-DS-T-G-K-KRTAAALRKAAE -_Borrelia_hispanica_639481918 --------EEIRART---LEEAV------NK-------VELSKALYEIKK-----NKLYK--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYV--KIGAYLERE---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDI-MQ------------SSL----------NK--I------------IN----------D-IRV-KRDSS----------YCKPVVVKFNLK -_Borrelia_persica_639480287 --------EEIKART---LEEAV------NK-------VELSKALYEIKK-----NKLYK--F---------DG-YD--N-FYEFCL--------DY-KF-SRTMIYRYI--KIGAYLERE---------------------------------------------------------------S-----------------------------------------------------------I-------------SEQDI-ID------------SSL----------NK--I------------IN----------D-IRV-KRNSS----------YCKPVVVKFNLK -_Streptomyces_sp_NRRL_S-455_663210974 -----------ASHS---LKAAR-A---RFV-------VEAGTALRAIRD--ED-GGLYK--V---------T--HE--T-FEQYIS-------DRW-DM-DRSRAYQLI--DAAPTMNLL----S----------------------------------------------------------K------------------------------------------------I-FD----TA-P-------------VESQA---------------RAL----------AP--V--LE-AH-G---EE-AVRE-VV----VAV-KQAGA----------KVTAATIKEAAH -_Geminocystis_herdmanii_515865463 
--------KYNQLKQ---ITNSI-KYNRINY-------IKLGLQLYQVKY-----YHLYK--N---------E--YK--S-FKDYCE-------KAV-YY-PVWRANQVI--DSANVAIKL----I------------------------------------------------------KLG----FN--------------------------------------------I-------IP-Q-------------NEAQA---------------RLL----------IK--L--NE-EQ---L-ME------KW--Q-EVL-NTYEP------Y-K-ITANRIERIVFG -_Opitutaceae_bacterium_TAV5_645069929 L-T--RD-ERDQLTR---CELNI-E---RNL-------SSIGAALKTIRD-----QRLYR--E---------T--HD--S-FDRYCY-------ERW-QK-SARWAHYQI--AAAIFAEEH-----------------------------------------------------------PEA-D---------------------------------GGE------------V-RR----IA-A-------------GRAPD--P------------SPA----------AK--A--AP-AK-P---TK-SD---RY--R-EQI-RALAE-MDREAL-D-RATQQSIGKLAE -_Deinococcus_misasensis_736313124 L-S--PA-ERLALQD---RESII-EGGQQAA-------KATWKALAEIHD-----LQLYR--E------------LG--T-WEDYLQ-------KRW-GI-KAAHGYRQV--AAAAIDVIL----T------------------------------------------------------GAG--------------------------------------------------V-------AI-P-------------TERRL---------------RPL--TKI-----LE--L--PE-DL-Q---DA------TA----RAM-KAIFG--------S-KPSSSEVEAFAE -_Streptomyces_sp_URHA0041_697211997 --------QLQRTEE-V-IAAAV-AAGDTAL-------WVIAQALERAAK-----GRWWR--A---------T--HD--S-LTAYVE-----E-----TV-GRSAVYVRQ--LRMNAPLAL----E------------------------------------------------------TAR-R---------------------------------TGT--------------------VP---------------KPSQI---------------KAT----------RK--T--EQ-RH-G---LD-AAVT-LY----EVV-RDVAA-EL-G---G-EPTARGLIAVHN -_Deinococcus_radiodurans_653294307 L-A--PH-EQQRLDD---LEQTV-EGGLRDF-------QRTGQALSEIRD-----NELYR--A---------T--HD--S-FEAYLQ-------DRW-GF-GVRQADRLI--DAAQVAKQL----E------------------------------------------------------PLG--------------------------------------------------I-------SP-R-------------HEAQA---------------RSF----------RP-----AA----R-----------IV----EEL--------------E-PEQQRLVARLVE -_Streptomyces_niveus_558889206 
L-S--EE-DRLLLSQ---CEGRI-QAFGKAA-------ADAGEAFDTIKD-----KELHR--H---------Y--RM--T-WAEYTM-------ARW-GV-SVSQVDRLI--AAAPVMREL---------------------------------------------------------------S------------------------------------------------I----------P-------------NEGTA---------------REL--VPAY----RE--W--GA-DS-A---RA------LW----DGT-KEGSE--------N-KPTAKVLRAAVQ -_Streptomyces_griseus_702687171 L-I--GE-EQDHLAR---CEAAV-ETLKGAF-------WAAGKALQIIRD-----ARLYR--Q---------T--HG--T-FDAYCD-------DRW-GM-NRQYADKLI--RTWPIAEAL----Y------------------------------------------------------ERQLA----------------------A----------AKG------------E-TTPIGVKK-L-------------NQAQM---------------WEL----------VP--V--AD-SW-D---VD-AATF-VYETVADTV-VQVDG--------R-DVTAAVIQGAVK -_Streptomyces_sp_S4_498331267 L-S--EE-DRLLLSQ---CEGRI-QAFGKAA-------ADAGEAFDTIKN-----KELHR--H---------Y--RM--T-WAEYTL-------ARW-GV-SVSQVDRLI--AAAPVMREL---------------------------------------------------------------S------------------------------------------------I----------P-------------NEGTA---------------REL--VPAY----RD--W--GA-DS-A---RA------LW----NGT-KEGSG--------N-KPTAKVLKAAVQ -_Kitasatospora_sp_MBT66_759768796 L-T--AE-EEQRLAA---CVEGV-ELLSTAY-------WVAGKSLDTMAV-----GRLFR--KLPHRLEPARC--YA--T-IEEWAD-------VEH-GI-RQSRCSKLR--AGWELGEVL----N------------------------------------------------------AHG-H----------------------K---------------------------------VP----------------EGQV---------------REL----------VP--L--KN-RH-G---LK-AAVG-VY--Q-LVV-NAVGA--------E-KVTA-------- -_Deinococcus_swuensis_746727627 L-E--PH-QKARLTA---LETTV-RDGLRDF-------RRTGQALSEIRD-----NEFFR--A---------G--YD--S-FEAYLQ-------DRW-GF-TPPQAGRLM--EAADVAKVL----D------------------------------------------------------PLG--------------------------------------------------I-------QP-R-------------NEAQA---------------RTF----------KA-----AA----K-----------IV----TEL--------------E-PEQQRVVARLVE -_Deinococcus_frigens_736394879 
L-E--PH-QKARLTA---LETTV-RDGLRDF-------RRTGQALSEIRD-----NEFFR--A---------G--YG--S-FEAYLQ-------NRW-GF-TPPQAGRLM--DAADVARVL----D------------------------------------------------------PLG--------------------------------------------------I-------QP-K-------------NEAQA---------------RTF----------KA-----AA----R-----------IV----TEL--------------E-PEDQRVVARLVE GM3709_2810_Geminocystis_sp_NIES-3709_770473153 --------KYNQLTH---ITNSI-KYNRINY-------IKLGMQLYQVRY-----YKLYK--S---------S--YT--S-FKDYCE-------KAV-YY-PVWRANQVI--ESASIAIKL----I------------------------------------------------------KAG----FN--------------------------------------------I-------IP-Q-------------NEAQA---------------RLL----------IK--L--NE-EE---L-MR------KW--Q-EVL-DTYEP------Y-K-ITANRIEKIVFG -_Streptomyces_sp_NRRL_F-5008_740027662 L-T--PE-ERADLET---CERAV-SGLQTAF-------TVAGKALATINQ-----ARLYK--E---------T--HS--S-FAAYVE-------DRW-GM-RKSQAYRLI--EAWPVAVAL-----------------------------------------------------------SSG-P----------------------N-------------------------V-------SP-R-------------GDTSA-------PPEKHV--RAL----------LP--V--VK-RH-G---LD-AARV-VY----EEL-REQDA--------R-VTTTRVTQAVRV -_Deinococcus_misasensis_736303905 L-T--SE-EQSQLDQ---LESTI-QQAVEQV-------KAGWHALKEIHD-----KGLYR--L------------YG--T-WEEYLQ-------KRW-NI-SATHGHRQV--AAAALDTIL----L------------------------------------------------------NAG--------------------------------------------------V-------VV-E-------------AERRL---------------RPL--TPL-----LD--L--PD-ED-Q---VA------IV----RTL-RETCG--------S-KPSTAQVKAFAE -_Streptomyces_halstedii_664288086 V-T-----ERVIHAA---LAAGD-----AAI-------WVIGKALTVAAK-----GKFHR--D---------Q--GM--T-FDEYAR-------AET-GK-SPAHARRWM--DGAPLALAV-A--A------------------------------------------------------ATS-S--------------------------------------------------------TP---------------PEGHV---------------RPL----------RK--I--EK-EI-G---TR-PAIE-LY----RSA-DKASG-EG-G---R-KVTGAVLVEIRK -_Streptomyces_bikiniensis_663184423 
L-S--DQ-EREDVEA---CKAGV-DNLRNAF-------WVAGKSLETMST-----AKLHR--E---------E--NP--N-FAEWIW-------EKW-EI-SESNLYRLI--DEWRVGEAL----A------------------------------------------------------NLG-H---------------------------------------------------------K-P-------------LESHV---------------RKM----------TE--L--RR-QT-S---DK-VAIT-VY--D-TIA---RCR--------T-RVTGDLVEKVVN -_Streptomyces_sp_CNT372_739896924 L-T--PS-ERADLET---CERAV-SGLQTAF-------TVAGKALATINQ-----ARLYK--E---------T--HP--S-FAAYVE-------DRW-GM-RRAQAYRLI--EAWPVAVAL-----------------------------------------------------------SSG-P----------------------D-------------------------V-------SP-R-------------GDTSA-------PPERHV--RAL----------LP--V--VK-RH-G---LD-AARA-VY----EEL-REQDA--------R-VTTTRVTQAVRA -_Diplosphaera_colitermitum_759901356 L-T--RD-ERDQLTR---CEANI-E---RGL-------TAVGQALKTIRD-----NRLYR--E---------T--HD--S-FDIYCY-------ERW-QK-SVRWANYQI--AAAIFAGEH-----------------------------------------------------------PEA-E------------------------ITS------ERQ------------A-RA----LR-S-------------GGSTP--E------------ATS----------PA--A--TP-SK-P---TK-ND---RY--R-EQI-RALAE-MDREAL-D-RATQQSIAKLAE -_Opitutaceae_bacterium_TAV5_497194662 L-T--RE-ERDQLTR---CEANI-E---RGL-------TAVGQALKTIRD-----ARLYR--E---------T--HD--N-FEAYCY-------ERW-QR-SKRWANYQI--AAAIFAEEN-----------------------------------------------------------PEA-D---------------------------------EGE------------V-RR----IA-A-------------GRAPS--R------------ESA----------PA--A--AP-AK-P---TK-SD---RY--R-EQI-RALAE-MDREAL-D-RATQQSIGKLAE -_Pleurocapsa_sp_PCC_7319_518337503 I-E--GD-FNLDYIV---FEFAY-NR--LAY-------VRNGLLLAKLKF-----LKLYK--N---------FG-DG--T-FATFCR-------EQL-KI-TRWQVNDNI--KAARVCLEL-----------------------------------------------------------IYA-G--FE--------------------------------------------I-------LP-T-------------NISQA---------------IAL----------AS--L--AG-DE---L-IH------AW--R-SVI-ESIEP--------D-KITHKSIKSFLF -_Streptomyces_megasporus_671527277 
L-T--EA-ERADLTT---CQAVL-QQHHASF-------WLTGKALETISK-----RRLYR--A---------D--HP--T-FEAFL--------EDW-DI-TPADAYRMM--NGWPLANRL-L--R------------------------------------------------------DVP-K--------------------------------------------------------LT-R-------------SHVEA-----L---------LPV--VNR-----YG--V--EA-AA-T-L-HA------LL--R-DSL--------------P-KVTAAAIAQVVR -_Lamprocystis_purpurea_521992951 M-T--AE-ECDLYVK---LFQKS-ESD-QRF-------Y-----LLKIRE-----EKGWK--A---------KG-FE--S-FDAFGE-------SVL-GV-TIGRLNQLA--RAAEVQLSI-----------------------------------------------------------GND-T------------------------------------------------I-VSK---IP----------------EGQL---------------RPL----------AP--L--TD-EE-R---RT------VW--A-EAT-AKAEE-DG-----R-KLTARLVQEAVD consensus/100% ............................................h....................................b............................................................................................................................................................................................................................................................................... consensus/95% ..................h.........................l..hpp......pha....................p.a..a...........h...........bh......h...h.............................................................................................................................................h.................h..............h.....................h................................... consensus/90% ........b.........hp..................h.....L..lpp......chab...............a...s.F..ah.........pa....s...s..hh......h...h.............................................................................................................................................h...............p.L..............h.....................h.........................s...h...h. 
consensus/85% h.......b...b.....hp..h...............h...bhL.plpc......chYc...............a...s.Fp.ah.........pa..h.sbp.s..hl...u..l.p.l.........................................................................................................................................pp..h...............+sL..............l....................ha....p....................s...h..hh. consensus/80% l.......cp..b.....hp..l...............hp..bhLbpl+-.....pchY+...............a...s.Fp.ahb........pa..h.scpps..hl...u..l.psl.........................................................................................................................................pp..h...............+sL..............l.............p......ha....psh..................ss..lpphh. consensus/75% L.......cpp.b.p...hcp.l...............hps.bhLbpl+-.....pchY+...............ap..o.Fcpahb........ca..h.s+pps.bhl...us.l.psl...............................................................p........................................................h................pp..l...............+sL..............L.....pp......p......ha....psh..................oubhlpphhp consensus/70% L.......cppbb.p...hcp.l...............hps.bhLbpl+-.....pchY+...............ap..o.Fcpahb........ca..h.u+ppsabhl...Assl.csl...............................................................p........................................................h..p.............ppp.l...............+sL..........hp..L..p..pp.p....p......ha..p.psh..................oubhlcplhp
GI Gene neighborhood Domain-architecture Pfam Gene name Len Taxonomy Species Genbank annotation # 202; Mostly ParA associated in a terminase context 576100681 BlyB-holin->orfD->Borrelia_orfA->DUF226->ParA->ParB-HTH*->?->Terminase_LS-> ParB-HTH Plasmid_parti BAN_0003100 195 bacteria>spirochaetes Borrelia anserina BA2 Putative plasmid partition protein (plasmid) [Borrelia anserina BA2]. 576100674_?->576100675_?->576100676_BlyB-holin->576100677_orfD->576100678_Borrelia_orfA->576100679_DUF226->576100680_ParA->576100681_ParB-HTH*->576100682_?->576100683_Terminase_LS-><-576100684_?<-576100685_?||576100686_?->576100687_?->576100688_?-> 503783548 orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->XerD->?->?->Terminase_LS-> ParB-HTH Plasmid_parti BBIDN127_RS05350 192 bacteria>spirochaetes Borrelia bissettii permease [Borrelia bissettii]. 503783542_orfD->503783543_BdrA->503783544_Mlp-><-503783545_ERF||503783546_Borrelia_orfA->497943782_DUF226->503783547_ParA->503783548_ParB-HTH*->503783549_BdrA->503783518_XerD->503783554_?->503783555_?->503783556_Terminase_LS-> 576091528 <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin ParB-HTH Plasmid_parti BHW_0003100 191 bacteria>spirochaetes Borrelia hermsii MTW Plasmid partition family protein (plasmid) [Borrelia hermsii MTW]. <-576091521_?||576091522_?->576091523_?-><-576091524_?||576091525_?-><-576091526_Terminase_LS<-576091527_?<-576091528_ParB-HTH*<-576091529_ParA<-576091530_DUF226<-576091531_Borrelia_orfA<-576091532_orfD<-576091533_BlyB-holin<-576091534_?<-576091535_? 576092650 BlyB-holin->orfD->?->Borrelia_orfA->DUF226->ParA->ParB-HTH*->?->Terminase_LS-> ParB-HTH Plasmid_parti BHO_0003100 191 bacteria>spirochaetes Borrelia hermsii YBT Plasmid partition family protein (plasmid) [Borrelia hermsii YBT]. 
576092643_?->576092644_BlyB-holin->576092645_orfD->576092646_?->576092647_Borrelia_orfA->576092648_DUF226->576092649_ParA->576092650_ParB-HTH*->576092651_?->576092652_Terminase_LS->576092653_?-><-576092654_?<-576092655_?<-576092656_?||576092657_?-> 576094173 BlyB-holin->orfD->Borrelia_orfA->DUF226->ParA->ParB-HTH*-><-?||?->?->Terminase_LS-> ParB-HTH Plasmid_parti BCO_0003100 191 bacteria>spirochaetes Borrelia coriaceae Co53 Plasmid partition family protein (plasmid) [Borrelia coriaceae Co53]. 576094166_?->576094167_?->576094168_BlyB-holin->576094169_orfD->576094170_Borrelia_orfA->576094171_DUF226->576094172_ParA->576094173_ParB-HTH*-><-576094174_?||576094175_?->576094176_?->576094177_Terminase_LS->576094178_?-><-576094179_?<-576094180_? 576105484 <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin ParB-HTH Plasmid_parti BHY_1114 191 bacteria>spirochaetes Borrelia hermsii YOR Plasmid partition family protein (plasmid) [Borrelia hermsii YOR]. <-576105477_?||576105478_?->576105479_?-><-576105480_?||576105481_?-><-576105482_Terminase_LS<-576105483_?<-576105484_ParB-HTH*<-576105485_ParA<-576105486_DUF226<-576105487_Borrelia_orfA<-576105488_orfD<-576105489_BlyB-holin<-576105490_?<-576105491_? 497942632 BlyB-holin->Borrelia_lipo_2->orfD-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase||?->?->Terminase_LS-> ParB-HTH Plasmid_parti BBUN40_RS06040 186 bacteria>spirochaetes Borrelia burgdorferi chromosome partitioning protein [Borrelia burgdorferi]. 
497942599_BlyB-holin->497942602_Borrelia_lipo_2->497942605_orfD-><-504353774_ERF||497942614_Borrelia_orfA->497942617_DUF226->497942626_ParA->497942632_ParB-HTH*->497942636_BdrA->504353775_BppA->497943617_XerD-><-497943614_Phage-integrase||497942645_?->497942648_?->497943610_Terminase_LS-> 201084505 <-Lipoprotein_2<-Lipoprotein_2<-?<-?<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226||Borrelia_orfA->?-><-BdrA ParB-HTH SP+Plasmid_parti BDU_7025 205 bacteria>spirochaetes Borrelia duttonii Ly PF49 plasmid partition protein (plasmid) [Borrelia duttonii Ly]. <-201084498_Lipoprotein_2<-201084499_Lipoprotein_2<-201084500_?<-201084501_?<-201084502_Lipoprotein_2<-201084503_Lipoprotein_2<-201084504_Lipoprotein_2<-201084505_ParB-HTH*<-201084506_ParA<-201084507_DUF226||201084508_Borrelia_orfA->201084509_?-><-201084510_BdrA 644979506 DUF226->ParA->ParB-HTH*-> ParB-HTH Plasmid_parti BHW_RS04950 204 bacteria>spirochaetes Borrelia hermsii permease, partial [Borrelia hermsii]. 752506912_?->644979504_DUF226->644979505_ParA->644979506_ParB-HTH*-> 501533114 DUF226->ParA->ParB-HTH*->Lipoprotein_2->?-><-?||?->Lipoprotein_2-><-Mlp ParB-HTH Plasmid_parti BRE_RS05055 199 bacteria>spirochaetes Borrelia recurrentis permease [Borrelia recurrentis]. <-501533109_?<-752506110_?<-752506368_?<-752506369_?||752506370_?->752506374_DUF226->501533113_ParA->501533114_ParB-HTH*->752506375_Lipoprotein_2->501533115_?-><-501533116_?||501533117_?->501533118_Lipoprotein_2-><-501533119_Mlp||752506371_?-> 501533328 <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-?||?-><-?<-ParB-HTH*<-ParA<-DUF226<-?||Lipoprotein_2->Lipoprotein_2-> ParB-HTH Plasmid_parti BDU_RS07380 199 bacteria>spirochaetes Borrelia duttonii permease [Borrelia duttonii]. 
<-501533318_Lipoprotein_2<-752506111_Lipoprotein_2<-501533321_Lipoprotein_2<-752506107_Lipoprotein_2<-501533323_?||501533325_?-><-501533326_?<-501533328_ParB-HTH*<-501533113_ParA<-752506112_DUF226<-752506108_?||501533331_Lipoprotein_2->752506113_Lipoprotein_2->752506109_?->752506110_?-> 576092807 <-Lipoprotein_2||?-><-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA ParB-HTH Plasmid_parti BHO_0016000 199 bacteria>spirochaetes Borrelia hermsii YBT Putative plasmid partition protein (plasmid) [Borrelia hermsii YBT]. <-576092804_?<-576092805_Lipoprotein_2||576092806_?-><-576092807_ParB-HTH*<-576092808_ParA<-576092809_DUF226<-576092810_Borrelia_orfA||576092811_?->576092812_?->576092813_?-><-576092814_? 644980725 <-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226<-?||BdrA->Lipoprotein_2-> ParB-HTH SP+Plasmid_parti BCD_RS06505 199 bacteria>spirochaetes Borrelia crocidurae permease [Borrelia crocidurae]. <-749307931_Lipoprotein_2<-644980725_ParB-HTH*<-644980726_ParA<-644980727_DUF226<-644980728_?||644980729_BdrA->644980730_Lipoprotein_2->644980732_?->644980733_?-> 576093010 <-SSB<-BppA<-?<-BppA||?->DUF226->ParA->ParB-HTH*->ERF->?->?->Lipoprotein_2->?->Lipoprotein_2->Lipoprotein_2-> ParB-HTH SP+Plasmid_parti BHO_0006701 198 bacteria>spirochaetes Borrelia hermsii YBT Putative plasmid partition protein (plasmid) [Borrelia hermsii YBT]. <-576093003_SSB<-576093004_BppA<-576093005_?<-576093006_BppA||576093007_?->576093008_DUF226->576093009_ParA->576093010_ParB-HTH*->576093011_ERF->576093012_?->576093013_?->576093014_Lipoprotein_2->576093015_?->576093016_Lipoprotein_2->576093017_Lipoprotein_2-> 639481672 <-Lipoprotein_2<-?<-?<-Lipoprotein_2||?->DUF226->ParA->ParB-HTH*->Lipoprotein_2->Lipoprotein_2->?->Lipoprotein_2->Lipoprotein_2-> ParB-HTH Plasmid_parti U880_RS0100260 198 bacteria>spirochaetes Borrelia hispanica permease [Borrelia hispanica]. 
<-740572702_Lipoprotein_2<-639481666_?<-639481667_?<-639481668_Lipoprotein_2||639481669_?->639481670_DUF226->639481671_ParA->639481672_ParB-HTH*->740572705_Lipoprotein_2->639481674_Lipoprotein_2->639481675_?->740572708_Lipoprotein_2->639481677_Lipoprotein_2-> 695263537 ParA->ParB-HTH*-> ParB-HTH SP+Plasmid_parti PF-49 197 bacteria>spirochaetes Borrelia burgdorferi PF-49 protein [Borrelia burgdorferi]. 695208857_?->695208858_ParA->695263537_ParB-HTH*->695208860_?-> 736012165 Borrelia_orfA->DUF226->ParA->ParB-HTH*->Lipoprotein_2-> ParB-HTH Plasmid_parti I871_B18 197 bacteria>spirochaetes Borrelia miyamotoi LB-2001 plasmid partition protein (plasmid) [Borrelia miyamotoi LB-2001]. 736012158_?->736012159_?-><-736012160_?<-736012161_?||736012162_Borrelia_orfA->736012163_DUF226->736012164_ParA->736012165_ParB-HTH*->736012166_Lipoprotein_2-> 576102339 <-Lipoprotein_2<-Lipoprotein_2<-?<-?<-Lipoprotein_2<-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226<-?<-?||?-><-BdrA ParB-HTH SP+Plasmid_parti BCD_1474 195 bacteria>spirochaetes Borrelia crocidurae DOU Putative plasmid partition protein (plasmid) [Borrelia crocidurae DOU]. <-576102332_?<-576102333_Lipoprotein_2<-576102334_Lipoprotein_2<-576102335_?<-576102336_?<-576102337_Lipoprotein_2<-576102338_Lipoprotein_2<-576102339_ParB-HTH*<-576102340_ParA<-576102341_DUF226<-576102342_?<-576102343_?||576102344_?-><-576102345_BdrA||576102346_?-> 501669395 Borrelia_lipo_1->Borrelia_lipo_1->Borrelia_lipo_1-><-?||Borrelia_orfA->DUF226->ParA->ParB-HTH*-><-?||METHYLASE-> ParB-HTH SP+Plasmid_parti BAFPKO_RS06170 194 bacteria>spirochaetes Borrelia afzelii permease [Borrelia afzelii]. 
500023161_Borrelia_lipo_1->500023159_Borrelia_lipo_1->500023158_Borrelia_lipo_1-><-500023156_?||500023155_Borrelia_orfA->500023154_DUF226->500023153_ParA->501669395_ParB-HTH*-><-500023148_?||500023144_METHYLASE->500023143_?->500023142_?->500023141_?-> 501532751 DUF226->ParA->ParB-HTH*-> ParB-HTH Plasmid_parti Q7M_RS06865 193 bacteria>spirochaetes Borrelia MULTISPECIES: permease [Borrelia]. 504499987_DUF226->504499988_ParA->501532751_ParB-HTH*-> 639480216 Borrelia_orfA->DUF226->ParA->ParB-HTH*-> ParB-HTH Plasmid_parti U881_RS0101565 193 bacteria>spirochaetes Borrelia persica permease [Borrelia persica]. 639480213_Borrelia_orfA->639480214_DUF226->639480215_ParA->639480216_ParB-HTH*-> 639480329 <-ERF<-ParB-HTH*<-ParA<-DUF226 ParB-HTH Plasmid_parti U881_RS0102340 193 bacteria>spirochaetes Borrelia persica permease [Borrelia persica]. <-639480328_ERF<-639480329_ParB-HTH*<-639480330_ParA<-639480331_DUF226<-639480332_? 639481645 <-BppA||Borrelia_orfA->DUF226->ParA->ParB-HTH*->ERF-> ParB-HTH Plasmid_parti U880_RS0100085 193 bacteria>spirochaetes Borrelia hispanica permease [Borrelia hispanica]. <-740572680_BppA||639481642_Borrelia_orfA->639481643_DUF226->639481644_ParA->639481645_ParB-HTH*->740572681_ERF-> 644980942 Borrelia_orfA->DUF226->ParA->ParB-HTH*->ERF-><-Mlp||BdrA->?->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2-> ParB-HTH Plasmid_parti BCD_RS07715 193 bacteria>spirochaetes Borrelia crocidurae permease [Borrelia crocidurae]. <-644980938_?<-749308044_?<-749308051_?<-749308045_?||644980939_Borrelia_orfA->644980940_DUF226->644980941_ParA->644980942_ParB-HTH*->749308052_ERF-><-644980943_Mlp||644980944_BdrA->644980945_?->749308046_Lipoprotein_2->644980947_Lipoprotein_2->644980948_Lipoprotein_2-> 740577602 <-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA ParB-HTH Plasmid_parti U881_RS10530 193 bacteria>spirochaetes Borrelia persica permease [Borrelia persica]. 
<-639480289_ERF<-740577602_ParB-HTH*<-639480290_ParA<-639480291_DUF226<-639480292_Borrelia_orfA 496158399 orfD-><-?||Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase||Borrelia_lipo_1-> ParB-HTH SP+Plasmid_parti NM71_RS06435 192 bacteria>spirochaetes Borrelia burgdorferi group MULTISPECIES: permease [Borrelia burgdorferi group]. 499186263_orfD-><-497944296_?||499186264_Mlp-><-499186265_ERF||499186266_Borrelia_orfA->499186267_DUF226->499186268_ParA->496158399_ParB-HTH*->497944503_BdrA->499186269_BppA->499186270_XerD-><-763427946_Phage-integrase||499186272_Borrelia_lipo_1-><-499186279_?||499186280_?-> 506379547 ParA->ParB-HTH-><-Borrelia_lipo_1<-?<-?<-XerD<-BdrA<-ParB-HTH*<-Borrelia_lipo_2 ParB-HTH Plasmid_parti BVAVS116_RS05635 192 bacteria>spirochaetes Borrelia valaisiana permease [Borrelia valaisiana]. 506379527_ParA->506379528_ParB-HTH-><-750014218_Borrelia_lipo_1<-506379538_?<-506379539_?<-506379541_XerD<-506379546_BdrA<-506379547_ParB-HTH*<-750014206_Borrelia_lipo_2<-750014207_?<-506379572_?<-506379580_?<-506379581_?<-506379584_?<-506379590_? 645063171 Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA-> ParB-HTH SP+Plasmid_parti BHY_RS06695 192 bacteria>spirochaetes Borrelia hermsii permease [Borrelia hermsii]. 749302146_Borrelia_orfA->645063169_DUF226->645063170_ParA->645063171_ParB-HTH*->645063172_BdrA-> 657235060 Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA-> ParB-HTH SP+Plasmid_parti DZ03_RS0105960 192 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. 657235057_Borrelia_orfA->657235058_DUF226->657235059_ParA->657235060_ParB-HTH*->657235061_BdrA-> 695262165 ParA->ParB-HTH*-> ParB-HTH SP+Plasmid_parti YP_009075307.1 192 bacteria>spirochaetes Borrelia burgdorferi putative plasmid partition protein; orf3 [Borrelia burgdorferi]. 
695198281_?->695198282_ParA->695262165_ParB-HTH*->695198284_?-> 695262844 <-BdrA<-ParB-HTH*<-ParA<-DUF226<-?||BppA-> ParB-HTH SP+Plasmid_parti YP_009076506.1 192 bacteria>spirochaetes Borrelia hermsii hypothetical protein [Borrelia hermsii]. <-695199855_BdrA<-695262844_ParB-HTH*<-695199857_ParA<-695199858_DUF226<-695199859_?||695199860_BppA-> 501533150 <-BdrA||Mlp-><-SSB||Borrelia_lipo_2->?->DUF226->ParA->ParB-HTH*-><-Mlp||?->BdrA->?->Lipoprotein_2->Lipoprotein_2-> ParB-HTH SP+Plasmid_parti BRE_RS05250 191 bacteria>spirochaetes Borrelia recurrentis permease [Borrelia recurrentis]. <-501533144_BdrA||501533145_Mlp-><-752506379_SSB||752506380_Borrelia_lipo_2->501533147_?->501533148_DUF226->501533149_ParA->501533150_ParB-HTH*-><-501533151_Mlp||501533152_?->501533153_BdrA->752506381_?->501533154_Lipoprotein_2->501533155_Lipoprotein_2->501533156_?-> 501533271 DUF226->ParA->ParB-HTH*->BppA->SSB->SSB->XerD-><-Phage-integrase<-Mlp ParB-HTH SP+Plasmid_parti BDU_RS05365 191 bacteria>spirochaetes Borrelia duttonii permease [Borrelia duttonii]. 501533268_?->501533269_DUF226->501533270_ParA->501533271_ParB-HTH*->752505977_BppA->752505978_SSB->752505984_SSB->501533274_XerD-><-501533275_Phage-integrase<-501533276_Mlp||501533277_?-> 639481710 <-ERF||?->DUF226->ParA->ParB-HTH*->BppA-> ParB-HTH Plasmid_parti U880_RS0100505 191 bacteria>spirochaetes Borrelia hispanica permease [Borrelia hispanica]. <-740572765_ERF||639481707_?->639481708_DUF226->639481709_ParA->639481710_ParB-HTH*->639481711_BppA-> 644980346 <-ERF||DUF226->ParA->ParB-HTH*->BppA-> ParB-HTH SP+Plasmid_parti BCD_RS04485 191 bacteria>spirochaetes Borrelia crocidurae permease [Borrelia crocidurae]. <-749307677_ERF||644980344_DUF226->644980345_ParA->644980346_ParB-HTH*->644980347_BppA-> 645024139 <-ERF<-ParB-HTH*<-ERF<-ParB-HTH<-ParA<-DUF226<-Borrelia_orfA||BppA-><-Borrelia_orfA ParB-HTH SP+Plasmid_parti T431_RS0107620 191 bacteria>spirochaetes Borrelia coriaceae permease [Borrelia coriaceae]. 
654876536_?->645023990_?->740579677_?->740579685_?->645023977_?-><-654876537_ERF<-645024139_ParB-HTH*<-740579687_ERF<-654876538_ParB-HTH<-645024753_ParA<-645024752_DUF226<-654876539_Borrelia_orfA||740579689_BppA-><-645024131_Borrelia_orfA 740582129 <-BppA<-ParB-HTH*<-ParA<-DUF226<-?||ERF-><-orfD<-Borrelia_lipo_2 ParB-HTH Plasmid_parti BDCR2A_RS06520 191 bacteria>spirochaetes Borrelia duttonii permease [Borrelia duttonii]. <-740582126_BppA<-740582129_ParB-HTH*<-501533270_ParA<-501533269_DUF226<-740582133_?||740582135_ERF-><-740582138_orfD<-740582140_Borrelia_lipo_2<-740582144_? 497942842 Borrelia_orfA->DUF226->ParA->ParB-HTH*->MultiTM-><-Borrelia_lipo_1<-?<-Borrelia_lipo_1 ParB-HTH SP+Plasmid_parti NM71_RS06980 190 bacteria>spirochaetes Borrelia burgdorferi permease [Borrelia burgdorferi]. 497942808_?->497942819_?->499192785_?-><-497942827_?||499192780_Borrelia_orfA->497942834_DUF226->497942837_ParA->497942842_ParB-HTH*->499192782_MultiTM-><-499192790_Borrelia_lipo_1<-501897251_?<-499192791_Borrelia_lipo_1<-499192784_?<-499192786_?<-499192787_? 501928245 Borrelia_lipo_1->Borrelia_lipo_1-><-?||Borrelia_orfA->DUF226->ParA->ParB-HTH*->MultiTM-><-Borrelia_lipo_1<-Borrelia_lipo_1<-?<-Borrelia_lipo_1||?-><-Borrelia_lipo_1 ParB-HTH SP+Plasmid_parti BSV1_RS05810 190 bacteria>spirochaetes Borrelia finlandensis permease [Borrelia finlandensis]. 497942808_?->501928244_Borrelia_lipo_1->501928236_Borrelia_lipo_1-><-748691647_?||501928229_Borrelia_orfA->501928253_DUF226->501928228_ParA->501928245_ParB-HTH*->501928259_MultiTM-><-501928239_Borrelia_lipo_1<-501928262_Borrelia_lipo_1<-501928248_?<-501928226_Borrelia_lipo_1||501928258_?-><-748691650_Borrelia_lipo_1 504496248 <-BdrA||Mlp->?->DUF226->ParA->ParB-HTH*->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->?->?->Lipoprotein_2-> ParB-HTH Plasmid_parti Q7M_RS06620 190 bacteria>spirochaetes Borrelia crocidurae permease [Borrelia crocidurae]. 
<-504496241_BdrA||752505230_Mlp->752505231_?->504496245_DUF226->504496247_ParA->504496248_ParB-HTH*->504496250_Lipoprotein_2->504496251_Lipoprotein_2->504496252_Lipoprotein_2->504496253_Lipoprotein_2->752505232_?->752505233_?->504496261_Lipoprotein_2-> 576102765 Mlp->?-><-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||?->SSB->XerD-><-Mlp ParB-HTH Plasmid_parti BCD_1877 190 bacteria>spirochaetes Borrelia crocidurae DOU Putative plasmid partition protein (plasmid) [Borrelia crocidurae DOU]. <-576102758_?<-576102759_?<-576102760_?<-576102761_?||576102762_Mlp->576102763_?-><-576102764_ERF<-576102765_ParB-HTH*<-576102766_ParA<-576102767_DUF226<-576102768_Borrelia_orfA||576102769_?->576102770_SSB->576102771_XerD-><-576102772_Mlp 576313683 Mlp->?-><-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||?->SSB->SSB->XerD-> ParB-HTH Plasmid_parti BDCR2A_01333 190 bacteria>spirochaetes Borrelia duttonii CR2A putative plasmid partition protein [Borrelia duttonii CR2A]. <-576313676_?<-576313677_?<-576313678_?<-576313679_?||576313680_Mlp->576313681_?-><-576313682_ERF<-576313683_ParB-HTH*<-576313684_ParA<-576313685_DUF226<-576313686_Borrelia_orfA||576313687_?->576313688_SSB->576313689_SSB->576313690_XerD-> 639482667 DUF226->ParA->ParB-HTH*->?->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2-> ParB-HTH Plasmid_parti U880_RS0105985 190 bacteria>spirochaetes Borrelia hispanica permease [Borrelia hispanica]. 639481986_?->639482665_DUF226->639482666_ParA->639482667_ParB-HTH*->639482668_?->740573672_Lipoprotein_2->740573666_Lipoprotein_2->740573675_Lipoprotein_2->740573680_Lipoprotein_2->740573669_Lipoprotein_2-> 740581845 <-BdrA<-?||?->?->ParA->ParB-HTH*->Lipoprotein_2->?->Lipoprotein_2-> ParB-HTH Plasmid_parti BDCR2A_RS05930 190 bacteria>spirochaetes Borrelia duttonii permease [Borrelia duttonii]. 
<-740581832_BdrA<-740581835_?||740581837_?->740581840_?->740581843_ParA->740581845_ParB-HTH*->740581850_Lipoprotein_2->740581848_?->740581853_Lipoprotein_2-> 52696733 Borrelia_orfA->DUF226->ParA->ParB-HTH*-><-Borrelia_lipo_1<-ERF ParB-HTH SP+Plasmid_parti BGP219 189 bacteria>spirochaetes Borrelia garinii PBi hypothetical protein BGP219 [Borrelia garinii PBi]. 52696730_Borrelia_orfA->52696731_DUF226->52696732_ParA->52696733_ParB-HTH*-><-52696734_Borrelia_lipo_1<-52696735_ERF<-52696736_? 501710213 <-ParB-HTH*<-ParA<-DUF226 ParB-HTH SP+Plasmid_parti DY95_RS0104625 189 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. <-501710213_ParB-HTH*<-501710214_ParA<-501710226_DUF226 501898261 Borrelia_lipo_1-><-?||?-><-Borrelia_lipo_1||Borrelia_orfA->DUF226->ParA->ParB-HTH*-><-Borrelia_lipo_1 ParB-HTH SP+Plasmid_parti BSPA14S_RS06005 189 bacteria>spirochaetes Borrelia spielmanii permease [Borrelia spielmanii]. 501897677_Borrelia_lipo_1-><-750018078_?||501898236_?-><-501898240_Borrelia_lipo_1||501898252_Borrelia_orfA->501898255_DUF226->501898258_ParA->501898261_ParB-HTH*-><-501898271_Borrelia_lipo_1<-501898277_?<-750018079_? 503789140 <-METHYLASE||?->?-><-Borrelia_lipo_1<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||?-><-?<-Borrelia_lipo_1 ParB-HTH SP+Plasmid_parti BBIDN127_RS06445 189 bacteria>spirochaetes Borrelia bissettii permease [Borrelia bissettii]. <-503789128_METHYLASE||503789130_?->503789134_?-><-503789135_Borrelia_lipo_1<-503789140_ParB-HTH*<-503789141_ParA<-503789142_DUF226<-763175385_Borrelia_orfA||763175386_?-><-763175387_?<-763175388_Borrelia_lipo_1<-503789147_? 506379500 Lipoprotein_2->ERF->Borrelia_lipo_1-><-ParB-HTH*<-ParA<-DUF226<-?<-?<-?||MultiTM->Borrelia_orfA-> ParB-HTH SP+Plasmid_parti BVAVS116_RS05510 189 bacteria>spirochaetes Borrelia valaisiana permease [Borrelia valaisiana]. 
506379494_Lipoprotein_2->506379496_ERF->506379499_Borrelia_lipo_1-><-506379500_ParB-HTH*<-750014204_ParA<-506379502_DUF226<-506379503_?<-506379508_?<-506379515_?||750014216_MultiTM->506379525_Borrelia_orfA-> 657234804 <-BdrA<-?||?->?-><-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||?->Borrelia_lipo_1->?-><-Borrelia_lipo_1 ParB-HTH SP+Plasmid_parti DZ03_RS0103740 189 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. 657234800_?-><-696419413_BdrA<-657234801_?||657234802_?->657234803_?-><-657234804_ParB-HTH*<-657234805_ParA<-657234806_DUF226<-657234807_Borrelia_orfA||696419402_?->696419404_Borrelia_lipo_1->657234808_?-><-657234809_Borrelia_lipo_1 657248004 <-BdrA<-DUF226<-?<-?<-ParB-HTH*<-ParA||?->Borrelia_lipo_1->?-><-Borrelia_lipo_1 ParB-HTH SP+Plasmid_parti DZ00_RS0100090 189 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. <-671502334_BdrA<-696414189_DUF226<-657247698_?<-671502337_?<-657248004_ParB-HTH*<-671502341_ParA||696414192_?->696414195_Borrelia_lipo_1->501710212_?-><-696414197_Borrelia_lipo_1 671520434 Borrelia_lipo_1-><-ParB-HTH*<-ParA<-DUF226 ParB-HTH SP+Plasmid_parti DY88_RS0103995 189 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. 696421322_Borrelia_lipo_1-><-671520434_ParB-HTH*<-696418599_ParA<-671478354_DUF226<-671478348_? 671556237 Borrelia_lipo_1-><-?<-Borrelia_lipo_1<-?<-?||ParA->ParB-HTH*-><-Borrelia_lipo_1||?-><-ERF ParB-HTH SP+Plasmid_parti DY90_RS0100165 189 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. 501710551_?->671556248_Borrelia_lipo_1-><-696410895_?<-696419824_Borrelia_lipo_1<-696419827_?<-671556244_?||696419830_ParA->671556237_ParB-HTH*-><-671556756_Borrelia_lipo_1||671556757_?-><-671556759_ERF<-696410899_?||671556761_?-> 576102351 <-BppA<-BdrA<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||ERF-><-orfD<-Borrelia_lipo_2 ParB-HTH Plasmid_parti BCD_1485 188 bacteria>spirochaetes Borrelia crocidurae DOU hypothetical protein BCD_1485 (plasmid) [Borrelia crocidurae DOU]. 
<-576102349_BppA<-576102350_BdrA<-576102351_ParB-HTH*<-576102352_ParA<-576102353_DUF226<-576102354_Borrelia_orfA||576102355_ERF-><-576102356_orfD<-576102357_Borrelia_lipo_2 576313055 <-BdrA<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-Borrelia_orfA||ERF-> ParB-HTH Plasmid_parti BDCR2A_01875 188 bacteria>spirochaetes Borrelia duttonii CR2A putative plasmid partition protein [Borrelia duttonii CR2A]. <-576313054_BdrA<-576313055_ParB-HTH*<-576313056_ParA<-576313057_DUF226<-576313058_Borrelia_orfA<-576313059_Borrelia_orfA||576313060_ERF-> 645010853 <-BdrA<-Lipoprotein_2||Mlp-><-XerD<-SSB||?->ParA->ParB-HTH*->?->Lipoprotein_2->Lipoprotein_2-><-Lipoprotein_2<-Lipoprotein_2 ParB-HTH Plasmid_parti BHO_RS05800 188 bacteria>spirochaetes Borrelia hermsii permease [Borrelia hermsii]. <-645010825_BdrA<-645010828_Lipoprotein_2||645010831_Mlp-><-645010834_XerD<-749299211_SSB||645010845_?->645010850_ParA->645010853_ParB-HTH*->645010859_?->749299212_Lipoprotein_2->749299213_Lipoprotein_2-><-645010867_Lipoprotein_2<-645010869_Lipoprotein_2 749307948 <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226<-BdrA||?->Borrelia_orfA-> ParB-HTH Plasmid_parti BCD_RS06730 188 bacteria>spirochaetes Borrelia crocidurae permease [Borrelia crocidurae]. <-644980757_Lipoprotein_2<-644980758_Lipoprotein_2<-644980759_Lipoprotein_2<-749307952_Lipoprotein_2<-749307948_ParB-HTH*<-749307950_ParA<-644980763_DUF226<-644980764_BdrA||644980765_?->749307954_Borrelia_orfA-> 654876319 <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2||?-><-ParB-HTH*<-ParA<-DUF226<-?<-Mlp<-Lipoprotein_2 ParB-HTH Plasmid_parti T431_RS0103865 187 bacteria>spirochaetes Borrelia coriaceae permease [Borrelia coriaceae]. 
<-645024031_Lipoprotein_2<-645024030_Lipoprotein_2<-740579181_Lipoprotein_2<-645024306_Lipoprotein_2<-740579183_Lipoprotein_2<-654876318_Lipoprotein_2||645024312_?-><-654876319_ParB-HTH*<-645024315_ParA<-654876320_DUF226<-645024321_?<-740579185_Mlp<-645024327_Lipoprotein_2 740582299 Mlp->Phage-integrase-><-XerD<-SSB<-SSB<-ParB-HTH*<-ParA<-DUF226<-?||ERF-> ParB-HTH Plasmid_parti BDCR2A_RS06845 187 bacteria>spirochaetes Borrelia duttonii permease [Borrelia duttonii]. <-740582282_?||740582285_Mlp->740582288_Phage-integrase-><-740582292_XerD<-740582315_SSB<-740582296_SSB<-740582299_ParB-HTH*<-740582304_ParA<-740582307_DUF226<-740582310_?||740582313_ERF-> 752506021 <-Lipoprotein_2<-Lipoprotein_2<-?<-?<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-?||?->Borrelia_orfA-> ParB-HTH Plasmid_parti BDU_RS06030 187 bacteria>spirochaetes Borrelia duttonii permease [Borrelia duttonii]. <-752506020_Lipoprotein_2<-752506028_Lipoprotein_2<-501533013_?<-501533014_?<-501533015_Lipoprotein_2<-501533016_Lipoprotein_2<-501533017_Lipoprotein_2<-752506021_ParB-HTH*<-501533019_ParA<-501533020_DUF226<-752506022_Borrelia_orfA<-752506023_?||752506024_?->752506029_Borrelia_orfA->752506030_?-> 763123871 <-XerD<-?<-?<-BppA||?->DUF226->ParA->ParB-HTH*->ERF->ERF-><-Borrelia_lipo_2<-BlyB-holin ParB-HTH Plasmid_parti BOM_RS05530 187 bacteria>spirochaetes Borrelia miyamotoi permease [Borrelia miyamotoi]. <-645073224_XerD<-645073225_?<-645073226_?<-763123878_BppA||645073228_?->645073229_DUF226->645073230_ParA->763123871_ParB-HTH*->763123880_ERF->645073134_ERF-><-763123873_Borrelia_lipo_2<-645073231_BlyB-holin<-645073136_?<-645073232_? 497943336 orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti NM71_RS05150 186 bacteria>spirochaetes Borrelia burgdorferi chromosome partitioning protein [Borrelia burgdorferi]. 
499186329_orfD->499186330_BdrA->499186331_Mlp-><-499186332_ERF||499186319_Borrelia_orfA->499186320_DUF226->499186321_ParA->497943336_ParB-HTH*->499186322_BdrA->499186323_BppA->499186293_XerD-><-499186324_Phage-integrase||499186325_?->499186184_?->497943302_?-> 500023248 orfD->BdrA-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA-> ParB-HTH Plasmid_parti BAFPKO_RS06545 186 bacteria>spirochaetes Borrelia afzelii chromosome partitioning protein [Borrelia afzelii]. 500023234_?->504299370_orfD->500023238_BdrA-><-500023243_ERF||500023245_Borrelia_orfA->500023246_DUF226->500023247_ParA->500023248_ParB-HTH*->500023249_BdrA-><-500023251_?<-500023252_?<-500023253_?||500023254_?->500023255_?->500023259_?-> 501533243 <-Borrelia_orfA||?-><-MultiTM<-ThiF<-ParB-HTH*<-ParA||Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2-> ParB-HTH Plasmid_parti BDU_RS05290 186 bacteria>spirochaetes Borrelia duttonii permease [Borrelia duttonii]. <-752505976_Borrelia_orfA||501533240_?-><-501533241_MultiTM<-501533242_ThiF<-501533243_ParB-HTH*<-501533244_ParA||501533245_Lipoprotein_2->501533246_Lipoprotein_2->752505975_Lipoprotein_2->501533247_Lipoprotein_2->501533249_Lipoprotein_2->501533250_Lipoprotein_2-> 501704211 BlyB-holin->Mlp->Borrelia_orfA->DUF226->ParA->ParB-HTH*-> ParB-HTH Plasmid_parti BGAPBR_RS05150 186 bacteria>spirochaetes Borrelia garinii chromosome partitioning protein [Borrelia garinii]. 696411817_?->501704216_?->501704255_BlyB-holin->501704201_Mlp->501704219_Borrelia_orfA->501704207_DUF226->501704227_ParA->501704211_ParB-HTH*->501704234_?-><-501704233_? 501710973 Mlp->DUF226->ParA->ParB-HTH*->XerD->?->BlyB-holin->BdrA->Mlp->Borrelia_orfA-> ParB-HTH Plasmid_parti BGAFAR04_RS04830 186 bacteria>spirochaetes Borrelia garinii chromosome partitioning protein [Borrelia garinii]. 
696414862_?->501710881_?->501710891_?->501710887_?->501710904_Mlp->501704207_DUF226->501710876_ParA->501710973_ParB-HTH*->501711020_XerD->501710905_?->696414864_BlyB-holin->501710896_BdrA->501711040_Mlp->501711010_Borrelia_orfA->696414865_?-> 504299060 Borrelia_lipo_2->orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti BAFPKO_RS04750 186 bacteria>spirochaetes Borrelia afzelii chromosome partitioning protein [Borrelia afzelii]. 504299052_Borrelia_lipo_2->504299053_orfD->504299054_BdrA->504299055_Mlp-><-504299056_ERF||504299058_Borrelia_orfA->504299059_ParA->504299060_ParB-HTH*->504299061_BdrA->504299062_BppA->504299063_XerD-><-501574879_Phage-integrase||504299064_?->504299065_?-><-504299066_? 504496143 <-Lipoprotein_2<-BdrA||?->?-><-ParB-HTH*<-ParA ParB-HTH Plasmid_parti Q7M_RS05445 186 bacteria>spirochaetes Borrelia crocidurae permease [Borrelia crocidurae]. <-504496141_Lipoprotein_2<-504496142_BdrA||752505150_?->752505151_?-><-504496143_ParB-HTH*<-504496144_ParA||752505152_?->752505153_?->752505154_?-> 504496216 <-Lipoprotein_2<-?<-BdrA<-?<-?<-MultiTM<-ThiF<-ParB-HTH*<-ParA||Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->?->METHYLASE-> ParB-HTH Plasmid_parti Q7M_RS05925 186 bacteria>spirochaetes Borrelia crocidurae permease [Borrelia crocidurae]. <-752505181_Lipoprotein_2<-752505182_?<-752505187_BdrA<-752505183_?<-752505188_?<-504496214_MultiTM<-504496215_ThiF<-504496216_ParB-HTH*<-504496217_ParA||504496218_Lipoprotein_2->752505184_Lipoprotein_2->752505189_Lipoprotein_2->752505190_Lipoprotein_2->504496225_?->752505191_METHYLASE-> 504509606 <-SSB<-SSB<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||ERF-><-orfD ParB-HTH Plasmid_parti BCD_RS07150 186 bacteria>spirochaetes Borrelia crocidurae chromosome partitioning protein [Borrelia crocidurae]. 
<-644980837_SSB<-644980838_SSB<-504509606_ParB-HTH*<-644980840_ParA<-644980841_DUF226<-644980842_Borrelia_orfA||749308014_ERF-><-644980844_orfD 639481943 <-BppA<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA ParB-HTH Plasmid_parti U880_RS0101780 186 bacteria>spirochaetes Borrelia hispanica permease [Borrelia hispanica]. <-740573056_BppA<-639481943_ParB-HTH*<-639481944_ParA<-639481945_DUF226<-639481946_Borrelia_orfA 639482644 <-Borrelia_orfA<-?<-MultiTM<-ThiF<-ParB-HTH*<-ParA<-?||Lipoprotein_2->Lipoprotein_2->?->Lipoprotein_2->Lipoprotein_2-> ParB-HTH Plasmid_parti U880_RS0105870 186 bacteria>spirochaetes Borrelia hispanica permease [Borrelia hispanica]. <-740573640_Borrelia_orfA<-639482641_?<-639482642_MultiTM<-740573637_ThiF<-639482644_ParB-HTH*<-639482645_ParA<-639482646_?||639482647_Lipoprotein_2->740573643_Lipoprotein_2->639482648_?->639482649_Lipoprotein_2->740573646_Lipoprotein_2-> 644980530 <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-?||?->ParA->ParB-HTH*-> ParB-HTH Plasmid_parti BCD_RS05395 186 bacteria>spirochaetes Borrelia crocidurae permease [Borrelia crocidurae]. <-644980522_?<-749307822_Lipoprotein_2<-644980525_Lipoprotein_2<-749307824_Lipoprotein_2<-644980527_?||644980528_?->644980529_ParA->644980530_ParB-HTH*-> 657235047 <-ParB-HTH*<-ParA<-DUF226 ParB-HTH Plasmid_parti DZ03_RS0105875 186 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. <-657235047_ParB-HTH*<-657235048_ParA<-657235049_DUF226 671563339 <-BdrA<-ParB-HTH*<-ParA ParB-HTH Plasmid_parti DZ19_RS0105860 186 bacteria>spirochaetes Borrelia burgdorferi permease [Borrelia burgdorferi]. <-501600498_BdrA<-671563339_ParB-HTH*<-671563341_ParA 740582201 <-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||ERF-> ParB-HTH Plasmid_parti BDCR2A_RS06670 186 bacteria>spirochaetes Borrelia duttonii permease [Borrelia duttonii]. 
<-740582201_ParB-HTH*<-740582204_ParA<-740582206_DUF226<-740582209_Borrelia_orfA||740582211_ERF-> 499186196 orfD-><-?||Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti NM71_RS05580 185 bacteria>spirochaetes Borrelia burgdorferi chromosome partitioning protein [Borrelia burgdorferi]. 497944293_orfD-><-499186190_?||499186191_Mlp-><-499186192_ERF||499186193_Borrelia_orfA->499186194_DUF226->499186195_ParA->499186196_ParB-HTH*->499186197_BdrA->499186198_BppA->499186199_XerD-><-499186200_Phage-integrase||499186325_?->499186184_?->763427928_?-> 501574765 orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti BAFPKO_RS04960 185 bacteria>spirochaetes Borrelia afzelii chromosome partitioning protein [Borrelia afzelii]. 504299145_orfD->504299146_BdrA->504299147_Mlp-><-504299148_ERF||504299149_Borrelia_orfA->504299150_DUF226->504299151_ParA->501574765_ParB-HTH*->504299152_BdrA->504299153_BppA->504299154_XerD-><-504299155_Phage-integrase||504299156_?-><-504299157_?||504299158_?-> 503783569 BlyB-holin->Borrelia_lipo_2->orfD->Mlp->Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti BBIDN127_RS05685 185 bacteria>spirochaetes Borrelia bissettii permease [Borrelia bissettii]. 503783561_BlyB-holin->503783467_Borrelia_lipo_2->503783562_orfD->503783563_Mlp->503783566_Borrelia_orfA->503783567_DUF226->503783568_ParA->503783569_ParB-HTH*->503783570_BdrA->503783571_BppA->503783572_XerD-><-503783573_Phage-integrase||503783574_?->503783575_?-><-763175365_? 576103756 orfD-><-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||BppA->?->?->XerD-> ParB-HTH Plasmid_parti BOM_0964 185 bacteria>spirochaetes Borrelia miyamotoi FR64b Plasmid partition family protein (plasmid) [Borrelia miyamotoi FR64b]. 
576103754_orfD-><-576103755_ERF<-576103756_ParB-HTH*<-576103757_ParA<-576103758_DUF226<-576103759_Borrelia_orfA||576103760_BppA->576103761_?->576103762_?->576103763_XerD-> 657235558 DUF226->ParA->ParB-HTH*-> ParB-HTH Plasmid_parti DY94_RS0102615 185 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. 696420534_?->657235554_DUF226->657235555_ParA->657235558_ParB-HTH*-> 695263564 ParA->ParB-HTH*-> ParB-HTH Plasmid_parti PF-49 185 bacteria>spirochaetes Borrelia burgdorferi PF-49 protein [Borrelia burgdorferi]. 695208921_?->695208922_ParA->695263564_ParB-HTH*->695208924_?-> 696415789 <-XerD<-BppA<-BdrA<-ParB-HTH*<-ParA<-Borrelia_orfA||ERF-><-Mlp<-orfD<-BlyB-holin ParB-HTH Plasmid_parti DM10_RS00505 185 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. <-696415784_?||696411761_?-><-696415785_?<-696415786_XerD<-696415787_BppA<-696415788_BdrA<-696415789_ParB-HTH*<-696415790_ParA<-696415791_Borrelia_orfA||696415792_ERF-><-696415793_Mlp<-696415794_orfD<-696415795_BlyB-holin<-696415796_? 499186290 orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA-><-?||BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti BB_O33 184 bacteria>spirochaetes Borrelia burgdorferi chromosome partitioning protein [Borrelia burgdorferi]. 11497260_orfD->11497261_BdrA->11497262_Mlp-><-11497280_ERF||11497237_Borrelia_orfA->11497238_DUF226->11497239_ParA->499186290_ParB-HTH*->11497241_BdrA-><-11497263_?||11497242_BppA->11497243_XerD-><-11497244_Phage-integrase||11497245_?->11497246_?-> 740577610 <-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA ParB-HTH SP+Plasmid_parti U881_RS0102215 184 bacteria>spirochaetes Borrelia persica permease [Borrelia persica]. <-639480299_ERF<-740577610_ParB-HTH*<-639480301_ParA<-639480302_DUF226<-639480303_Borrelia_orfA 764988637 DUF226->ParA->ParB-HTH*-><-?||Lipoprotein_2-> ParB-HTH Plasmid_parti I871_RS04285 184 bacteria>spirochaetes Borrelia miyamotoi chromosome partitioning protein [Borrelia miyamotoi]. 
764988629_?->764988630_?-><-764988631_?<-764988632_?<-764988633_?||764988634_DUF226->764988636_ParA->764988637_ParB-HTH*-><-764988642_?||764988638_Lipoprotein_2-> 201084318 <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon||?-><-orfD<-BlyB-holin ParB-HTH Plasmid_parti BDU_1115 183 bacteria>spirochaetes Borrelia duttonii Ly PF49 plasmid partition protein (plasmid) [Borrelia duttonii Ly]. <-201084311_?<-201084312_?||201084313_?->201084314_?->201084315_?-><-201084316_Terminase_LS<-201084317_?<-201084318_ParB-HTH*<-201084319_ParA<-201084320_DUF226<-201084321_Borrelia_orfA<-201084322_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon||201084323_?-><-201084324_orfD<-201084325_BlyB-holin 499192746 DUF226->ParA->ParB-HTH*-> ParB-HTH Plasmid_parti NM71_RS06515 183 bacteria>spirochaetes Borrelia burgdorferi chromosome partitioning protein [Borrelia burgdorferi]. 499192760_?->499192743_?->501902608_?->499192744_DUF226->499192745_ParA->499192746_ParB-HTH*-><-763427949_?<-497942530_?<-499192761_?<-497942522_?<-497942517_?<-499192762_?<-497945500_? 639482723 <-SSB<-SSB||?->DUF226->ParA->ParB-HTH*->ERF-> ParB-HTH Plasmid_parti U880_RS0106340 183 bacteria>spirochaetes Borrelia hispanica permease [Borrelia hispanica]. <-639482718_SSB<-639482719_SSB||639482720_?->639482721_DUF226->639482722_ParA->639482723_ParB-HTH*->740573743_ERF-> 644981026 <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2||?-><-BdrA<-Terminase_LS||?-><-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-Mlp<-Phage-integrase||HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-Mlp ParB-HTH Plasmid_parti BCD_RS08230 183 bacteria>spirochaetes Borrelia crocidurae permease [Borrelia crocidurae]. 
<-644981019_Lipoprotein_2<-749308080_Lipoprotein_2<-644981020_Lipoprotein_2||644981021_?-><-749308079_BdrA<-644981023_Terminase_LS||749308081_?-><-644981026_ParB-HTH*<-644981027_ParA<-644981028_DUF226<-644981029_Borrelia_orfA<-644981030_Mlp<-644981031_Phage-integrase||644981032_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-749308082_Mlp 740581624 Mlp-><-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||XerD-><-Mlp ParB-HTH Plasmid_parti BDCR2A_RS05505 183 bacteria>spirochaetes Borrelia duttonii permease [Borrelia duttonii]. 740581621_Mlp-><-740581624_ParB-HTH*<-644981027_ParA<-740581626_DUF226<-740581629_Borrelia_orfA||740581631_XerD-><-740581634_Mlp 741043351 Borrelia_orfA->DUF226->ParA->ParB-HTH*-> ParB-HTH Plasmid_parti OY14_04355 182 bacteria>spirochaetes Borrelia chilensis chromosome partitioning protein (plasmid) [Borrelia chilensis]. 741043344_?->741043345_?-><-741043346_?||741043347_?->741043348_Borrelia_orfA->741043349_DUF226->741043350_ParA->741043351_ParB-HTH*-><-741043352_?<-741043353_? 145652250 ParA->ParB-HTH*->?->?->?->Terminase_LS-> ParB-HTH Plasmid_parti ABP88177.1 181 bacteria>spirochaetes Borrelia lonestari hypothetical protein [Borrelia lonestari]. 145652251_ParA->145652250_ParB-HTH*->145652252_?->145652253_?->145652254_?->145652255_Terminase_LS-> 504496098 <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin ParB-HTH Plasmid_parti Q7M_RS05150 181 bacteria>spirochaetes Borrelia crocidurae permease [Borrelia crocidurae]. <-504496090_?<-504496091_?<-504496092_?||504496093_?->504496094_?-><-504496096_Terminase_LS<-504496097_?<-504496098_ParB-HTH*<-501532856_ParA<-501532978_DUF226<-504496099_Borrelia_orfA<-504496100_orfD<-504496101_BlyB-holin<-504496102_?<-504496103_? 504509673 <-BppA<-BdrA<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||ERF-><-orfD ParB-HTH Plasmid_parti BCD_RS06770 181 bacteria>spirochaetes Borrelia crocidurae chromosome partitioning protein [Borrelia crocidurae]. 
<-644980768_BppA<-644980769_BdrA<-504509673_ParB-HTH*<-644980770_ParA<-644980771_DUF226<-644980772_Borrelia_orfA||644980773_ERF-><-749307961_orfD 519700232 <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin ParB-HTH Plasmid_parti BTA100 181 bacteria>spirochaetes Borrelia turicatae hypothetical protein [Borrelia turicatae]. <-541862209_?||541862210_?->541862211_?->541862212_?-><-541862213_?<-541862214_Terminase_LS<-541862215_?<-519700232_ParB-HTH*<-541862217_ParA<-541862218_DUF226<-541862219_Borrelia_orfA<-541862220_orfD<-541862221_BlyB-holin<-541862222_?<-541862223_? 639481996 <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin ParB-HTH Plasmid_parti U880_RS0102040 181 bacteria>spirochaetes Borrelia hispanica permease [Borrelia hispanica]. <-639481989_?<-639481990_?||639481991_?->639481992_?->639481993_?-><-639481994_Terminase_LS<-639481995_?<-639481996_ParB-HTH*<-639481997_ParA<-639481998_DUF226<-639481999_Borrelia_orfA<-639482000_orfD<-639482001_BlyB-holin<-639482002_?<-639482003_? 639482094 Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA-> ParB-HTH SP+Plasmid_parti U880_RS0102635 181 bacteria>spirochaetes Borrelia hispanica permease [Borrelia hispanica]. 639482091_Borrelia_orfA->639482092_DUF226->639482093_ParA->639482094_ParB-HTH*->639482095_BdrA->740573199_BppA-> 644922901 <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin ParB-HTH Plasmid_parti X966_RS04750 181 bacteria>spirochaetes Borrelia parkeri permease [Borrelia parkeri]. <-749302062_?<-644922885_?||644922887_?->644922891_?-><-644922894_?<-644922896_Terminase_LS<-644922898_?<-644922901_ParB-HTH*<-644922903_ParA<-519700238_DUF226<-749302063_Borrelia_orfA<-749302064_orfD<-644922913_BlyB-holin<-644922915_?<-644922917_? 644979602 <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin ParB-HTH Plasmid_parti BHW_RS05475 181 bacteria>spirochaetes Borrelia hermsii permease [Borrelia hermsii]. 
<-752506919_?||644979596_?->644979597_?-><-644979598_?||644979599_?-><-644979600_Terminase_LS<-644979601_?<-644979602_ParB-HTH*<-644979603_ParA<-644979604_DUF226<-644979605_Borrelia_orfA<-644979606_orfD<-644979607_BlyB-holin<-644979608_?<-644979609_? 644980614 <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin ParB-HTH Plasmid_parti BDCR2A_RS04950 181 bacteria>spirochaetes Borrelia MULTISPECIES: permease [Borrelia]. <-740581381_?<-740581385_?||740581388_?->740581391_?->740581394_?-><-740581398_Terminase_LS<-740581400_?<-644980614_ParB-HTH*<-740581403_ParA<-501532978_DUF226<-740581405_Borrelia_orfA<-740581407_orfD<-740581409_BlyB-holin<-504496102_?<-740581412_? 645023282 <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin ParB-HTH Plasmid_parti T431_RS0103125 181 bacteria>spirochaetes Borrelia coriaceae permease [Borrelia coriaceae]. <-645023301_?<-740579078_?<-645023296_?||645023293_?-><-645023290_?<-645023287_Terminase_LS<-645023284_?<-645023282_ParB-HTH*<-645023279_ParA<-645023274_DUF226<-740579080_Borrelia_orfA<-645023268_orfD<-645023266_BlyB-holin<-645023264_?<-645023261_? 645024774 <-XerD<-?<-SSB<-SSB||Borrelia_orfA->DUF226->ParB-HTH*->ERF-> ParB-HTH Plasmid_parti BCO_RS07330 181 bacteria>spirochaetes Borrelia coriaceae permease [Borrelia coriaceae]. <-645024768_?<-645023953_XerD<-645024769_?<-752507031_SSB<-645024771_SSB||645024772_Borrelia_orfA->645024773_DUF226->645024774_ParB-HTH*->645024775_ERF-> 645048715 BlyB-holin->orfD->Borrelia_orfA->DUF226->ParA->ParB-HTH*->?->Terminase_LS-> ParB-HTH Plasmid_parti BAN_RS04870 181 bacteria>spirochaetes Borrelia anserina permease [Borrelia anserina]. 
645048709_?->752506871_?->752506872_BlyB-holin->752506873_orfD->752506876_Borrelia_orfA->645048713_DUF226->645048714_ParA->645048715_ParB-HTH*->645048716_?->645048717_Terminase_LS-><-645048718_?<-645048719_?||645048720_?->645048721_?->645048722_?-> 645062976 <-Terminase_LS<-?<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA<-orfD<-BlyB-holin ParB-HTH Plasmid_parti BHY_RS05115 181 bacteria>spirochaetes Borrelia hermsii permease [Borrelia hermsii]. <-749302082_?||645062973_?->645062974_?-><-644979598_?||644979599_?-><-645062975_Terminase_LS<-644979601_?<-645062976_ParB-HTH*<-644979603_ParA<-644979604_DUF226<-645062977_Borrelia_orfA<-644979606_orfD<-644979607_BlyB-holin<-644979608_?<-644979609_? 645073074 <-Terminase_LS<-?<-ParB-HTH*<-ParA<-Borrelia_orfA<-orfD<-BlyB-holin ParB-HTH Plasmid_parti BOM_RS04520 181 bacteria>spirochaetes Borrelia miyamotoi permease [Borrelia miyamotoi]. <-645073067_?<-763123742_?<-763123715_?||645073070_?->645073071_?-><-645073072_Terminase_LS<-645073073_?<-645073074_ParB-HTH*<-645073075_ParA<-763123743_Borrelia_orfA<-645073077_orfD<-645073078_BlyB-holin<-645073079_?<-645073080_?<-645073081_? 740582639 <-BdrA<-ParB-HTH*<-ParA<-DUF226||ERF-> ParB-HTH Plasmid_parti BDCR2A_RS07425 181 bacteria>spirochaetes Borrelia duttonii permease [Borrelia duttonii]. <-740582637_BdrA<-740582639_ParB-HTH*<-740582641_ParA<-740582643_DUF226||740582645_ERF-> 501588721 BdrA->Mlp-><-ERF||?->Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti NM71_RS06225 180 bacteria>spirochaetes Borrelia burgdorferi permease [Borrelia burgdorferi]. 
499186212_BdrA->499186213_Mlp-><-499186214_ERF||742499587_?->499186215_Borrelia_orfA->499186216_DUF226->499186217_ParA->501588721_ParB-HTH*->499186218_BdrA->499186219_BppA->499186220_XerD-><-499186221_Phage-integrase||499186222_?->499186223_?->501895123_?-> 696413767 <-ParB-HTH*<-ParA<-DUF226 ParB-HTH Plasmid_parti DZ02_RS01000000108520 180 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. <-696413767_ParB-HTH*<-671547733_ParA<-671547734_DUF226 639480295 <-ERF<-ParB-HTH*<-ParA<-DUF226 ParB-HTH Plasmid_parti U881_RS0102195 179 bacteria>spirochaetes Borrelia persica permease [Borrelia persica]. <-639480294_ERF<-639480295_ParB-HTH*<-639480296_ParA<-639480297_DUF226 740577787 <-Lipoprotein_2||?-><-MultiTM<-?||?->DUF226->ParA->ParB-HTH*-> ParB-HTH Plasmid_parti U881_RS0103180 179 bacteria>spirochaetes Borrelia persica permease, partial [Borrelia persica]. <-639480427_Lipoprotein_2||639480428_?-><-639480429_MultiTM<-639480430_?||639480431_?->639480432_DUF226->639480433_ParA->740577787_ParB-HTH*-> 639480863 <-ERF<-ParB-HTH*<-ParA<-DUF226<-?||BppA-> ParB-HTH Plasmid_parti U881_RS0105670 178 bacteria>spirochaetes Borrelia persica permease [Borrelia persica]. <-639480862_ERF<-639480863_ParB-HTH*<-639480864_ParA<-639480865_DUF226<-639480866_?||639480867_BppA-> 645024479 <-MultiTM<-ThiF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2->Borrelia_orfA-> ParB-HTH Plasmid_parti T431_RS0104570 178 bacteria>spirochaetes Borrelia coriaceae permease [Borrelia coriaceae]. 
740579281_?-><-645024494_MultiTM<-645024491_ThiF||654876359_Borrelia_orfA->645024484_DUF226->645024481_ParA->645024479_ParB-HTH*->645024476_Lipoprotein_2->740579290_Lipoprotein_2->740579284_Lipoprotein_2->645024470_Lipoprotein_2->740579287_Lipoprotein_2->740579293_Lipoprotein_2->654876362_Borrelia_orfA-> 576102548 Borrelia_lipo_2->orfD-><-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||BppA->BppA->SSB->XerD-> ParB-HTH Plasmid_parti BCD_1669 177 bacteria>spirochaetes Borrelia crocidurae DOU Putative plasmid partition protein (plasmid) [Borrelia crocidurae DOU]. <-576102541_?<-576102542_?<-576102543_?||576102544_?->576102545_Borrelia_lipo_2->576102546_orfD-><-576102547_ERF<-576102548_ParB-HTH*<-576102549_ParA<-576102550_DUF226<-576102551_Borrelia_orfA||576102552_BppA->576102553_BppA->576102554_SSB->576102555_XerD-> 644979720 <-Lipoprotein_2<-BdrA||?-><-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA ParB-HTH Plasmid_parti BHW_RS06110 177 bacteria>spirochaetes Borrelia hermsii permease [Borrelia hermsii]. <-644979715_Lipoprotein_2<-749302125_BdrA||644979718_?-><-644979720_ParB-HTH*<-644979721_ParA<-644979722_DUF226<-644979723_Borrelia_orfA 645010591 DUF226->ParA->ParB-HTH*->BdrA-><-Lipoprotein_2<-Lipoprotein_2<-?<-Lipoprotein_2<-BdrA<-Terminase_LS ParB-HTH Plasmid_parti BHO_RS05255 177 bacteria>spirochaetes Borrelia hermsii permease [Borrelia hermsii]. 645010587_DUF226->645010590_ParA->645010591_ParB-HTH*->645010594_BdrA-><-645010596_Lipoprotein_2<-645010599_Lipoprotein_2<-645010602_?<-645010609_Lipoprotein_2<-645010612_BdrA<-645010615_Terminase_LS 644979647 <-ParB-HTH*<-ParA<-DUF226<-Lipoprotein_2<-BdrA ParB-HTH Plasmid_parti BHY_RS05885 176 bacteria>spirochaetes Borrelia hermsii permease [Borrelia hermsii]. 
<-644979648_?<-644979647_ParB-HTH*<-749302115_ParA<-645063075_DUF226<-749302116_Lipoprotein_2<-645063078_BdrA 645010701 <-ERF<-BdrA<-ParB-HTH*<-ParA<-DUF226<-?||BppA-> ParB-HTH SP+Plasmid_parti BHO_RS05480 176 bacteria>spirochaetes Borrelia hermsii permease [Borrelia hermsii]. <-645010695_ERF<-645010698_BdrA<-645010701_ParB-HTH*<-645010704_ParA<-645010707_DUF226<-645010709_?||645010711_BppA-> 645023888 <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||BdrA-><-Lipoprotein_2<-BdrA||Mlp-> ParB-HTH Plasmid_parti T431_RS0103305 176 bacteria>spirochaetes Borrelia coriaceae permease [Borrelia coriaceae]. <-654876291_Lipoprotein_2<-740579113_Lipoprotein_2<-654876292_Lipoprotein_2<-740579100_Lipoprotein_2<-740579103_Lipoprotein_2<-740579116_Lipoprotein_2<-645023885_Lipoprotein_2<-645023888_ParB-HTH*<-645023890_ParA<-654876296_DUF226<-645023892_Borrelia_orfA||645023893_BdrA-><-645023896_Lipoprotein_2<-645023900_BdrA||645023901_Mlp-> 645063163 <-BdrA<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA ParB-HTH Plasmid_parti BHY_RS06640 176 bacteria>spirochaetes Borrelia hermsii chromosome partitioning protein [Borrelia hermsii]. <-645063162_BdrA<-645063163_ParB-HTH*<-644979703_ParA<-645063164_DUF226<-645063165_Borrelia_orfA 763123770 orfD-><-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||BppA->?->?->XerD-> ParB-HTH Plasmid_parti BOM_RS04635 176 bacteria>spirochaetes Borrelia miyamotoi hypothetical protein [Borrelia miyamotoi]. 645073095_orfD-><-763123769_ERF<-763123770_ParB-HTH*<-645073098_ParA<-645073099_DUF226<-645073100_Borrelia_orfA||763123772_BppA->645073102_?->645073103_?->763123766_XerD-> 497943851 <-BdrA<-ParB-HTH*<-ParA ParB-HTH Plasmid_parti DZ10_RS0105695 175 bacteria>spirochaetes Borrelia burgdorferi chromosome partitioning protein [Borrelia burgdorferi]. 
<-671559758_BdrA<-497943851_ParB-HTH*<-671579798_ParA 501704894 orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti BBUJD1_RS00885 175 bacteria>spirochaetes Borrelia burgdorferi chromosome partitioning protein [Borrelia burgdorferi]. 488735288_orfD->499186212_BdrA->501883602_Mlp-><-504352924_ERF||504352925_Borrelia_orfA->504352926_DUF226->488735209_ParA->501704894_ParB-HTH*->504352927_BdrA->763417052_BppA->488735217_XerD-><-497943318_Phage-integrase||504352928_?->501704944_?->499186310_?-> 501928340 orfD-><-?||Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA-><-Phage-integrase||?->?->Terminase_LS-> ParB-HTH Plasmid_parti BSV1_RS06320 175 bacteria>spirochaetes Borrelia finlandensis chromosome partitioning protein [Borrelia finlandensis]. 748691687_orfD-><-748691689_?||501928371_Mlp-><-748691701_ERF||501928405_Borrelia_orfA->501928365_DUF226->501928300_ParA->501928340_ParB-HTH*->501928369_BdrA-><-501928327_Phage-integrase||501928367_?->748691703_?->501928390_Terminase_LS->501928324_?->748691705_?-> 503783476 Borrelia_lipo_2->orfD->BdrA->Mlp->Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti BBIDN127_RS04525 175 bacteria>spirochaetes Borrelia bissettii chromosome partitioning protein [Borrelia bissettii]. 503783467_Borrelia_lipo_2->503783468_orfD->503783469_BdrA->503783470_Mlp->503783473_Borrelia_orfA->503783474_DUF226->503783475_ParA->503783476_ParB-HTH*->503783477_BdrA->503783478_BppA->503783479_XerD-><-503783480_Phage-integrase||503783486_?-> 504299035 Borrelia_lipo_2->orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti BAFPKO_RS04540 175 bacteria>spirochaetes Borrelia afzelii chromosome partitioning protein [Borrelia afzelii]. 
504299028_Borrelia_lipo_2->504299029_orfD->504299030_BdrA->504299031_Mlp-><-504299032_ERF||504299033_Borrelia_orfA->504299034_ParA->504299035_ParB-HTH*->504299036_BdrA->504299037_BppA->504299038_XerD-><-504299039_Phage-integrase||504299040_?->763173059_?->763173060_?-> 226232418 <-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti BBUBOL26_W05 174 bacteria>spirochaetes Borrelia burgdorferi Bol26 putative plasmid partition protein (plasmid) [Borrelia burgdorferi Bol26]. <-226232414_ERF||226232415_Borrelia_orfA->226232416_DUF226->226232417_ParA->226232418_ParB-HTH*->226232419_BdrA->226232420_BppA->226232421_XerD-><-226232422_Phage-integrase||226232423_?->226232424_?->226232425_?-> 493479353 orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti BAFPKO_RS05375 174 bacteria>spirochaetes Borrelia burgdorferi group MULTISPECIES: chromosome partitioning protein [Borrelia burgdorferi group]. 493479326_orfD->504299104_BdrA->504299105_Mlp-><-504299106_ERF||504299107_Borrelia_orfA->493479359_DUF226->501574801_ParA->493479353_ParB-HTH*->501574776_BdrA->504299109_BppA->504299110_XerD-><-504299111_Phage-integrase||504299112_?->504299113_?->763173064_?-> 501531293 <-Borrelia_orfA||HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->?->?->Borrelia_orfA->DUF226->ParA->ParB-HTH*->Lipoprotein_2->Lipoprotein_2->Lipoprotein_2-> ParB-HTH Plasmid_parti BDU_RS04395 174 bacteria>spirochaetes Borrelia duttonii chromosome partitioning protein [Borrelia duttonii]. 
<-752505939_Borrelia_orfA||501531286_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->501531287_?->501531288_?->752505936_Borrelia_orfA->501531291_DUF226->501531292_ParA->501531293_ParB-HTH*->501531294_Lipoprotein_2->752505940_Lipoprotein_2->752505937_Lipoprotein_2-> 501894859 BlyB-holin->Mlp->Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti BVAVS116_RS06000 174 bacteria>spirochaetes Borrelia valaisiana chromosome partitioning protein [Borrelia valaisiana]. 501894843_?->501894848_?->501894849_BlyB-holin->501894852_Mlp->501894856_Borrelia_orfA->501894857_DUF226->501894858_ParA->501894859_ParB-HTH*->501894860_BdrA->501894861_BppA->501894862_XerD-><-501894863_Phage-integrase||501894864_?->501894865_?-><-750014224_? 503783725 orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti BBIDN127_RS05150 174 bacteria>spirochaetes Borrelia bissettii chromosome partitioning protein [Borrelia bissettii]. 503783718_orfD->503783719_BdrA->503783720_Mlp-><-503783721_ERF||503783723_Borrelia_orfA->497943782_DUF226->503783724_ParA->503783725_ParB-HTH*->503783726_BdrA->503783727_BppA->503783728_XerD-><-503783729_Phage-integrase||503783730_?->503783731_?->503783732_?-> 504499910 <-Lipoprotein_2<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||BdrA->HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-Mlp||BdrA-> ParB-HTH Plasmid_parti BCD_RS05470 174 bacteria>spirochaetes Borrelia crocidurae chromosome partitioning protein [Borrelia crocidurae]. 
<-749307847_Lipoprotein_2<-504499910_ParB-HTH*<-644980543_ParA<-644980544_DUF226<-644980545_Borrelia_orfA||644980546_BdrA->644980547_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-644980548_Mlp||644980550_BdrA-> 504509579 <-Lipoprotein_2<-Lipoprotein_2<-?<-?||Borrelia_lipo_2->orfD-><-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||SSB->SSB-> ParB-HTH Plasmid_parti BCD_RS07490 174 bacteria>spirochaetes Borrelia crocidurae chromosome partitioning protein [Borrelia crocidurae]. <-644980895_Lipoprotein_2<-749308030_Lipoprotein_2<-644980897_?<-644980898_?||749308035_Borrelia_lipo_2->749308031_orfD-><-749308036_ERF<-504509579_ParB-HTH*<-644980901_ParA<-504509577_DUF226<-749308032_Borrelia_orfA||644980902_SSB->749308037_SSB-> 644979702 <-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA ParB-HTH Plasmid_parti BHW_RS06025 174 bacteria>spirochaetes Borrelia hermsii chromosome partitioning protein, partial [Borrelia hermsii]. <-644979702_ParB-HTH*<-644979703_ParA<-644979704_DUF226<-644979705_Borrelia_orfA 657248267 <-XerD<-BppA<-ParB-HTH*<-ParA||ERF-><-BdrA<-orfD<-Borrelia_lipo_2<-BlyB-holin ParB-HTH Plasmid_parti DY95_RS0102845 174 bacteria>spirochaetes Borrelia garinii chromosome partitioning protein [Borrelia garinii]. <-671612253_?<-501704835_XerD<-501704830_BppA<-657248267_ParB-HTH*<-501704490_ParA||501704847_ERF-><-501704831_BdrA<-501704842_orfD<-501704499_Borrelia_lipo_2<-501704839_BlyB-holin 740573754 <-ERF<-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||BppA-> ParB-HTH Plasmid_parti U880_RS10330 174 bacteria>spirochaetes Borrelia hispanica permease [Borrelia hispanica]. <-639482730_ERF<-740573754_ParB-HTH*<-639482731_ParA<-639482732_DUF226<-639482733_Borrelia_orfA||639482734_BppA-> 501704326 orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA-><-?||BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti NM71_RS05360 173 bacteria>spirochaetes Borrelia burgdorferi group MULTISPECIES: chromosome partitioning protein [Borrelia burgdorferi group]. 
499186329_orfD->501883588_BdrA->499186307_Mlp-><-763427922_ERF||499186287_Borrelia_orfA->499186288_DUF226->499186289_ParA->501704326_ParB-HTH*->499186291_BdrA-><-497945423_?||501883650_BppA->499186293_XerD-><-488735128_Phage-integrase||499186294_?->499186295_?-> 501894927 Phage-integrase-><-BdrA<-ParB-HTH*<-ParA<-Borrelia_orfA||Borrelia_orfA->DUF226->ParA->ParB-HTH->XerD-> ParB-HTH Plasmid_parti BVAVS116_RS04800 173 bacteria>spirochaetes Borrelia valaisiana chromosome partitioning protein [Borrelia valaisiana]. 750014136_?-><-501894913_?<-750014137_?<-750014139_?||750014140_Phage-integrase-><-501894926_BdrA<-501894927_ParB-HTH*<-501894928_ParA<-750014142_Borrelia_orfA||750014144_Borrelia_orfA->501894942_DUF226->501894943_ParA->501894944_ParB-HTH->501894949_XerD-> 501894944 <-BdrA<-ParB-HTH<-ParA<-Borrelia_orfA||Borrelia_orfA->DUF226->ParA->ParB-HTH*->XerD-> ParB-HTH Plasmid_parti BVAVS116_RS04865 173 bacteria>spirochaetes Borrelia valaisiana chromosome partitioning protein [Borrelia valaisiana]. <-501894926_BdrA<-501894927_ParB-HTH<-501894928_ParA<-750014142_Borrelia_orfA||750014144_Borrelia_orfA->501894942_DUF226->501894943_ParA->501894944_ParB-HTH*->501894949_XerD->501894951_?->501894952_?-> 503783755 BdrA->?->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti BBIDN127_RS05560 173 bacteria>spirochaetes Borrelia bissettii chromosome partitioning protein [Borrelia bissettii]. 503783749_BdrA->503783750_?->503783751_Mlp-><-503783752_ERF||503783753_Borrelia_orfA->497944496_DUF226->503783754_ParA->503783755_ParB-HTH*->503783756_BdrA->503783518_XerD-><-503783759_Phage-integrase||503783760_?-><-763175361_?<-503783762_?||503783763_?-> 504299173 BdrA->Mlp->Mlp-><-ERF||Borrelia_orfA->DUF226->ParA->ParB-HTH*->BdrA->BppA->XerD-><-Phage-integrase||?->?->Terminase_LS-> ParB-HTH Plasmid_parti BAFPKO_RS05550 173 bacteria>spirochaetes Borrelia afzelii chromosome partitioning protein [Borrelia afzelii]. 
501574871_BdrA->504299167_Mlp->504299168_Mlp-><-504299169_ERF||504299170_Borrelia_orfA->504299171_DUF226->504299172_ParA->504299173_ParB-HTH*->504299174_BdrA->504299175_BppA->504299110_XerD-><-499919955_Phage-integrase||504299176_?->504299177_?->504299179_Terminase_LS-> 671481046 ParA->ParB-HTH*->BdrA-> ParB-HTH Plasmid_parti DY90_RS0105415 173 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. 671557949_ParA->671481046_ParB-HTH*->671481040_BdrA-> 696415807 <-Terminase_LS<-?<-?<-?||Phage-integrase-><-ParB-HTH*<-ParA<-DUF226 ParB-HTH Plasmid_parti DM10_RS00395 173 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. <-696415800_?<-696415801_?<-696415802_Terminase_LS<-696415803_?<-696415804_?<-696415805_?||696415806_Phage-integrase-><-696415807_ParB-HTH*<-696415808_ParA<-696415809_DUF226<-696415810_? 504495970 <-Borrelia_orfA||?->?->ParA->ParB-HTH*-> ParB-HTH - Q7M_RS04450 172 bacteria>spirochaetes Borrelia crocidurae permease [Borrelia crocidurae]. 752505117_?-><-501533315_Borrelia_orfA||504495968_?->504495969_?->501533314_ParA->504495970_ParB-HTH*-> 639480210 Borrelia_orfA->DUF226->ParA->ParB-HTH*->ERF-> ParB-HTH Plasmid_parti U881_RS0101540 172 bacteria>spirochaetes Borrelia persica permease [Borrelia persica]. 639480207_Borrelia_orfA->639480208_DUF226->639480209_ParA->639480210_ParB-HTH*->639480211_ERF-> 645024919 <-ParB-HTH*<-ParA<-?<-?||Borrelia_orfA-> ParB-HTH - BCO_RS07580 172 bacteria>spirochaetes Borrelia coriaceae permease [Borrelia coriaceae]. <-645024919_ParB-HTH*<-645024927_ParA<-645024928_?<-645024931_?||752507041_Borrelia_orfA-><-752507042_? 501533313 <-ParB-HTH*<-ParA<-?<-?||Borrelia_orfA-> ParB-HTH - BRE_RS05790 171 bacteria>spirochaetes Borrelia recurrentis permease [Borrelia recurrentis]. <-501533313_ParB-HTH*<-501533314_ParA<-504495969_?<-504495968_?||501533315_Borrelia_orfA-><-752506402_?<-752506403_? 
639480287 <-Borrelia_orfA||?->?->ParA->ParB-HTH*-><-ParB-HTH*<-ParA<-?<-?||Borrelia_orfA-> ParB-HTH - U881_RS0102010 171 bacteria>spirochaetes Borrelia persica permease [Borrelia persica]. 639480281_?-><-639480282_?<-639480283_Borrelia_orfA||639480284_?->639480285_?->639480286_ParA->639480287_ParB-HTH*-><-639480287_ParB-HTH*<-639480286_ParA<-639480285_?<-639480284_?||639480283_Borrelia_orfA->639480282_?-><-639480281_? 639481918 Borrelia_orfA-><-?||?-><-Borrelia_orfA||?->?->ParA->ParB-HTH*-> ParB-HTH - U880_RS0101625 171 bacteria>spirochaetes Borrelia hispanica permease [Borrelia hispanica]. 639481921_Borrelia_orfA-><-639481922_?||639481922_?-><-639481921_Borrelia_orfA||504495968_?->639481920_?->639481919_ParA->639481918_ParB-HTH*-> 644980358 <-Borrelia_orfA||?->?->ParA->ParB-HTH*-> ParB-HTH - BCD_RS04550 171 bacteria>spirochaetes Borrelia crocidurae permease, partial [Borrelia crocidurae]. 644980356_?-><-644980357_Borrelia_orfA||504495968_?->504495969_?->501533314_ParA->644980358_ParB-HTH*-> 645011182 <-ParB-HTH*<-ParA ParB-HTH - BHO_RS06460 171 bacteria>spirochaetes Borrelia hermsii permease [Borrelia hermsii]. <-645011182_ParB-HTH*<-645011185_ParA<-645011188_?<-645011190_?||749299237_?-><-749299238_? 740582340 <-ParB-HTH*<-ParA<-?<-?||Borrelia_orfA-> ParB-HTH - BDCR2A_RS06900 171 bacteria>spirochaetes Borrelia duttonii permease [Borrelia duttonii]. <-740582340_ParB-HTH*<-501533314_ParA<-504495969_?<-504495968_?||740582345_Borrelia_orfA-> 576095549 <-ParB-HTH*<-ParA<-?<-?||Borrelia_orfA-> ParB-HTH - BCO_0118100 170 bacteria>spirochaetes Borrelia coriaceae Co53 Putative plasmid partition protein (plasmid) [Borrelia coriaceae Co53]. <-576095549_ParB-HTH*<-576095550_ParA<-576095551_?<-576095552_?||576095553_Borrelia_orfA-><-576095554_? 696412166 DUF226->ParA->ParB-HTH*-><-Borrelia_lipo_1<-?||?-><-BdrA<-DUF226<-Borrelia_orfA||orfD-> ParB-HTH Plasmid_parti DY92_RS0102290 170 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. 
657252264_DUF226->696412165_ParA->696412166_ParB-HTH*-><-696412168_Borrelia_lipo_1<-696412167_?||657252267_?-><-657252268_BdrA<-696412169_DUF226<-696412170_Borrelia_orfA||657252270_orfD-> 671501759 <-ParB-HTH*<-ParA ParB-HTH Plasmid_parti DZ05_RS0105610 169 bacteria>spirochaetes Borrelia garinii permease, partial [Borrelia garinii]. <-671501759_ParB-HTH*<-671501760_ParA 696422229 <-Borrelia_orfA<-ParB-HTH*<-ParA ParB-HTH Plasmid_parti DZ06_RS01000000107455 163 bacteria>spirochaetes Borrelia garinii chromosome partitioning protein [Borrelia garinii]. <-671501216_Borrelia_orfA<-696422229_ParB-HTH*<-696422231_ParA<-696422233_? 576105904 <-Lipoprotein_2<-?<-?<-ERF<-ParB-HTH*<-ParA<-ParA<-DUF226<-?<-Borrelia_orfA||BppA->BppA-> ParB-HTH SP+Plasmid_parti BHY_1499 160 bacteria>spirochaetes Borrelia hermsii YOR hypothetical protein BHY_1499 (plasmid) [Borrelia hermsii YOR]. <-576105900_Lipoprotein_2<-576105901_?<-576105902_?<-576105903_ERF<-576105904_ParB-HTH*<-576105905_ParA<-576105906_ParA<-576105907_DUF226<-576105908_?<-576105909_Borrelia_orfA||576105910_BppA->576105911_BppA-> 645063111 ParA->ParB-HTH*-> ParB-HTH Plasmid_parti BHY_RS06200 157 bacteria>spirochaetes Borrelia hermsii permease, partial [Borrelia hermsii]. 644979721_ParA->645063111_ParB-HTH*-> 695263429 ParA->ParB-HTH*-> ParB-HTH Plasmid_parti YP_009077380.1 145 bacteria>spirochaetes Borrelia burgdorferi putative plasmid partition protein, partial [Borrelia burgdorferi]. 695208550_ParA->695263429_ParB-HTH*-> 224513739 ParA->ParB-HTH*->BdrA-><-?<-?||BppA-> ParB-HTH Plasmid_parti BSPA14S_PA0096 142 bacteria>spirochaetes Borrelia spielmanii A14S putative plasmid partition protein (plasmid) [Borrelia spielmanii A14S]. 224513741_ParA->224513739_ParB-HTH*->224513737_BdrA-><-224513738_?<-224513742_?||224513740_BppA-> 645073449 <-ParB-HTH*<-ParA ParB-HTH - BOM_RS06715 134 bacteria>spirochaetes Borrelia miyamotoi permease, partial [Borrelia miyamotoi]. 
<-645073449_ParB-HTH*<-645073450_ParA<-645073451_?<-645073452_?||763124095_?-><-763124097_? 219694015 Borrelia_lipo_2->orfD->BdrA->Mlp-><-ERF||Borrelia_orfA->ParA->ParB-HTH*->BdrA-><-?<-?||BppA->XerD-><-Phage-integrase ParB-HTH Plasmid_parti BGAPBR_V0033 133 bacteria>spirochaetes Borrelia garinii PBr putative plasmid partition protein (plasmid) [Borrelia garinii PBr]. 219694004_Borrelia_lipo_2->219694032_orfD->219694021_BdrA->219694017_Mlp-><-219694037_ERF||219694026_Borrelia_orfA->219694001_ParA->219694015_ParB-HTH*->219694014_BdrA-><-219694010_?<-219694038_?||219694020_BppA->219694025_XerD-><-219694007_Phage-integrase||219694027_?-> 644979632 <-ParB-HTH*<-ParA ParB-HTH - BHW_RS05635 129 bacteria>spirochaetes Borrelia hermsii permease, partial [Borrelia hermsii]. <-644979632_ParB-HTH*<-644979633_ParA<-644979634_?<-644979635_?||752506929_?-><-749302140_? 671558442 ParA->ParB-HTH*-> ParB-HTH Plasmid_parti DY90_RS0106595 127 bacteria>spirochaetes Borrelia garinii permease, partial [Borrelia garinii]. 671558441_ParA->671558442_ParB-HTH*-> 657235150 ParA->ParB-HTH*-> ParB-HTH Plasmid_parti DZ03_RS0106530 123 bacteria>spirochaetes Borrelia garinii permease, partial [Borrelia garinii]. 657235149_ParA->657235150_ParB-HTH*-> 657245836 ParA->ParB-HTH*-> ParB-HTH Plasmid_parti DZ07_RS0106275 123 bacteria>spirochaetes Borrelia burgdorferi chromosome partitioning protein, partial [Borrelia burgdorferi]. 657245835_ParA->657245836_ParB-HTH*-> 696413957 <-ParB-HTH*<-ParA ParB-HTH Plasmid_parti DZ02_RS0106455 119 bacteria>spirochaetes Borrelia garinii permease, partial [Borrelia garinii]. <-696413957_ParB-HTH*<-671547930_ParA 219694563 <-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA||?-><-?||?->Borrelia_orfA-> ParB-HTH Plasmid_parti BGAFAR04_E0008 117 bacteria>spirochaetes Borrelia garinii Far04 hypothetical protein BGAFAR04_E0008 (plasmid) [Borrelia garinii Far04]. 
219694576_?->219694566_?-><-219694581_?||219694571_?-><-219694557_?<-219694559_?<-219694563_ParB-HTH*<-219694558_ParA<-219694565_DUF226<-219694568_Borrelia_orfA||219694560_?-><-219694575_?||219694588_?->219694586_Borrelia_orfA-> 671561023 <-ParB-HTH*<-ParA<-DUF226<-Borrelia_orfA ParB-HTH SP DZ15_RS0105800 113 bacteria>spirochaetes Borrelia burgdorferi permease, partial [Borrelia burgdorferi]. <-671561023_ParB-HTH*<-501666664_ParA<-497942834_DUF226<-671561024_Borrelia_orfA 657249216 DUF226->ParA->ParB-HTH*-> ParB-HTH - DZ12_RS0106295 112 bacteria>spirochaetes Borrelia burgdorferi permease, partial [Borrelia burgdorferi]. 501588777_DUF226->499186342_ParA->657249216_ParB-HTH*-> 576092914 <-Lipoprotein_2||ParA->ParB-HTH*-><-?<-Mlp<-?<-Mlp ParB-HTH - BHO_0117100 106 bacteria>spirochaetes Borrelia hermsii YBT Putative plasmid partition protein (plasmid) [Borrelia hermsii YBT]. <-576092907_?<-576092908_?||576092909_?-><-576092910_?<-576092911_?<-576092912_Lipoprotein_2||576092913_ParA->576092914_ParB-HTH*-><-576092915_?<-576092916_Mlp<-576092917_?<-576092918_Mlp||576092919_?->576092920_?->576092921_?-> 695263555 ParA->?->ParB-HTH*-> ParB-HTH Plasmid_parti PF-49 96 bacteria>spirochaetes Borrelia burgdorferi PF-49 protein, partial [Borrelia burgdorferi]. 695208901_ParA->695208900_?->695263555_ParB-HTH*-> 695263547 ParA->ParB-HTH*-> ParB-HTH - PF-49 95 bacteria>spirochaetes Borrelia burgdorferi PF-49 protein, partial [Borrelia burgdorferi]. 695208881_?->695208882_ParA->695263547_ParB-HTH*-> 657236274 <-ParB-HTH*<-ParA<-Borrelia_orfA ParB-HTH - DY94_RS0105330 92 bacteria>spirochaetes Borrelia garinii hypothetical protein, partial [Borrelia garinii]. <-657236274_ParB-HTH*<-657236277_ParA<-657236280_Borrelia_orfA 219693935 Borrelia_lipo_1-><-BdrA||?-><-?<-?<-?<-ParB-HTH*<-?<-?<-ParA ParB-HTH Plasmid_parti BGAPBR_E0019 91 bacteria>spirochaetes Borrelia garinii PBr hypothetical protein BGAPBR_E0019 (plasmid) [Borrelia garinii PBr]. 
<-219693939_?||219693908_Borrelia_lipo_1-><-219693911_BdrA||219693940_?-><-219693903_?<-219693899_?<-219693941_?<-219693935_ParB-HTH*<-219693934_?<-219693930_?<-219693928_ParA<-219693937_?<-219693898_?||219693921_?->219693902_?-> 671563388 <-ParB-HTH*<-ParA ParB-HTH - DZ19_RS0106050 91 bacteria>spirochaetes Borrelia burgdorferi hypothetical protein, partial [Borrelia burgdorferi]. <-671563388_ParB-HTH*<-671563390_ParA 695263540 ParA->ParB-HTH*-> ParB-HTH - YP_009077626.1 89 bacteria>spirochaetes Borrelia burgdorferi PF-49 protein, partial [Borrelia burgdorferi]. 695208862_?->695208863_ParA->695263540_ParB-HTH*->
# 1; Same as above in general
576094619 <-BppA<-BppA||Borrelia_orfA->DUF226->ParB-HTH*->ERF-><-orfD<-Borrelia_lipo_2 ParB-HTH Plasmid_parti BCO_0130002 201 bacteria>spirochaetes Borrelia coriaceae Co53 Putative plasmid partition protein (plasmid) [Borrelia coriaceae Co53]. <-576094615_BppA<-576094616_BppA||576094617_Borrelia_orfA->576094618_DUF226->576094619_ParB-HTH*->576094620_ERF-><-576094621_orfD<-576094622_Borrelia_lipo_2<-576094623_?<-576094624_?<-576094625_?<-576094626_? 576095359 <-XerD<-?<-SSB<-SSB<-BppA||Borrelia_orfA->DUF226->ParB-HTH*->ERF-><-orfD ParB-HTH Plasmid_parti BCO_0130005 200 bacteria>spirochaetes Borrelia coriaceae Co53 Putative plasmid partition protein (plasmid) [Borrelia coriaceae Co53]. <-576095352_XerD<-576095353_?<-576095354_SSB<-576095355_SSB<-576095356_BppA||576095357_Borrelia_orfA->576095358_DUF226->576095359_ParB-HTH*->576095360_ERF-><-576095361_orfD 488735361 Phage-integrase-><-XerD<-ParB-HTH*<-DUF226<-Borrelia_orfA||ERF-> ParB-HTH Plasmid_parti DZ12_RS0105395 186 bacteria>spirochaetes Borrelia burgdorferi group MULTISPECIES: chromosome partitioning protein [Borrelia burgdorferi group]. 
<-740590199_?||488735346_?->657249030_?->657249032_Phage-integrase-><-501588716_XerD<-488735361_ParB-HTH*<-488735365_DUF226<-657249036_Borrelia_orfA||740590200_ERF-> 488739923 Borrelia_orfA->DUF226->ParB-HTH*->ERF-><-?<-?||?-><-?||Phage-integrase-> ParB-HTH SP+Plasmid_parti NM71_RS07415 186 bacteria>spirochaetes Borrelia burgdorferi chromosome partitioning protein [Borrelia burgdorferi]. 499193068_Borrelia_orfA->499193062_DUF226->488739923_ParB-HTH*->499193063_ERF-><-499193064_?<-499193070_?||763428009_?-><-499193066_?||497944141_Phage-integrase-> 493479385 ParB-HTH*->BdrA-> ParB-HTH Plasmid_parti BSPA14S_RS04890 186 bacteria>spirochaetes Borrelia spielmanii chromosome partitioning protein [Borrelia spielmanii]. 493479385_ParB-HTH*->493479383_BdrA-> 497944835 <-ParB-HTH* ParB-HTH SP+Plasmid_parti BBU80A_RS09650 186 bacteria>spirochaetes Borrelia burgdorferi chromosome partitioning protein [Borrelia burgdorferi]. <-497944835_ParB-HTH* 499985629 Phage-integrase-><-BppA<-BppA<-BppA||Borrelia_orfA->DUF226->ParB-HTH*->ERF-> ParB-HTH Plasmid_parti Ip21p08 186 bacteria>spirochaetes Borrelia burgdorferi hypothetical protein [Borrelia burgdorferi]. 115534917_?->115534918_Phage-integrase-><-115534919_BppA<-115534927_BppA<-115534920_BppA||115534921_Borrelia_orfA->115534922_DUF226->499985629_ParB-HTH*->115534924_ERF->115534925_?-><-115534926_? 501930839 Borrelia_orfA->DUF226->ParB-HTH*->ERF-><-?<-?||?->Phage-integrase-> ParB-HTH SP+Plasmid_parti BBUN40_RS05120 186 bacteria>spirochaetes Borrelia burgdorferi chromosome partitioning protein [Borrelia burgdorferi]. 504353027_Borrelia_orfA->497944126_DUF226->501930839_ParB-HTH*->497944130_ERF-><-497944134_?<-497944135_?||497944137_?->497944141_Phage-integrase-> 671501078 <-ParB-HTH* ParB-HTH SP+Plasmid_parti DZ04_RS0105605 186 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. 
<-671501078_ParB-HTH* 671580297 <-BdrA<-ParB-HTH* ParB-HTH Plasmid_parti DZ10_RS0106780 186 bacteria>spirochaetes Borrelia burgdorferi permease [Borrelia burgdorferi]. <-671580290_BdrA<-671580297_ParB-HTH* 671550272 Borrelia_lipo_1->Borrelia_lipo_1-><-ParB-HTH* ParB-HTH Plasmid_parti DZ09_RS0104855 185 bacteria>spirochaetes Borrelia burgdorferi permease [Borrelia burgdorferi]. 501666678_?->497942861_Borrelia_lipo_1->501666676_Borrelia_lipo_1-><-671550272_ParB-HTH* 696419050 <-BdrA<-ParB-HTH* ParB-HTH Plasmid_parti DY99_RS01000000109765 181 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. <-671481818_BdrA<-696419050_ParB-HTH* 644979468 Borrelia_orfA->DUF226->ParB-HTH*->BdrA->ERF-> ParB-HTH SP+Plasmid_parti BHY_RS06165 176 bacteria>spirochaetes Borrelia hermsii permease [Borrelia hermsii]. 645063105_Borrelia_orfA->645063106_DUF226->644979468_ParB-HTH*->644979469_BdrA->645063107_ERF-> 645010627 <-Lipoprotein_2<-?<-Lipoprotein_2<-BdrA<-Terminase_LS<-MultiTM<-ThiF<-ParB-HTH*<-DUF226<-Borrelia_orfA||Lipoprotein_2->?-><-BdrA||BdrA-> ParB-HTH Plasmid_parti BHO_RS05335 175 bacteria>spirochaetes Borrelia hermsii permease [Borrelia hermsii]. <-645010599_Lipoprotein_2<-645010602_?<-645010609_Lipoprotein_2<-645010612_BdrA<-645010615_Terminase_LS<-645010618_MultiTM<-645010621_ThiF<-645010627_ParB-HTH*<-645010630_DUF226<-645010633_Borrelia_orfA||645010636_Lipoprotein_2->645010639_?-><-749299198_BdrA||645010642_BdrA->749299199_?-> 497943789 ParB-HTH*->BdrA-> ParB-HTH Plasmid_parti DY93_RS0105830 174 bacteria>spirochaetes Borrelia burgdorferi chromosome partitioning protein [Borrelia burgdorferi]. 740589459_?->497943789_ParB-HTH*->671559884_BdrA-> 499985609 <-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Lipoprotein_2<-Borrelia_orfA<-ParB-HTH*<-Borrelia_orfA||BppA->BppA->XerD-><-Phage-integrase<-Mlp||BdrA-> ParB-HTH Plasmid_parti ORFc 174 bacteria>spirochaetes Borrelia duttonii hypothetical protein [Borrelia duttonii]. 
<-115534864_?<-115534865_Lipoprotein_2<-115534866_Lipoprotein_2<-115534867_Lipoprotein_2<-115534868_Lipoprotein_2<-115534869_Lipoprotein_2<-115534870_Borrelia_orfA<-499985609_ParB-HTH*<-115534872_Borrelia_orfA||115534873_BppA->115534874_BppA->115534875_XerD-><-115534876_Phage-integrase<-115534877_Mlp||115534878_BdrA-> 671520608 <-ParB-HTH* ParB-HTH Plasmid_parti DY88_RS0105135 174 bacteria>spirochaetes Borrelia garinii chromosome partitioning protein [Borrelia garinii]. <-671520608_ParB-HTH* 657235493 <-ParB-HTH* ParB-HTH SP+Plasmid_parti DY94_RS0102265 172 bacteria>spirochaetes Borrelia garinii permease [Borrelia garinii]. <-657235493_ParB-HTH*||657235495_?->657235496_?-> 752506999 Borrelia_orfA->DUF226->ParB-HTH*->ERF-><-Borrelia_lipo_2 ParB-HTH Plasmid_parti BCO_RS06215 172 bacteria>spirochaetes Borrelia coriaceae hypothetical protein [Borrelia coriaceae]. 645024187_Borrelia_orfA->645024190_DUF226->752506999_ParB-HTH*->645024196_ERF-><-645024198_Borrelia_lipo_2<-645024200_?<-645024203_?<-645024206_?<-645024210_?<-645024212_? 740592163 <-ParB-HTH* ParB-HTH Plasmid_parti DZ19_RS09730 171 bacteria>spirochaetes Borrelia burgdorferi permease [Borrelia burgdorferi]. <-740592163_ParB-HTH* 497945190 ParB-HTH*->BppA-> ParB-HTH Plasmid_parti BBU80A_RS08665 166 bacteria>spirochaetes Borrelia burgdorferi hypothetical protein, partial [Borrelia burgdorferi]. 497945190_ParB-HTH*->497945191_BppA-> 654876378 Borrelia_lipo_2->orfD-><-ERF<-ParB-HTH* ParB-HTH Plasmid_parti T431_RS0104785 153 bacteria>spirochaetes Borrelia coriaceae hypothetical protein, partial [Borrelia coriaceae]. 645024210_?->645024206_?->645024203_?->645024200_?->654876375_Borrelia_lipo_2->654876376_orfD-><-740579301_ERF<-654876378_ParB-HTH* 671558427 ParB-HTH*-> ParB-HTH Plasmid_parti DY90_RS0106570 134 bacteria>spirochaetes Borrelia garinii permease, partial [Borrelia garinii]. 
671558427_ParB-HTH*-> 671547916 <-BdrA<-ParB-HTH* ParB-HTH Plasmid_parti DZ02_RS0106405 130 bacteria>spirochaetes Borrelia garinii permease, partial [Borrelia garinii]. <-671481040_BdrA<-671547916_ParB-HTH* 671560278 <-ParB-HTH* ParB-HTH - DY93_RS0106950 94 bacteria>spirochaetes Borrelia burgdorferi permease, partial [Borrelia burgdorferi]. <-671560278_ParB-HTH*<-671560280_?
# 155; Often XerD association
494523440 Relaxase-><-?<-HNH<-?||Primpol?->ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - CWATWH0005_2825 431 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. 494523435_Relaxase-><-494523436_?<-494523437_HNH<-494523438_?||494523439_Primpol?->494523440_ParB-HTH+Prok-TUDOR*-><-494523442_?||494523443_?-> 428272365 <-ParB<-ParA<-?<-?<-?||ParB-HTH+Prok-TUDOR*-><-XerD||CASPASE-><-?||?->?->METHYLASE-> ParB-HTH+Prok-TUDOR - Sta7437_4876 420 bacteria>cyanobacteria Stanieria cyanosphaera PCC 7437 hypothetical protein Sta7437_4876 (plasmid) [Stanieria cyanosphaera PCC 7437]. <-428272358_?<-428272359_?<-428272360_ParB<-428272361_ParA<-428272362_?<-428272363_?<-428272364_?||428272365_ParB-HTH+Prok-TUDOR*-><-428272366_XerD||428272367_CASPASE-><-428272368_?||428272369_?->428272370_?->428272371_METHYLASE->428272372_?-> 428267400 <-ParA<-?<-?<-?<-?<-?||HTH->ParB-HTH+Prok-TUDOR*-><-?<-?||?->?->?->XerD-> ParB-HTH+Prok-TUDOR SP Glo7428_4930 400 bacteria>cyanobacteria Gloeocapsa sp. PCC 7428 hypothetical protein Glo7428_4930 (plasmid) [Gloeocapsa sp. PCC 7428]. <-428267393_ParA<-428267394_?<-428267395_?<-428267396_?<-428267397_?<-428267398_?||428267399_HTH->428267400_ParB-HTH+Prok-TUDOR*-><-428267401_?<-428267402_?||428267403_?->428267404_?->428267405_?->428267406_XerD-><-428267407_? 718251661 <-TerD<-TerD||?-><-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - N44_02315 370 bacteria>cyanobacteria Microcystis aeruginosa NIES-44 benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB [Microcystis aeruginosa NIES-44]. 
718251654_?-><-718251655_?||718251656_?-><-718251657_?<-718251658_TerD<-718251659_TerD||718251660_?-><-718251661_ParB-HTH+Prok-TUDOR*||718251662_?->718251663_?->718251664_?-><-718251665_?||718251666_?->718251667_?->718251668_?-> 389882556 <-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - MICAK_2860002 368 bacteria>cyanobacteria Microcystis aeruginosa PCC 9701 conserved hypothetical protein [Microcystis aeruginosa PCC 9701]. 389882570_?-><-389882571_?<-389882572_?<-389882573_?<-389882574_?||389882575_?->389882555_?-><-389882556_ParB-HTH+Prok-TUDOR*<-389882557_?||389882558_?->389882559_?-><-389882560_?||389882561_?-><-389882562_?<-389882563_? 543531309 Relaxase->?-><-?<-ASCH+ParB-HTH+Prok-TUDOR* ASCH+ParB-HTH+Prok-TUDOR - CWATWH0402_1907 361 bacteria>cyanobacteria Crocosphaera watsonii WH 0402 hypothetical protein CWATWH0402_1907 [Crocosphaera watsonii WH 0402]. <-543531302_?||543531303_?-><-543531304_?||543531305_?->543531306_Relaxase->543531307_?-><-543531308_?<-543531309_ASCH+ParB-HTH+Prok-TUDOR* 737861903 Relaxase-><-?<-HNH<-?||Primpol?->ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - CWATWH0003_RS24275 344 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. 737861894_Relaxase-><-737861897_?<-737861900_HNH<-494523438_?||494523439_Primpol?->737861903_ParB-HTH+Prok-TUDOR*->494523441_?-><-737861906_?||494523443_?->494523444_?->494523445_?-> 754508876 HTH->ParB-HTH+Prok-TUDOR*-><-?||?->?->XerD-> ParB-HTH+Prok-TUDOR - GLO7428_RS24200 338 bacteria>cyanobacteria Gloeocapsa sp. PCC 7428 hypothetical protein, partial [Gloeocapsa sp. PCC 7428]. <-754508859_?<-505004106_?<-505004107_?<-754508860_?<-505004108_?<-754508861_?||754508875_HTH->754508876_ParB-HTH+Prok-TUDOR*-><-505004112_?||505004114_?->505004115_?->754508877_XerD-><-505004117_?<-505004118_?<-505004119_? 
748135946 RVT+HNH-><-?<-?<-?||?->?->HTH->ParB-HTH+Prok-TUDOR*-><-?<-HU-IHF ParB-HTH+Prok-TUDOR - QH73_RS07795 337 bacteria>cyanobacteria Scytonema millei hypothetical protein [Scytonema millei]. 748135943_RVT+HNH-><-748135944_?<-748135878_?<-748135879_?||748135880_?->748135945_?->748135881_HTH->748135946_ParB-HTH+Prok-TUDOR*-><-748135882_?<-748135947_HU-IHF||748135883_?-><-748135948_?<-748135949_?||748135884_?->748135885_?-> 67852287 <-ExoVII<-?<-ParB-HTH+Prok-TUDOR*<-XerD ParB-HTH+Prok-TUDOR - CwatDRAFT_0109 334 bacteria>cyanobacteria Crocosphaera watsonii WH 8501 unknown protein [Crocosphaera watsonii WH 8501]. <-67852291_?||67852285_?-><-67852290_?<-67852289_ExoVII<-67852288_?<-67852287_ParB-HTH+Prok-TUDOR*<-67852286_XerD 751570983 <-RVT+HNH||?->?->DDE_Tnp_1_2-><-ParB-HTH+Prok-TUDOR*<-Primpol? ParB-HTH+Prok-TUDOR - SD81_RS27565 334 bacteria>cyanobacteria Tolypothrix campylonemoides hypothetical protein [Tolypothrix campylonemoides]. <-751570975_?<-751565841_?<-751571087_?<-751570976_RVT+HNH||751571089_?->751570978_?->751570980_DDE_Tnp_1_2-><-751570983_ParB-HTH+Prok-TUDOR*<-751570984_Primpol?||751570986_?-><-751570988_?<-751570990_?||751570992_?->751570994_?->751571091_?-> 218175378 <-XerD||ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR SP PCC7424_5542 332 bacteria>cyanobacteria Cyanothece sp. PCC 7424 hypothetical protein PCC7424_5542 (plasmid) [Cyanothece sp. PCC 7424]. 218175371_?->218175372_?->218175373_?-><-218175374_?<-218175375_?||218175376_?-><-218175377_XerD||218175378_ParB-HTH+Prok-TUDOR*->218175379_?->218175380_?->218175381_?-> 515877940 Primpol?->ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - PCC9339_RS0106675 332 bacteria>cyanobacteria Fischerella sp. PCC 9339 hypothetical protein [Fischerella sp. PCC 9339]. 
<-515877934_?<-515877935_?<-515877936_?<-515877936_?<-515877937_?<-515877938_?||737126821_Primpol?->515877940_ParB-HTH+Prok-TUDOR*->515877941_?->737126824_?->515877943_?->515877944_?->515877945_?->515877946_?->515877947_?-> 657929542 <-ParB-HTH+Prok-TUDOR*<-Primpol? ParB-HTH+Prok-TUDOR - TOL9009_RS0101730 330 bacteria>cyanobacteria [Scytonema hofmanni] UTEX B 1581 hypothetical protein [[Scytonema hofmanni] UTEX B 1581]. <-657929539_?<-657929540_?<-657929541_?<-740464363_?<-657929542_ParB-HTH+Prok-TUDOR*<-657929543_Primpol?||657929544_?->657929545_?->657929546_?->657929547_?->740464366_?->657929548_?-> 407266570 <-DDE||TPR-><-?||?->?->?->?-><-ParB-HTH+Prok-TUDOR*||?->?-><-?||?->?-><-?||HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-> ParB-HTH+Prok-TUDOR - FDUTEX481_04300 326 bacteria>cyanobacteria Tolypothrix sp. PCC 7601 hypothetical protein FDUTEX481_04300 [Tolypothrix sp. PCC 7601]. <-407266564_DDE||407266563_TPR-><-407266565_?||407266566_?->407266567_?->407266568_?->407266569_?-><-407266570_ParB-HTH+Prok-TUDOR*||407266571_?->407266572_?-><-407266573_?||407266574_?->407266575_?-><-407266576_?||407266577_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-> 515385753 <-ParB-HTH+Prok-TUDOR*<-Primpol?||?->?-><-?||?-><-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon ParB-HTH+Prok-TUDOR - UYC_RS0100505 326 bacteria>cyanobacteria Chlorogloeopsis fritschii hypothetical protein [Chlorogloeopsis fritschii]. 515385746_?->750127864_?->515385748_?-><-515385749_?||515385750_?->515385751_?->515388784_?-><-515385753_ParB-HTH+Prok-TUDOR*<-515388785_Primpol?||515388786_?->515385106_?-><-515385110_?||515385113_?-><-515385115_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon||515385118_?-> 546206668 PriCT_2->XerD->ParB-HTH+Prok-TUDOR*->?->ExoVII-> ParB-HTH+Prok-TUDOR - CWATWH8502_3343 323 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. 546206663_PriCT_2->546206666_XerD->546206668_ParB-HTH+Prok-TUDOR*->494518954_?->546206670_ExoVII->494518956_?-><-494518951_? 
546222413 <-ExoVII<-?<-ParB-HTH+Prok-TUDOR*<-XerD ParB-HTH+Prok-TUDOR - CWATWH0005_5641 323 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. 494524271_?->494524273_?->546222411_?->546222412_?-><-494523800_?<-494518955_ExoVII<-494518954_?<-546222413_ParB-HTH+Prok-TUDOR*<-546222414_XerD 655839534 RVT+HNH->?->?->ParB-HTH->HTH->Primpol?->ParB-HTH+Prok-TUDOR*-><-?<-?||RecD->XerD-> ParB-HTH+Prok-TUDOR - SYN7509_RS0222055 323 bacteria>cyanobacteria Synechocystis sp. PCC 7509 hypothetical protein [Synechocystis sp. PCC 7509]. 740179509_?->497316263_RVT+HNH->497316262_?->497316261_?->740179512_ParB-HTH->497316258_HTH->740179516_Primpol?->655839534_ParB-HTH+Prok-TUDOR*-><-497316255_?<-655839535_?||740179519_RecD->497316172_XerD->497316171_?->740179350_?->740179426_?-> 748136445 HU-IHF-><-?<-ParB-HTH+Prok-TUDOR*<-HTH<-?<-?||?-><-?<-?||NACHT-> ParB-HTH+Prok-TUDOR - QH73_RS10255 323 bacteria>cyanobacteria Scytonema millei hypothetical protein [Scytonema millei]. <-748136367_?||748136368_?->748136443_?->748136369_?-><-748136444_?||748136370_HU-IHF-><-748136371_?<-748136445_ParB-HTH+Prok-TUDOR*<-748136372_HTH<-748136373_?<-748136374_?||748136375_?-><-748136376_?<-748136377_?||748136378_NACHT-> 797212629 TPR-><-?||?->?->?-><-ParB-HTH+Prok-TUDOR*||?->?-><-?||?-><-?||HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-ATHOOK+ParA ParB-HTH+Prok-TUDOR - FDUTEX481_RS32065 323 bacteria>cyanobacteria Tolypothrix sp. PCC 7601 hypothetical protein [Tolypothrix sp. PCC 7601]. 797212624_?->797212718_?->797212719_TPR-><-797212625_?||797212626_?->797212627_?->797212628_?-><-797212629_ParB-HTH+Prok-TUDOR*||797212630_?->797212720_?-><-797212631_?||797212721_?-><-797212722_?||797212632_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-797212633_ATHOOK+ParA 769922127 HTH->Primpol?->ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - UH38_RS20080 322 bacteria>cyanobacteria Chroococcales cyanobacterium CENA595 hypothetical protein [Chroococcales cyanobacterium CENA595]. 
<-769922079_?||769922080_?->769922081_HTH->769922126_Primpol?->769922127_ParB-HTH+Prok-TUDOR*->769922082_?->769922083_?->769922128_?->769922084_?->769922085_?->769922086_?->769922087_?-> 737859551 Primpol?->ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - CWATWH0003_RS12720 320 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein, partial [Crocosphaera watsonii]. 737859546_?->737859548_?->494521402_Primpol?->737859551_ParB-HTH+Prok-TUDOR*-> 186469442 <-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - Npun_BR102 317 bacteria>cyanobacteria Nostoc punctiforme PCC 73102 conserved hypothetical protein (plasmid) [Nostoc punctiforme PCC 73102]. <-186469435_?<-186469436_?<-186469437_?||186469438_?-><-186469439_?||186469440_?->186469441_?-><-186469442_ParB-HTH+Prok-TUDOR*<-186469443_?<-186469444_?<-186469445_?||186469446_?->186469447_?->186469448_?->186469449_?-> 797208446 <-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - FDUTEX481_RS10740 317 bacteria>cyanobacteria Tolypothrix sp. PCC 7601 hypothetical protein [Tolypothrix sp. PCC 7601]. <-797208721_?<-797208441_?<-797208442_?<-797208722_?<-797208443_?<-797208444_?<-797208445_?<-797208446_ParB-HTH+Prok-TUDOR*<-797208447_?||797208723_?->797208724_?->797208448_?->797208725_?->797208449_?->797208450_?-> 501601085 <-ParB-HTH+Prok-TUDOR*<-XerD ParB-HTH+Prok-TUDOR SP PCC7424_RS26725 314 bacteria>cyanobacteria Cyanothece sp. PCC 7424 hypothetical protein [Cyanothece sp. PCC 7424]. <-501601078_?<-752567297_?<-501601080_?<-752567298_?<-501601082_?||501601083_?->501601084_?-><-501601085_ParB-HTH+Prok-TUDOR*<-501601086_XerD||501601087_?-><-752567299_?<-501601089_?<-501601090_?<-752567300_?||501601092_?-> 499635872 <-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR SP AVA_RS27595 313 bacteria>cyanobacteria Anabaena variabilis hypothetical protein [Anabaena variabilis]. 
<-499635864_?<-752818109_?<-499635866_?<-499635867_?||752818153_?-><-499635870_?<-499635871_?<-499635872_ParB-HTH+Prok-TUDOR*<-499635873_?||499635874_?->752818154_?->752818155_?->499635876_?->499635877_?->752818156_?-> 501381405 NACHT->STYKIN->?-><-TPR+CASPASE||?->ParB-HTH+Prok-TUDOR*-><-?<-?<-PIN+CASPASE<-TPR+CASPASE||DDE_3-> ParB-HTH+Prok-TUDOR SP NPUN_RS34090 313 bacteria>cyanobacteria Nostoc punctiforme hypothetical protein [Nostoc punctiforme]. <-501381398_?<-753810943_?||501381399_NACHT->753810971_STYKIN->753810972_?-><-501381402_TPR+CASPASE||501381404_?->501381405_ParB-HTH+Prok-TUDOR*-><-501381406_?<-501381408_?<-501381409_PIN+CASPASE<-501381410_TPR+CASPASE||501381411_DDE_3->753810944_?->753810973_?-> 389714985 <-ParB-HTH+Prok-TUDOR*<-?||?-><-?<-?<-?<-RVT+HNH ParB-HTH+Prok-TUDOR SP MICAB_900014 309 bacteria>cyanobacteria Microcystis aeruginosa PCC 9717 conserved hypothetical protein [Microcystis aeruginosa PCC 9717]. 389714978_?->389714979_?->389714980_?->389714981_?->389714982_?->389714983_?->389714984_?-><-389714985_ParB-HTH+Prok-TUDOR*<-389714986_?||389714970_?-><-389714961_?<-389714962_?<-389714963_?<-389714964_RVT+HNH||389714965_?-> 499309017 STYKIN->?-><-TPR+CASPASE<-?<-?<-?<-?||ParB-HTH+Prok-TUDOR*->?->RVT+HNH->RVT+HNH->RVT+HNH->?-><-?<-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon ParB-HTH+Prok-TUDOR SP PCC7120DELTA_RS29565 309 bacteria>cyanobacteria Nostoc sp. PCC 7120 hypothetical protein [Nostoc sp. PCC 7120]. 764953490_STYKIN->764953492_?-><-499309012_TPR+CASPASE<-499309013_?<-499309014_?<-499309015_?<-499309016_?||499309017_ParB-HTH+Prok-TUDOR*->764953376_?->499309018_RVT+HNH->764953494_RVT+HNH->764953500_RVT+HNH->499309021_?-><-499309022_?<-499309023_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon 499635567 <-ParB-HTH+Prok-TUDOR*||?->?->?->TPR+CASPASE-><-?<-STYKIN||NACHT-> ParB-HTH+Prok-TUDOR SP AVA_RS26020 309 bacteria>cyanobacteria Anabaena variabilis hypothetical protein [Anabaena variabilis]. 
499309022_?-><-499309021_?<-499635567_ParB-HTH+Prok-TUDOR*||499635568_?->499309014_?->499635569_?->499635570_TPR+CASPASE-><-499635571_?<-752818111_STYKIN||752818112_NACHT-> 515347403 ABC->ABC-><-?||Primpol?->ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - UYG_RS0120335 308 bacteria>cyanobacteria Fischerella muscicola hypothetical protein [Fischerella muscicola]. 515347396_?->515347397_?->515347398_?->515347399_ABC->515347400_ABC-><-515347401_?||703201084_Primpol?->515347403_ParB-HTH+Prok-TUDOR*-><-515347404_?||515347405_?->515347406_?->515347407_?->515347408_?->703201088_?->703201078_?-> 501381481 ParB-HTH+Prok-TUDOR*->TPR->TPR+CASPASE->NACHT->TPR+CASPASE-> ParB-HTH+Prok-TUDOR - NPUN_RS34540 305 bacteria>cyanobacteria Nostoc punctiforme hypothetical protein [Nostoc punctiforme]. <-501381475_?<-753810996_?<-501381477_?<-753810949_?<-501381478_?<-753810997_?||501381480_?->501381481_ParB-HTH+Prok-TUDOR*->753810998_TPR->753810999_TPR+CASPASE->753811000_NACHT->753811001_TPR+CASPASE->501381484_?->753811002_?->753811003_?-> 753865019 <-ParB<-ParA<-?<-?<-?||?->ParB-HTH+Prok-TUDOR*-><-XerD||CASPASE-><-?||?->METHYLASE-> ParB-HTH+Prok-TUDOR - STA7437_RS23975 302 bacteria>cyanobacteria Stanieria cyanosphaera hypothetical protein [Stanieria cyanosphaera]. <-505008572_?<-505008573_ParB<-505008574_ParA<-505008575_?<-753865013_?<-753865015_?||753865017_?->753865019_ParB-HTH+Prok-TUDOR*-><-753865021_XerD||753865024_CASPASE-><-505008581_?||753865026_?->505008584_METHYLASE->505008585_?-><-505008586_? 757158775 <-ExoVII<-?<-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - CWATDRAFT_RS29615 301 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. 757158773_?-><-494518957_?||494518951_?-><-494518956_?<-494518955_ExoVII<-494518954_?<-757158775_ParB-HTH+Prok-TUDOR* 744450902 <-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR SP DA73_0214905 298 bacteria>cyanobacteria Tolypothrix bouteillei VB521301 hypothetical protein DA73_0214905 [Tolypothrix bouteillei VB521301]. 
744450881_?-><-744450898_?||744450899_?->744450882_?-><-744450900_?<-744450883_?<-744450901_?<-744450902_ParB-HTH+Prok-TUDOR*<-744450884_?||744450885_?->744450886_?->744450887_?-><-744450888_?<-744450889_?||744450890_?-> 737134277 ParB-HTH+Prok-TUDOR*-><-?<-?<-?<-RVT+HNH ParB-HTH+Prok-TUDOR - FIS9431_RS33115 295 bacteria>cyanobacteria Fischerella sp. PCC 9431 hypothetical protein [Fischerella sp. PCC 9431]. <-652326659_?||737135005_?->652326660_?->652326661_?-><-652326662_?||737135007_?->737135017_?->737134277_ParB-HTH+Prok-TUDOR*-><-652326663_?<-652326664_?<-652326665_?<-652326666_RVT+HNH<-652326667_?<-652326668_?<-737135019_? 769920071 <-ParB-HTH+Prok-TUDOR*<-?<-HISKIN||?->HISKIN-> ParB-HTH+Prok-TUDOR - UH38_RS09160 294 bacteria>cyanobacteria Chroococcales cyanobacterium CENA595 hypothetical protein [Chroococcales cyanobacterium CENA595]. 769919939_?-><-769919940_?<-769919941_?<-769919942_?<-769919943_?<-769920071_ParB-HTH+Prok-TUDOR*<-769919944_?<-769919945_HISKIN||769919946_?->769920072_HISKIN->769919947_?-><-769919948_?<-769919949_? 779871805 SNF-helicase->?->?-><-?<-?<-TerD<-TerD<-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - N44_RS02080 293 bacteria>cyanobacteria Microcystis aeruginosa hypothetical protein [Microcystis aeruginosa]. 779871748_SNF-helicase->779871801_?->779871752_?-><-779871755_?<-779871760_?<-488847691_TerD<-488847692_TerD<-779871805_ParB-HTH+Prok-TUDOR*||779871765_?->779871769_?->779871774_?-><-779871778_?||488837493_?->779871786_?-> 752568031 <-ExoVII<-ParB-HTH+Prok-TUDOR*<-XerD ParB-HTH+Prok-TUDOR - CYAN8802_RS22020 292 bacteria>cyanobacteria Cyanothece sp. PCC 8802 hypothetical protein [Cyanothece sp. PCC 8802]. <-502464563_?<-502464564_?<-502464565_?<-752568030_?||502464569_?-><-502464578_?<-502464579_ExoVII<-752568031_ParB-HTH+Prok-TUDOR*<-752568048_XerD<-502464582_?||502464583_?-><-502464586_?||752568033_?->752568050_?-><-502464594_? 
763118064 <-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - MICAK_RS15355 291 bacteria>cyanobacteria Microcystis aeruginosa hypothetical protein [Microcystis aeruginosa]. 490389584_?-><-490389586_?<-490389588_?<-490389589_?<-490389590_?||490389591_?->490389593_?-><-763118064_ParB-HTH+Prok-TUDOR*||763118011_?->490389596_?->490389597_?-><-763118012_?<-490389598_? 505024902 <-ParB-HTH+Prok-TUDOR*<-?||XerD-><-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon<-?<-Zn_Tnp_IS1595<-McrB ParB-HTH+Prok-TUDOR SP STA7437_RS22850 290 bacteria>cyanobacteria Stanieria cyanosphaera hypothetical protein [Stanieria cyanosphaera]. 505024896_?->753864831_?->505024897_?->505024898_?->505024899_?->505024900_?->505024901_?-><-505024902_ParB-HTH+Prok-TUDOR*<-753864890_?||505024905_XerD-><-505024906_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon<-505024907_?<-753864892_Zn_Tnp_IS1595<-505024909_McrB<-505024910_? 505141377 TPR->?-><-DDE_Tnp_1_2||?->?-><-?<-ParB-HTH+Prok-TUDOR*||?->?-><-DDE_3 ParB-HTH+Prok-TUDOR - CYLST_RS31010 290 bacteria>cyanobacteria Cylindrospermum stagnale hypothetical protein [Cylindrospermum stagnale]. <-752562980_?||505141373_TPR->505141374_?-><-505141375_DDE_Tnp_1_2||752562981_?->752562892_?-><-752562982_?<-505141377_ParB-HTH+Prok-TUDOR*||505141378_?->505141379_?-><-752561959_DDE_3<-752561958_?||505141380_?->505141381_?->752562984_?-> 518335686 Relaxase->?->?->?->?->?-><-ParB-HTH+Prok-TUDOR*||XerD-><-?<-?<-?<-?<-ABC ParB-HTH+Prok-TUDOR SP PLEUR7319_RS0114150 289 bacteria>cyanobacteria Pleurocapsa sp. PCC 7319 hypothetical protein [Pleurocapsa sp. PCC 7319]. 518335678_?->518335679_Relaxase->518335680_?->518335682_?->518335683_?->518335684_?->518335685_?-><-518335686_ParB-HTH+Prok-TUDOR*||518335687_XerD-><-648410763_?<-518335688_?<-518335689_?<-648410764_?<-648410765_ABC<-518335692_? 738538439 URI->?->?->?->?-><-ParB-HTH+Prok-TUDOR*||XerD-><-?||HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-?<-SFII-helicase ParB-HTH+Prok-TUDOR SP KV40_RS24315 289 bacteria>cyanobacteria Myxosarcina sp. 
GI1 hypothetical protein [Myxosarcina sp. GI1]. <-738538426_?||738538428_?->738538430_URI->738538431_?->738538432_?->738538433_?->738538438_?-><-738538439_ParB-HTH+Prok-TUDOR*||738538562_XerD-><-738538440_?||738538442_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon-><-738538445_?<-738538447_SFII-helicase<-738538449_?<-738538564_? 752567372 <-XerD||ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR SP PCC7424_RS28605 289 bacteria>cyanobacteria Cyanothece sp. PCC 7424 hypothetical protein, partial [Cyanothece sp. PCC 7424]. <-501601018_?||501601019_?->501601020_?->501601021_?-><-752567351_?||501601024_?-><-752567371_XerD||752567372_ParB-HTH+Prok-TUDOR*->501601027_?->501601028_?->752567352_?-> 768384071 HTH->ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - UH38_20050 289 bacteria>cyanobacteria Chroococcales cyanobacterium CENA595 hypothetical protein UH38_20050 [Chroococcales cyanobacterium CENA595]. <-768384026_?||768384027_?->768384028_HTH->768384071_ParB-HTH+Prok-TUDOR*->768384029_?->768384030_?->768384031_?->768384032_?->768384033_?->768384034_?->768384035_?-> 505030514 <-RDRP||?->?->?-><-?<-?<-?||ParB-HTH+Prok-TUDOR*-><-HISKIN ParB-HTH+Prok-TUDOR - ANACY_RS28045 288 bacteria>cyanobacteria Anabaena cylindrica hypothetical protein [Anabaena cylindrica]. <-505030507_RDRP||505030508_?->505030509_?->505030510_?-><-505030511_?<-505030512_?<-505030513_?||505030514_ParB-HTH+Prok-TUDOR*-><-505030515_HISKIN||505030516_?-><-505030517_?<-505030518_?||755115646_?->505030520_?->505030521_?-> 753811080 <-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - NPUN_RS35730 288 bacteria>cyanobacteria Nostoc punctiforme hypothetical protein [Nostoc punctiforme]. 
<-501381676_?<-753811078_?<-501381678_?<-501381679_?||753811079_?->501381682_?->501381683_?-><-753811080_ParB-HTH+Prok-TUDOR*<-501381685_?<-501381686_?<-501381687_?||753811081_?->501381689_?->753811043_?->501381690_?-> 744452929 <-XerD||?-><-?<-?<-?||ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - DA73_0203765 287 bacteria>cyanobacteria Tolypothrix bouteillei VB521301 hypothetical protein DA73_0203765 [Tolypothrix bouteillei VB521301]. <-744452910_?||744452911_?-><-744452912_XerD||744452913_?-><-744452914_?<-744452915_?<-744452928_?||744452929_ParB-HTH+Prok-TUDOR*->744452916_?->744452917_?-><-744452918_?<-744452919_?||744452920_?->744452921_?->744452922_?-> 752567338 <-ParB-HTH+Prok-TUDOR*<-?<-DCM ParB-HTH+Prok-TUDOR - PCC7424_RS28050 287 bacteria>cyanobacteria Cyanothece sp. PCC 7424 hypothetical protein [Cyanothece sp. PCC 7424]. 752567363_?->501600808_?-><-501600809_?<-501600810_?||501600811_?-><-752567337_?<-501600813_?<-752567338_ParB-HTH+Prok-TUDOR*<-752567339_?<-752567340_DCM||501600815_?-><-501600816_?<-501600817_?<-501600818_?<-501600819_? 218175274 RdRP+RNaseH+RNaseH->?-><-?<-?||?-><-?<-?<-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - PCC7424_5430 286 bacteria>cyanobacteria Cyanothece sp. PCC 7424 conserved hypothetical protein (plasmid) [Cyanothece sp. PCC 7424]. 218175267_RdRP+RNaseH+RNaseH->218175268_?-><-218175269_?<-218175270_?||218175271_?-><-218175272_?<-218175273_?<-218175274_ParB-HTH+Prok-TUDOR*||218175275_?-><-218175276_?<-218175277_?<-218175278_?<-218175279_?<-218175280_?<-218175281_? 748134961 <-DCM<-?<-?<-?||?->ParB-HTH->Primpol?->ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - QH73_RS02585 284 bacteria>cyanobacteria Scytonema millei hypothetical protein, partial [Scytonema millei]. 
<-748134886_DCM<-748134887_?<-748134888_?<-748134889_?||748134890_?->748134960_ParB-HTH->748134891_Primpol?->748134961_ParB-HTH+Prok-TUDOR*-><-748134962_?||748134963_?->748134964_?->748134965_?->748134892_?->748134966_?->748134967_?-> 748137603 <-ParB-HTH+Prok-TUDOR*<-HTH<-HTH<-RecT ParB-HTH+Prok-TUDOR - QH73_RS16405 282 bacteria>cyanobacteria Scytonema millei hypothetical protein [Scytonema millei]. 748137524_?->748137525_?->748137526_?->748137601_?->748137527_?-><-748137602_?||748137528_?-><-748137603_ParB-HTH+Prok-TUDOR*<-748137529_HTH<-748137530_HTH<-748137604_RecT<-748137531_?||748137605_?->748137532_?->748137533_?-> 737857352 ParB->?->?->?->?-><-?<-ExoVII<-ParB-HTH+Prok-TUDOR*<-?<-XerD ParB-HTH+Prok-TUDOR - CWATWH0003_RS03005 281 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. 737857345_ParB->494519769_?->494519770_?->737857348_?->494519772_?-><-494519773_?<-494519774_ExoVII<-737857352_ParB-HTH+Prok-TUDOR*<-737857355_?<-737857358_XerD 750617827 METHYLASE-><-?<-?<-?<-?<-ParB-HTH+Prok-TUDOR*||?->?-><-?||Peptidase_M10-> ParB-HTH+Prok-TUDOR SP XEN7305_RS25800 281 bacteria>cyanobacteria Xenococcus sp. PCC 7305 hypothetical protein [Xenococcus sp. PCC 7305]. 493559047_?->493559048_?->493559049_METHYLASE-><-750617862_?<-493559051_?<-493559052_?<-493559053_?<-750617827_ParB-HTH+Prok-TUDOR*||493559055_?->493559056_?-><-493559057_?||750617865_Peptidase_M10-><-493559059_?||493559060_?-><-750617867_? 494519775 XerD->?->ParB-HTH+Prok-TUDOR*->ExoVII->?-><-?<-?<-?<-?<-ParB ParB-HTH+Prok-TUDOR - CWATWH0005_5327 280 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. 
546226988_XerD->494519776_?->494519775_ParB-HTH+Prok-TUDOR*->494519774_ExoVII->494519773_?-><-494519772_?<-546226991_?<-494519770_?<-494519769_?<-546226994_ParB 497232044 TPR+CASPASE->?-><-?||?->?->?->Primpol?->ParB-HTH+Prok-TUDOR*->?->?->?->TerD-> ParB-HTH+Prok-TUDOR - CY51472DRAFT_RS0225515 280 bacteria>cyanobacteria Cyanothece MULTISPECIES: hypothetical protein [Cyanothece]. 501330960_TPR+CASPASE->497232050_?-><-497232049_?||497232048_?->497232047_?->501330961_?->497232045_Primpol?->497232044_ParB-HTH+Prok-TUDOR*->497232043_?->497232042_?->497232041_?->497232040_TerD->639855298_?->639855302_?->497232038_?-> 763118968 <-ParB-HTH+Prok-TUDOR*<-?<-?<-?<-RVT+HNH ParB-HTH+Prok-TUDOR - MICAB_RS03030 280 bacteria>cyanobacteria Microcystis aeruginosa hypothetical protein [Microcystis aeruginosa]. 488845132_?->488845134_?->488845136_?->763118967_?->488845140_?->488845142_?->488845144_?-><-763118968_ParB-HTH+Prok-TUDOR*<-488845148_?<-763118969_?<-488845152_?<-488845158_RVT+HNH||488845160_?->488845162_?->488845163_?-> 428013042 <-ParA<-ParA<-?<-HNH||Primpol?->ParB-HTH+Prok-TUDOR*-><-?<-?<-HNH||?->?->AAA-ATPase-> ParB-HTH+Prok-TUDOR - Chro_5819 279 bacteria>cyanobacteria Chroococcidiopsis thermalis PCC 7203 hypothetical protein Chro_5819 (plasmid) [Chroococcidiopsis thermalis PCC 7203]. 428013035_?-><-428013036_?<-428013037_ParA<-428013038_ParA<-428013039_?<-428013040_HNH||428013041_Primpol?->428013042_ParB-HTH+Prok-TUDOR*-><-428013043_?<-428013044_?<-428013045_HNH||428013046_?->428013047_?->428013048_AAA-ATPase->428013049_?-> 494514224 <-METHYLASE<-SNF-helicase<-?<-ExoVII<-ParB-HTH+Prok-TUDOR*||?-><-?<-?<-?||?->ParB-> ParB-HTH+Prok-TUDOR - CWATDRAFT_RS03435 279 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. 
<-494514218_?<-546220957_?<-494514219_?<-494514220_METHYLASE<-494514221_SNF-helicase<-494514222_?<-757157015_ExoVII<-494514224_ParB-HTH+Prok-TUDOR*||494514225_?-><-494514226_?<-494514227_?<-757157016_?||494514228_?->494514229_ParB-><-494514230_? 494523801 <-ExoVII<-?<-ParB-HTH+Prok-TUDOR*<-XerD ParB-HTH+Prok-TUDOR - CWATWH0003_RS26285 279 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. 494523799_?->546222412_?-><-494523800_?<-494518955_ExoVII<-494518954_?<-494523801_ParB-HTH+Prok-TUDOR*<-494523802_XerD 497231939 ParB-><-?<-ExoVII<-?<-?<-?<-XerD<-ParB-HTH+Prok-TUDOR*||?-><-?<-?||TPR+CASPASE-> ParB-HTH+Prok-TUDOR SP CY51472DRAFT_RS0223830 279 bacteria>cyanobacteria Cyanothece MULTISPECIES: hypothetical protein [Cyanothece]. 497231946_ParB-><-497231945_?<-639854853_ExoVII<-497231943_?<-497231942_?<-497231941_?<-737891485_XerD<-497231939_ParB-HTH+Prok-TUDOR*||497231938_?-><-497231937_?<-501330991_?||501330992_TPR+CASPASE->497231934_?->497231933_?-><-497231932_? 497232068 Tox-HNH->?-><-ParB-HTH+Prok-TUDOR*||?->?->?->?->?->HNH-> ParB-HTH+Prok-TUDOR - CY51472DRAFT_RS0225395 279 bacteria>cyanobacteria Cyanothece MULTISPECIES: hypothetical protein [Cyanothece]. <-497232077_?<-497232075_?||497232074_?-><-737891503_?<-497232071_?||497232070_Tox-HNH->497232069_?-><-497232068_ParB-HTH+Prok-TUDOR*||497232067_?->497232066_?->639855262_?->497232064_?->497232063_?->497232062_HNH->497232061_?-> 543428839 ParB-HTH+Prok-TUDOR*->ExoVII->?->SNF-helicase->METHYLASE-> ParB-HTH+Prok-TUDOR - CWATWH0401_4234 279 bacteria>cyanobacteria Crocosphaera watsonii WH 0401 hypothetical protein CWATWH0401_4234 [Crocosphaera watsonii WH 0401]. <-543428838_?||543428839_ParB-HTH+Prok-TUDOR*->543428840_ExoVII->543428841_?->543428842_SNF-helicase->543428843_METHYLASE->543428844_?->543428845_?->543428846_?-> 737832178 Tox-HNH->?-><-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR SP CY0110_RS14620 279 bacteria>cyanobacteria Cyanothece sp. 
CCY0110 hypothetical protein [Cyanothece sp. CCY0110]. 495551649_?-><-495551650_?||495551651_?-><-737832174_?<-495551652_?||495551653_Tox-HNH->737832175_?-><-737832178_ParB-HTH+Prok-TUDOR* 515515560 <-DDE<-McrB+METHYLASE<-?<-URI<-?||ParB-HTH+Prok-TUDOR*-><-?<-?<-?<-?||DDE-> ParB-HTH+Prok-TUDOR - ANA7108_RS0100620 278 bacteria>cyanobacteria Anabaena sp. PCC 7108 hypothetical protein [Anabaena sp. PCC 7108]. 755139934_?-><-515515555_?<-755139935_DDE<-515515557_McrB+METHYLASE<-755139938_?<-755139940_URI<-515515559_?||515515560_ParB-HTH+Prok-TUDOR*-><-648412157_?<-515515562_?<-515515563_?<-755139943_?||515515565_DDE->515515569_?-><-515515570_? 740179759 <-XerD||?->?-><-XerD||HU-IHF-><-?<-ParB-HTH+Prok-TUDOR*<-HTH<-?||?->?->DDE-> ParB-HTH+Prok-TUDOR - SYN7509_RS0223705 278 bacteria>cyanobacteria Synechocystis sp. PCC 7509 hypothetical protein [Synechocystis sp. PCC 7509]. <-740179750_?<-497316315_XerD||740179753_?->497316313_?-><-740179756_XerD||655839688_HU-IHF-><-497316309_?<-740179759_ParB-HTH+Prok-TUDOR*<-740179762_HTH<-655839696_?||655839701_?->655839706_?->655839485_DDE->497316325_?->740179661_?-> 769921346 <-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - UH38_RS16060 278 bacteria>cyanobacteria Chroococcales cyanobacterium CENA595 hypothetical protein [Chroococcales cyanobacterium CENA595]. <-769921282_?<-769921343_?<-769921283_?||769921344_?->769921284_?->769921345_?->769921285_?-><-769921346_ParB-HTH+Prok-TUDOR*||769921286_?-><-769921287_?<-769921288_?<-769921289_?||769921347_?-><-769921290_?<-769921291_? 748136693 <-DDE_Tnp_IS1<-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - QH73_RS11110 276 bacteria>cyanobacteria Scytonema millei hypothetical protein, partial [Scytonema millei]. 748136506_?->748136507_?->748136508_?-><-748136509_?<-748136691_?<-748136692_?<-748136510_DDE_Tnp_IS1<-748136693_ParB-HTH+Prok-TUDOR*<-748136511_?<-748136694_?<-748136512_?||748136695_?-><-748136696_?<-748136513_?<-748136514_? 
501223295 <-ParB-HTH+Prok-TUDOR*<-?<-?<-?<-?||DDE_3-><-?||DDE_Tnp_1_2-> ParB-HTH+Prok-TUDOR - MAE_RS14770 275 bacteria>cyanobacteria Microcystis aeruginosa hypothetical protein [Microcystis aeruginosa]. <-501223289_?||488880879_?->501223290_?->501221134_?-><-501223292_?<-501223293_?||754188503_?-><-501223295_ParB-HTH+Prok-TUDOR*<-754188505_?<-501223296_?<-501220880_?<-501220879_?||501221980_DDE_3-><-501223297_?||754188508_DDE_Tnp_1_2-> 737188140 <-ParB-HTH+Prok-TUDOR*<-?<-?||?-><-?<-HISKIN ParB-HTH+Prok-TUDOR - CAL7103_RS0120705 274 bacteria>cyanobacteria Calothrix sp. PCC 7103 hypothetical protein [Calothrix sp. PCC 7103]. 648401412_?-><-518321992_?<-518321993_?<-518321994_?<-518321995_?<-518321996_?<-518321997_?<-737188140_ParB-HTH+Prok-TUDOR*<-518321999_?<-518322000_?||737188142_?-><-648401413_?<-518322003_HISKIN<-518322004_?||518322005_?-> 518327692 NACHT->?-><-?<-?<-ParB-HTH+Prok-TUDOR*||?-><-?<-?<-?<-CASPASE<-?||TPR+HD-RNase-> ParB-HTH+Prok-TUDOR - CAL7103_RS0150440 273 bacteria>cyanobacteria Calothrix sp. PCC 7103 hypothetical protein [Calothrix sp. PCC 7103]. 518327685_?-><-518327686_?<-518327687_?||648402072_NACHT->518327689_?-><-518327690_?<-518327691_?<-518327692_ParB-HTH+Prok-TUDOR*||518327693_?-><-518327694_?<-518327695_?<-518327696_?<-518327697_CASPASE<-518327698_?||518327699_TPR+HD-RNase-> 754536191 ParB-HTH+Prok-TUDOR*->?->?-><-?<-?<-?<-ParB<-XerD ParB-HTH+Prok-TUDOR - CYAN7822_RS33100 272 bacteria>cyanobacteria Cyanothece sp. PCC 7822 hypothetical protein, partial [Cyanothece sp. PCC 7822]. 754536188_?->503090766_?-><-503090767_?<-503090768_?||503090770_?-><-503090771_?<-503090772_?||754536191_ParB-HTH+Prok-TUDOR*->503090776_?->503090777_?-><-503090778_?<-503090780_?<-754536193_?<-754536196_ParB<-503090783_XerD 126620031 Tox-HNH->?-><-ParB-HTH+Prok-TUDOR*<-?<-HNH ParB-HTH+Prok-TUDOR - CY0110_32445 271 bacteria>cyanobacteria Cyanothece sp. CCY0110 hypothetical protein CY0110_32445 [Cyanothece sp. CCY0110]. 
126620024_?->126620025_?-><-126620026_?||126620027_?-><-126620028_?||126620029_Tox-HNH->126620030_?-><-126620031_ParB-HTH+Prok-TUDOR*<-126620032_?<-126620033_HNH 495554039 ParB-HTH+Prok-TUDOR*->?-><-?<-?<-?<-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon ParB-HTH+Prok-TUDOR - CY0110_RS25950 271 bacteria>cyanobacteria Cyanothece sp. CCY0110 hypothetical protein [Cyanothece sp. CCY0110]. <-495554029_?<-737833440_?||495554031_?-><-495554035_?<-737833445_?<-737833442_?<-495554038_?||495554039_ParB-HTH+Prok-TUDOR*->495554040_?-><-495554041_?<-495554042_?<-495554043_?<-737833447_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon<-495554045_?||495554046_?-> 737862397 <-ExoVII<-ParB-HTH+Prok-TUDOR*<-XerD<-DDE_Tnp_ISAZ013 ParB-HTH+Prok-TUDOR - CWATWH0003_RS26330 269 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. <-494523803_?<-494523804_?||494523805_?-><-494523807_?<-494523808_?<-494523809_ExoVII<-737862397_ParB-HTH+Prok-TUDOR*<-494523813_XerD<-494523815_DDE_Tnp_ISAZ013 751574024 XerD->?->?-><-ParB-HTH+Prok-TUDOR*||?-><-?||?->RVT+HNH-> ParB-HTH+Prok-TUDOR - SD81_RS35605 269 bacteria>cyanobacteria Tolypothrix campylonemoides hypothetical protein [Tolypothrix campylonemoides]. 751574020_XerD->751574022_?->751573928_?-><-751574024_ParB-HTH+Prok-TUDOR*||515883502_?-><-751573933_?||751573186_?->751574026_RVT+HNH-><-751566262_?<-751573935_?<-751573937_? 738911651 Peptidase_M10->?-><-?||?->?-><-ParB-HTH||?-><-ParB-HTH+Prok-TUDOR*||XerD-> ParB-HTH+Prok-TUDOR - PLEUR7319_RS33990 266 bacteria>cyanobacteria Pleurocapsa sp. PCC 7319 hypothetical protein [Pleurocapsa sp. PCC 7319]. 738911648_Peptidase_M10->518333452_?-><-518333453_?||518333454_?->518333455_?-><-518333456_ParB-HTH||518333457_?-><-738911651_ParB-HTH+Prok-TUDOR*||738911654_XerD->518333460_?->518333461_?->738911661_?-><-738911478_?<-518333464_?<-518333465_? 256592473 <-ExoVII<-ParB-HTH+Prok-TUDOR*<-XerD ParB-HTH+Prok-TUDOR - Cyan8802_4571 265 bacteria>cyanobacteria Cyanothece sp. 
PCC 8802 hypothetical protein Cyan8802_4571 (plasmid) [Cyanothece sp. PCC 8802]. <-256592466_?<-256592467_?<-256592468_?<-256592469_?||256592470_?-><-256592471_?<-256592472_ExoVII<-256592473_ParB-HTH+Prok-TUDOR*<-256592474_XerD<-256592475_?||256592476_?-><-256592477_?||256592478_?-><-256592479_?<-256592480_? 752825464 <-ParA<-ParA<-?<-HNH||Primpol?->ParB-HTH+Prok-TUDOR*->?-><-?<-?<-HNH||?->AAA-ATPase-> ParB-HTH+Prok-TUDOR - CHRO_RS28535 265 bacteria>cyanobacteria Chroococcidiopsis thermalis hypothetical protein, partial [Chroococcidiopsis thermalis]. 504975986_?-><-504975987_?<-504975988_ParA<-752825462_ParA<-504975990_?<-504975991_HNH||752825463_Primpol?->752825464_ParB-HTH+Prok-TUDOR*->752825420_?-><-752825465_?<-504975995_?<-504975996_HNH||752825466_?->504975999_AAA-ATPase->504976000_?-> 17135837 ABC-><-?||ABC->?->ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - alr7299 264 bacteria>cyanobacteria Nostoc sp. PCC 7120 alr7299 (plasmid) [Nostoc sp. PCC 7120]. 17135830_?->17135831_?->17135832_?->17135833_ABC-><-17135834_?||17135835_ABC->17135836_?->17135837_ParB-HTH+Prok-TUDOR*->17135838_?->17135839_?-><-17135840_?<-17135841_?||17135842_?->17135843_?-><-17135844_? 546220971 <-METHYLASE<-SNF-helicase<-?<-ExoVII<-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - CWATWH8502_3723 264 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. <-546220957_?<-494514219_?<-546220961_?<-546220964_METHYLASE<-494514221_SNF-helicase<-546220966_?<-546220969_ExoVII<-546220971_ParB-HTH+Prok-TUDOR* 748136747 TPR->?-><-?<-?<-?||?->?->ParB-HTH+Prok-TUDOR*->?->?->?->?->ABC-> ParB-HTH+Prok-TUDOR - QH73_RS12040 262 bacteria>cyanobacteria Scytonema millei hypothetical protein, partial [Scytonema millei]. 
748136633_TPR->748136742_?-><-748136743_?<-748136744_?<-748136745_?||748136634_?->748136746_?->748136747_ParB-HTH+Prok-TUDOR*->748136748_?->748136635_?->748136636_?->748136637_?->748136638_ABC->748136639_?->748136640_?-> 748136457 RVT+HNH-><-?||?->?->ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - QH73_RS10490 260 bacteria>cyanobacteria Scytonema millei hypothetical protein, partial [Scytonema millei]. <-748136454_?||748136126_?-><-748136455_?||748136382_RVT+HNH-><-748136456_?||748136394_?->748136395_?->748136457_ParB-HTH+Prok-TUDOR*->748136396_?->748136397_?->748136458_?->748136459_?->748136398_?->748136460_?->748136461_?-> 442790849 METHYLASE-><-?<-?<-?<-?<-ParB-HTH+Prok-TUDOR*||?->?-><-?||Peptidase_M10-> ParB-HTH+Prok-TUDOR - Xen7305DRAFT_00000510 258 bacteria>cyanobacteria Xenococcus sp. PCC 7305 hypothetical protein Xen7305DRAFT_00000510 [Xenococcus sp. PCC 7305]. 442790842_?->442790843_?->442790844_METHYLASE-><-442790845_?<-442790846_?<-442790847_?<-442790848_?<-442790849_ParB-HTH+Prok-TUDOR*||442790850_?->442790851_?-><-442790852_?||442790853_Peptidase_M10-><-442790854_?||442790855_?-><-442790856_? 738540774 <-ParB-HTH+Prok-TUDOR*||?-><-?<-?||XerD-><-?<-?<-DDE ParB-HTH+Prok-TUDOR - KV40_RS29900 258 bacteria>cyanobacteria Myxosarcina sp. GI1 hypothetical protein [Myxosarcina sp. GI1]. <-738540756_?<-738540809_?<-738540759_?<-738540762_?<-738540765_?||738540768_?->738540771_?-><-738540774_ParB-HTH+Prok-TUDOR*||738540777_?-><-738540780_?<-738540783_?||738540812_XerD-><-738540785_?<-738540787_?<-738540789_DDE 737132827 ParA->HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->ParB-><-?<-?||?->Relaxase->ParB-HTH+Prok-TUDOR*->?->HNH->?-><-?||?->?-><-HISKIN ParB-HTH+Prok-TUDOR - FIS9431_RS31145 257 bacteria>cyanobacteria Fischerella sp. PCC 9431 hypothetical protein [Fischerella sp. PCC 9431]. 
652319800_ParA->652319802_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->652319809_ParB-><-652319810_?<-652319812_?||652319813_?->737132824_Relaxase->737132827_ParB-HTH+Prok-TUDOR*->652319814_?->652319815_HNH->652319816_?-><-652319817_?||737132828_?->652319819_?-><-652319821_HISKIN 737187200 DDE-><-?<-?<-?<-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - CAL7103_RS0100030 257 bacteria>cyanobacteria Calothrix sp. PCC 7103 hypothetical protein, partial [Calothrix sp. PCC 7103]. 737187199_DDE-><-518317934_?<-518317935_?<-518317936_?<-737187200_ParB-HTH+Prok-TUDOR*||518317938_?->518317939_?->648400969_?->518317941_?->737187201_?->737187203_?->518317944_?-> 648361686 TPR+CASPASE-><-?<-?<-STYKIN<-NACHT||ParB-HTH+Prok-TUDOR*->ParB-HTH-><-?<-?<-?||XerD-> ParB-HTH+Prok-TUDOR - PCC9339_RS0103785 253 bacteria>cyanobacteria Fischerella sp. PCC 9339 hypothetical protein [Fischerella sp. PCC 9339]. 737126431_?->737126434_?->737126438_TPR+CASPASE-><-515877411_?<-515877412_?<-737126440_STYKIN<-515877414_NACHT||648361686_ParB-HTH+Prok-TUDOR*->515877416_ParB-HTH-><-515877417_?<-515877418_?<-737126307_?||515877419_XerD->515877420_?-><-515877421_? 740179430 <-XerD<-RecD||?-><-ParB-HTH+Prok-TUDOR*<-HTH||DDE-><-ParB-HTH<-ParB-HTH ParB-HTH+Prok-TUDOR - SYN7509_RS26630 252 bacteria>cyanobacteria Synechocystis sp. PCC 7509 hypothetical protein, partial [Synechocystis sp. PCC 7509]. <-740179423_?<-740179426_?<-740179350_?<-497316171_?<-497316172_XerD<-740179427_RecD||655839503_?-><-740179430_ParB-HTH+Prok-TUDOR*<-740179432_HTH||497315944_DDE-><-740179434_ParB-HTH<-740179437_ParB-HTH<-497315734_?<-740179354_?||497316078_?-> 515383623 ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - UYC_RS0133720 242 bacteria>cyanobacteria Chlorogloeopsis fritschii hypothetical protein [Chlorogloeopsis fritschii]. 
<-515383601_?||515383603_?-><-515390425_?<-515383607_?||648395863_?->648395864_?->515383620_?->515383623_ParB-HTH+Prok-TUDOR*->515383626_?-><-515383628_?<-515383630_?<-648395865_?<-515383636_?<-515383639_?<-515383641_? 494523812 <-ExoVII<-?<-?<-ParB-HTH+Prok-TUDOR*<-XerD ParB-HTH+Prok-TUDOR - CWATWH0005_5485 238 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. 494523805_?-><-494523806_?<-494523807_?<-494523808_?<-494523809_ExoVII<-494523810_?<-494523811_?<-494523812_ParB-HTH+Prok-TUDOR*<-546228489_XerD 738540904 <-XerD||ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - KV40_RS30300 234 bacteria>cyanobacteria Myxosarcina sp. GI1 hypothetical protein [Myxosarcina sp. GI1]. 738540898_?->738541035_?-><-738540900_?||738541038_?-><-738540902_?<-738541041_?<-738541044_XerD||738540904_ParB-HTH+Prok-TUDOR*->738540906_?->738540907_?->738540908_?->738540910_?->738540911_?->738540912_?-><-738540914_? 764953510 ABC-><-?||ABC->?->ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - PCC7120DELTA_RS29870 230 bacteria>cyanobacteria Nostoc sp. PCC 7120 hypothetical protein, partial [Nostoc sp. PCC 7120]. 499309069_?->499309070_?->499309071_?->499309072_ABC-><-499309073_?||499309074_ABC->499309075_?->764953510_ParB-HTH+Prok-TUDOR*->764953405_?->499309078_?->764953409_?->499309080_?-><-499309084_?<-499309085_?||499309086_?-> 744453553 <-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR SP DA73_0201705 229 bacteria>cyanobacteria Tolypothrix bouteillei VB521301 hypothetical protein DA73_0201705, partial [Tolypothrix bouteillei VB521301]. <-744453520_?||744453521_?-><-744453522_?||744453523_?->744453524_?-><-744453525_?||744453526_?-><-744453553_ParB-HTH+Prok-TUDOR*<-744453527_?||744453528_?->744453529_?->744453530_?-> 738539875 <-ParA<-?<-?||?->HNH-><-ParB-HTH+Prok-TUDOR||?-><-ParB-HTH+Prok-TUDOR*<-?<-?||XerD-> ParB-HTH+Prok-TUDOR - KV40_RS28185 224 bacteria>cyanobacteria Myxosarcina sp. GI1 hypothetical protein [Myxosarcina sp. GI1]. 
<-738539861_ParA<-738539864_?<-738539995_?||738540017_?->738539867_HNH-><-738539870_ParB-HTH+Prok-TUDOR||738539872_?-><-738539875_ParB-HTH+Prok-TUDOR*<-738539878_?<-738540021_?||738540025_XerD-><-738539881_?<-738539884_?<-738539887_?<-738540029_? 754535969 <-ParB-HTH*||?->?-><-METHYLASE ParB-HTH SP CYAN7822_RS31780 204 bacteria>cyanobacteria Cyanothece sp. PCC 7822 hypothetical protein, partial [Cyanothece sp. PCC 7822]. <-503100216_?||503100217_?->503100218_?-><-503100219_?<-503100220_?<-754535782_?<-503100222_?<-754535969_ParB-HTH*||503100224_?->754535785_?-><-503100225_METHYLASE<-754535972_?||503100227_?->754535974_?-><-503100229_? 740027662 ParA?->ParB-HTH*-> ParB-HTH - IF77_RS0136485 189 bacteria>actinobacteria Streptomyces sp. NRRL F-5008 hypothetical protein, partial [Streptomyces sp. NRRL F-5008]. 664267443_?-><-664267445_?<-664267447_?||664267448_?->740027659_?->740027652_?->664267453_ParA?->740027662_ParB-HTH*-><-664267457_?<-740027664_?<-664267461_?||664267466_?->664267468_?-><-664267469_? 739896924 ParA?->ParB-HTH*-> ParB-HTH - C593_RS30785 185 bacteria>actinobacteria Streptomyces sp. CNT372 hypothetical protein, partial [Streptomyces sp. CNT372]. 739896922_?->517680486_?->517680487_ParA?->739896924_ParB-HTH*-><-517680489_? 754535993 Subtilisin->Subtilisin->?->?-><-DDE_3<-?||ParB-HTH*-> ParB-HTH - CYAN7822_RS32040 183 bacteria>cyanobacteria Cyanothece sp. PCC 7822 hypothetical protein, partial [Cyanothece sp. PCC 7822]. <-503100252_?||503100253_Subtilisin->754535989_Subtilisin->503100255_?->503100256_?-><-754535991_DDE_3<-503100257_?||754535993_ParB-HTH*->754535996_?->503100259_?->503100260_?->503100261_?->754535807_?->754535809_?->754535812_?-> 738911416 <-XerD||ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - PLEUR7319_RS33705 170 bacteria>cyanobacteria Pleurocapsa sp. PCC 7319 hypothetical protein, partial [Pleurocapsa sp. PCC 7319]. 
738911398_?->738911413_?-><-518333130_?<-518333131_?||518333132_?-><-648410481_XerD||738911416_ParB-HTH+Prok-TUDOR*-><-518333135_?||518333136_?-><-518333137_?||738911419_?->518333139_?->518333140_?-><-518333141_? 753864872 <-ParB-HTH* ParB-HTH - STA7437_RS22520 166 bacteria>cyanobacteria Stanieria cyanosphaera hypothetical protein, partial [Stanieria cyanosphaera]. 505024834_?->505024835_?-><-505024836_?||505024837_?->753864869_?->753864871_?->505024840_?-><-753864872_ParB-HTH*||753864821_?-><-505024843_?<-753864875_?<-505024845_?<-753864822_?<-505024847_?<-753864824_? 763312164 NACHT-><-ParB-HTH*||?->?-><-?||CASPASE->?->?-><-ABC ParB-HTH - OSC10802_RS39075 163 bacteria>cyanobacteria Oscillatoria sp. PCC 10802 hypothetical protein, partial [Oscillatoria sp. PCC 10802]. 516325480_?-><-763312157_?<-763312160_?<-763312162_?<-648406499_?<-516325486_?||516325488_NACHT-><-763312164_ParB-HTH*||763312165_?->648406500_?-><-516325491_?||516325492_CASPASE->763311553_?->516325494_?-><-516325495_ABC 357263645 Primpol?->ParB-HTH*-> ParB-HTH - CWATWH0003_2674t1 161 bacteria>cyanobacteria Crocosphaera watsonii WH 0003 hypothetical protein CWATWH0003_2674t1, partial [Crocosphaera watsonii WH 0003]. 357263644_Primpol?->357263645_ParB-HTH*-> 703170672 DDE-><-?||ParB-HTH+Prok-TUDOR*-><-?<-XerD||?-><-?<-?||CASPASE->CASPASE-> ParB-HTH+Prok-TUDOR - MAS10914_RS29250 158 bacteria>cyanobacteria Mastigocladopsis repens hypothetical protein, partial [Mastigocladopsis repens]. 515883494_?->515883495_?-><-515883496_?<-515883498_?||515883500_?->515883501_DDE-><-515883502_?||703170672_ParB-HTH+Prok-TUDOR*-><-515883503_?<-515883504_XerD||703170675_?-><-515883506_?<-515883507_?||703170678_CASPASE->703170682_CASPASE-> 753864885 <-ParB-HTH*<-?||?->?->?-><-AAA-ATPase ParB-HTH - STA7437_RS22675 154 bacteria>cyanobacteria Stanieria cyanosphaera hypothetical protein, partial [Stanieria cyanosphaera]. 
<-505024864_?<-505024865_?<-753864827_?<-753864828_?<-753864884_?<-505024869_?<-505024870_?<-753864885_ParB-HTH*<-505024872_?||505024873_?->505024874_?->505024875_?-><-505024876_AAA-ATPase||505024877_?->505024878_?-> 763120073 DDE-><-?<-?<-?||ParB-HTH*-> ParB-HTH - MICAE_RS13740 154 bacteria>cyanobacteria Microcystis aeruginosa hypothetical protein, partial [Microcystis aeruginosa]. <-488871383_?<-488871384_?<-513846817_?||488871388_DDE-><-488871389_?<-488871390_?<-763120071_?||763120073_ParB-HTH*-><-488871392_?||488871393_?-><-763120072_? 763350225 ParA->ParB->DDE_3->METHYLASE-><-HTH||ParB-HTH*-><-?||?->?->?->?->Peptidase_M10->Peptidase_M10-> ParB-HTH - MC7420_RS19625 153 bacteria>cyanobacteria Coleofasciculus chthonoplastes hypothetical protein, partial [Coleofasciculus chthonoplastes]. <-763350114_?||493033615_?->493033471_ParA->493033522_ParB->493033396_DDE_3->763350222_METHYLASE-><-493033334_HTH||763350225_ParB-HTH*-><-763350228_?||493033541_?->493033509_?->493033259_?->493033571_?->763350230_Peptidase_M10->493033595_Peptidase_M10-> 760034517 ParA->ParB-HTH*->NLPC-><-?||?->?->?->ABC-> ParB-HTH - ON27_RS00260 150 bacteria>actinobacteria Nocardia asiatica hypothetical protein, partial [Nocardia asiatica]. 760034450_?->760034453_?->760034456_?->760034458_?->760034460_?->760034462_?->760034465_ParA->760034517_ParB-HTH*->760034520_NLPC-><-760034522_?||760034467_?->760034470_?->760034472_?->760034475_ABC->760034525_?-> 696559281 <-ParB-HTH*<-ParA||NLPC-> ParB-HTH - FH09_RS0132835 147 bacteria>actinobacteria Nocardia seriolae hypothetical protein, partial [Nocardia seriolae]. 696559274_?->696559275_?->696559276_?->696559277_?-><-696559281_ParB-HTH*<-696559282_ParA||696559283_NLPC-><-696559278_?||696559279_?-><-696559284_?||696559280_?-> 738540713 <-ParB-HTH*||Tox-HNH-> ParB-HTH - KV40_RS29710 147 bacteria>cyanobacteria Myxosarcina sp. GI1 hypothetical protein, partial [Myxosarcina sp. GI1]. 
738540609_?-><-738540612_?<-738540711_?<-738540615_?<-738540618_?<-738540621_?<-738540623_?<-738540713_ParB-HTH*||738540626_Tox-HNH->738540629_?->738540632_?->738540635_?->738540637_?->738540640_?->738540715_?-> 750579664 <-NLPC<-?||ParA->ParB-HTH*-> ParB-HTH - ON33_RS24890 144 bacteria>actinobacteria Nocardia niigatensis hypothetical protein, partial [Nocardia niigatensis]. 750579613_?-><-750579614_?||750579660_?-><-750579661_?<-750579662_NLPC<-750579663_?||750579615_ParA->750579664_ParB-HTH*-><-750579616_?<-750579665_?<-750579617_?<-750579618_?||750579666_?->750579619_?-><-750579620_? 754053711 <-ParB-HTH* ParB-HTH - PSE7367_RS19220 143 bacteria>cyanobacteria Pseudanabaena sp. PCC 7367 hypothetical protein, partial [Pseudanabaena sp. PCC 7367]. <-754052469_?<-504958988_?<-754053638_?||504959113_?-><-504959114_?||504959115_?-><-504959117_?<-754053711_ParB-HTH*<-754053640_?||754053641_?->754053713_?->504959122_?->504959123_?->754053642_?-><-504959125_? 737153646 <-XerD||?->?-><-?<-ParB-HTH* ParB-HTH - FIS9605_RS38655 141 bacteria>cyanobacteria Fischerella sp. PCC 9605 hypothetical protein, partial [Fischerella sp. PCC 9605]. 652338671_?-><-737153643_?<-737153645_?<-652338672_XerD||652338673_?->652338674_?-><-652338675_?<-737153646_ParB-HTH*<-652338676_?<-652338677_?<-737153594_?<-652338678_?<-652338679_?<-652338680_?<-652338681_? 737859558 Primpol?->ParB-HTH*-> ParB-HTH - CWATWH0003_RS12730 140 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein, partial [Crocosphaera watsonii]. 737859555_Primpol?->737859558_ParB-HTH*-> 738538560 <-ParB-HTH*<-?||?->URI-> ParB-HTH - KV40_RS24275 139 bacteria>cyanobacteria Myxosarcina sp. GI1 hypothetical protein, partial [Myxosarcina sp. GI1]. 
<-738538553_?||738538419_?->738538555_?->738538420_?->738538556_?-><-738538558_?<-738538422_?<-738538560_ParB-HTH*<-738538426_?||738538428_?->738538430_URI->738538431_?->738538432_?->738538433_?->738538438_?-> 737126563 ParB-HTH*->?->HNH-> ParB-HTH - PCC9339_RS35280 137 bacteria>cyanobacteria Fischerella sp. PCC 9339 hypothetical protein, partial [Fischerella sp. PCC 9339]. <-515877580_?<-648361739_?||515877582_?-><-648361741_?<-648361742_?<-515877586_?||737126559_?->737126563_ParB-HTH*->515877589_?->515877590_HNH->737126567_?->737126570_?->515877593_?->515877594_?->515877595_?-> 737126426 <-ParB-HTH*||?->?->?->?->TPR+CASPASE-> ParB-HTH - PCC9339_RS35040 135 bacteria>cyanobacteria Fischerella sp. PCC 9339 hypothetical protein, partial [Fischerella sp. PCC 9339]. <-515877396_?<-515877397_?<-737126422_?||515877399_?-><-515877400_?||515877401_?->515877402_?-><-737126426_ParB-HTH*||515877404_?->737126429_?->737126431_?->737126434_?->737126438_TPR+CASPASE-><-515877411_?<-515877412_? 738995333 <-ParB-HTH* ParB-HTH - IF64_RS0121970 133 bacteria>actinobacteria Prauserella rugosa hypothetical protein, partial [Prauserella rugosa]. <-663758541_?<-738995333_ParB-HTH*<-663758543_?<-663758544_?<-738995336_?<-663758546_?<-663758547_?<-663758548_?<-663758549_? 737232780 <-P-loop<-?<-ParB-HTH* ParB-HTH - Q362_RS21450 132 bacteria>proteobacteria>deltaproteobacteria Desulfobulbus elongatus hypothetical protein, partial [Desulfobulbus elongatus]. <-654866285_?<-654866206_?<-654867413_?<-737232778_?<-654867414_P-loop<-737232779_?<-737232780_ParB-HTH*<-654867416_?<-737232781_?<-654867417_?<-654867418_?||737232782_?->654867419_?->654867420_?-> 738617228 ParA->ParB-HTH*->NLPC-><-?||?-><-?||ABC-> ParB-HTH - K940_RS26470 131 bacteria>actinobacteria Nocardia sp. CNY236 hypothetical protein, partial [Nocardia sp. CNY236]. 
<-738617225_?||655027042_?->738617078_?->655027043_?->655027044_?->738617081_?->655027045_ParA->738617228_ParB-HTH*->655027047_NLPC-><-655027048_?||655027049_?-><-655027050_?||738617231_ABC->655027051_?-><-655027052_? 750252315 ParB-HTH*-> ParB-HTH - CFLAV_RS00535 131 bacteria>verrucomicrobia Pedosphaera parvula hypothetical protein, partial [Pedosphaera parvula]. 494654662_?-><-494654663_?<-750252179_?||494654665_?->494654666_?->494654667_?-><-750252182_?||750252315_ParB-HTH*->494654670_?->494654671_?-><-494654672_?<-494654673_?<-494654675_?<-494654676_?<-494654677_? 663146205 ParB-HTH*-> ParB-HTH SP IF21_RS0142715 130 bacteria>actinobacteria Streptomyces capuensis hypothetical protein, partial [Streptomyces capuensis]. 663146205_ParB-HTH*-> 748262016 METHYLASE-><-?||?-><-?||STYKIN->?-><-NLPC||ParB-HTH*-> ParB-HTH - ON31_RS25060 130 bacteria>actinobacteria Nocardia otitidiscaviarum hypothetical protein, partial [Nocardia otitidiscaviarum]. 748261919_METHYLASE-><-748261920_?||748261921_?-><-748261922_?||748262013_STYKIN->748261923_?-><-748262015_NLPC||748262016_ParB-HTH*-><-748261924_?<-748261925_?<-659884470_?<-748262017_?||748261926_?->748261927_?->748261928_?-> 759915784 <-ParB-HTH*||NLPC-><-?<-STYKIN||?-><-?||?-><-METHYLASE ParB-HTH - NOTIT_RS39125 130 bacteria>actinobacteria Nocardia otitidiscaviarum hypothetical protein, partial [Nocardia otitidiscaviarum]. <-659884466_?<-659884467_?<-659884468_?||659884469_?->659884470_?->659884471_?->659884472_?-><-759915784_ParB-HTH*||659884474_NLPC-><-659884475_?<-759915786_STYKIN||659884477_?-><-659884478_?||659884479_?-><-659884480_METHYLASE 750537552 ParA->ParB-HTH*->NLPC-> ParB-HTH - ON40_RS04835 129 bacteria>actinobacteria Nocardia jiangxiensis hypothetical protein, partial [Nocardia jiangxiensis]. 
750537034_?-><-750537037_?<-750537039_?||750537549_?->750537042_?-><-750537044_?||750537045_ParA->750537552_ParB-HTH*->750537554_NLPC-><-750537556_?<-750537559_?<-750537047_?||750537049_?-><-750537051_?||750537052_?-> 760001072 ParA->ParB-HTH*->NLPC-><-?||?->?->?->ABC-> ParB-HTH - ON19_RS00180 129 bacteria>actinobacteria Nocardia abscessus hypothetical protein, partial [Nocardia abscessus]. 760000905_?->760000908_?->760000911_?->760000915_?->760000918_?->760000923_?->760001069_ParA->760001072_ParB-HTH*->760001075_NLPC-><-760001078_?||760001082_?->760000926_?->760000928_?->760001085_ABC->760001088_?-> 752791755 P-loop->?->?->?->?->ParB-HTH*-> ParB-HTH - SYN6312_RS05530 128 bacteria>cyanobacteria Synechococcus sp. PCC 6312 hypothetical protein, partial [Synechococcus sp. PCC 6312]. 504936766_?->504936767_?->504936768_P-loop->504936769_?->752791199_?->752791201_?->752791754_?->752791755_ParB-HTH*->504936774_?->504936775_?->504936776_?->504936777_?->752791756_?->752791203_?->504936779_?-> 753864865 <-ParB-HTH* ParB-HTH - STA7437_RS22355 128 bacteria>cyanobacteria Stanieria cyanosphaera hypothetical protein, partial [Stanieria cyanosphaera]. 753864857_?->753864859_?->505024810_?->505024811_?-><-753864861_?||753864816_?->753864862_?-><-753864865_ParB-HTH*||505024816_?->505024817_?-><-753864817_?||505024818_?->505024819_?->505024820_?->505024821_?-> 750531062 <-NLPC||ParA->ParB-HTH*-> ParB-HTH - ON43_RS29550 127 bacteria>actinobacteria Nocardia concava hypothetical protein, partial [Nocardia concava]. <-750530893_?<-750530895_?||750531058_?-><-750530897_?||750531059_?-><-750531061_NLPC||750530898_ParA->750531062_ParB-HTH*-><-750531064_?<-750531065_?<-750530901_?<-750531067_?||750531068_?->750530903_?->750530905_?-> 740087909 ParB-HTH*-> ParB-HTH - IO34_RS50180 119 bacteria>actinobacteria Streptosporangium roseum hypothetical protein, partial [Streptosporangium roseum]. 
665605533_?->665605536_?-><-665605540_?<-665605542_?<-665605545_?||665605548_?->665605551_?->740087909_ParB-HTH*->740087912_?-><-665605560_?<-665605563_?||665605566_?->665605569_?->665605571_?->665605577_?-> 737153947 ParB-HTH*-> ParB-HTH - FIS9605_RS39545 118 bacteria>cyanobacteria Fischerella sp. PCC 9605 hypothetical protein, partial [Fischerella sp. PCC 9605]. <-652339343_?||652339344_?-><-652338464_?<-652339345_?<-652339346_?<-737153945_?||737153916_?->737153947_ParB-HTH*-><-652339348_?<-652339349_?<-652339350_?<-652334111_?<-652339351_?<-652339352_?||652339353_?-> 749816534 <-XerD<-ParB-HTH* ParB-HTH - BEGALDRAFT_RS07320 118 bacteria>proteobacteria>gammaproteobacteria Beggiatoa alba hypothetical protein, partial [Beggiatoa alba]. 488762020_?->488762021_?->488762022_?-><-488762024_?<-749816070_?<-488762025_?<-749816530_XerD<-749816534_ParB-HTH*<-488762035_?<-488762037_?<-488762039_?<-488762042_?||488762044_?->488762046_?->488762048_?-> 738450871 HISKIN->?-><-?||ParB-HTH*-> ParB-HTH - C789_RS15355 116 bacteria>cyanobacteria Microcystis aeruginosa hypothetical protein, partial [Microcystis aeruginosa]. 488833846_?->488833849_?->488833851_HISKIN->488833853_?-><-738450868_?||738450871_ParB-HTH*->738450872_?->488833865_?->488833867_?->488833868_?->488833870_?->488833871_?->488833876_?-> 749816531 <-XerD<-ParB-HTH* ParB-HTH - BEGALDRAFT_RS07180 114 bacteria>proteobacteria>gammaproteobacteria Beggiatoa alba hypothetical protein, partial [Beggiatoa alba]. <-488761975_?||488761978_?-><-749816065_?<-749816066_?||488761981_?->488761983_?-><-749816530_XerD<-749816531_ParB-HTH*<-488761989_?<-488761991_?<-488761994_?<-749816067_?<-488761996_?||749816532_?->488762000_?-> 749816550 ParB-HTH*-> ParB-HTH - BEGALDRAFT_RS07765 106 bacteria>proteobacteria>gammaproteobacteria Beggiatoa alba hypothetical protein, partial [Beggiatoa alba]. 
488762199_?->488762200_?->488762201_?-><-749816548_?<-488762203_?||749816549_?-><-488762206_?||749816550_ParB-HTH*-><-488762211_?||488762213_?->749816551_?->488762217_?->749816552_?->488762222_?->488762224_?-> 750077672 <-ParB-HTH*<-?<-?<-?<-?<-?<-RecT ParB-HTH - F784_RS22230 102 bacteria>deinococci Deinococcus apachensis hypothetical protein, partial [Deinococcus apachensis]. 518415046_?->518415047_?-><-518415048_?<-518415049_?||518415050_?-><-518415051_?<-648640482_?<-750077672_ParB-HTH*<-518415055_?<-518415056_?<-518415057_?<-518415058_?<-518415059_?<-750077674_RecT<-518415061_? 764930116 <-ParB-HTH* ParB-HTH - NOS7107_RS16750 100 bacteria>cyanobacteria Nostoc sp. PCC 7107 hypothetical protein, partial [Nostoc sp. PCC 7107]. <-504927031_?<-504927032_?||504927033_?->504927034_?->504927035_?->504927036_?->504927037_?-><-764930116_ParB-HTH*||764930119_?->504927039_?-><-504923882_?||764930121_?->504927041_?-><-504927043_?||504927044_?-> 738452741 <-ParB-HTH*<-?||DDE_Tnp_1_2-> ParB-HTH - I546_RS16505 98 bacteria>actinobacteria Mycobacterium kansasii hypothetical protein, partial [Mycobacterium kansasii]. 738447877_?-><-738452475_?<-738452477_?<-738452479_?<-738452481_?<-738452483_?<-738452485_?<-738452741_ParB-HTH*<-738452743_?||738448501_DDE_Tnp_1_2->738447877_?->738452486_?-><-738452745_?<-738452488_?<-738452490_? 754047437 <-Relaxase<-?||?->?->?-><-ParB-HTH*<-ParA ParB-HTH - EMTOL_RS20635 97 bacteria>bacteroidetes Emticicia oligotrophica hypothetical protein, partial [Emticicia oligotrophica]. 504839219_?-><-504839220_?<-754047436_Relaxase<-754047400_?||504839223_?->504839224_?->504839225_?-><-754047437_ParB-HTH*<-504839227_ParA||754047402_?->504839229_?->504839230_?->504839231_?->504839232_?->754047438_?-> 736383566 <-ParB-HTH*<-?<-?<-?<-HNH ParB-HTH - H565_RS13795 96 bacteria>deinococci Deinococcus murrayi hypothetical protein, partial [Deinococcus murrayi]. 
<-653253874_?<-736383564_?<-653253875_?<-736383449_?<-653253877_?<-736383452_?<-653253879_?<-736383566_ParB-HTH*<-653253881_?<-736383569_?<-653253883_?<-736383572_HNH<-736383575_?<-736383455_?<-653253885_? 750079283 <-ParB-HTH*<-?<-RNAse_T ParB-HTH - Q424_RS15920 94 bacteria>deinococci Deinococcus MULTISPECIES: hypothetical protein, partial [Deinococcus]. <-658539106_?<-648446909_?<-646631546_?<-516480771_?<-516480772_?<-516480773_?<-516480774_?<-750079283_ParB-HTH*<-516480776_?<-658539109_RNAse_T<-516480778_?<-516480779_?<-516480780_?<-658539110_?<-516480782_? 748248067 ParA->ParB-HTH*->NLPC-><-?<-?||?->ABC-> ParB-HTH - ON21_RS29990 92 bacteria>actinobacteria Nocardia araoensis hypothetical protein, partial [Nocardia araoensis]. <-748248018_?||748248020_?-><-748248021_?||748248023_?-><-748248025_?<-748248065_?||748248027_ParA->748248067_ParB-HTH*->748248069_NLPC-><-748248029_?<-748248070_?||748248030_?->748248032_ABC->748248072_?-><-748248034_? 750589548 ParA->ParB-HTH*->NLPC-><-?||?->?->ABC-> ParB-HTH - ON29_RS29715 92 bacteria>actinobacteria Nocardia exalbida hypothetical protein, partial [Nocardia exalbida]. <-750589525_?||750589526_?->750589528_?->750589529_?->750589530_?->750589531_?->750589547_ParA->750589548_ParB-HTH*->750589550_NLPC-><-750589551_?||750589533_?->750589534_?->750589535_ABC->750589552_?->750589553_?-> 750595695 ParA->ParB-HTH*-> ParB-HTH - A3IC_RS57885 92 bacteria>actinobacteria Streptomyces scabrisporus hypothetical protein, partial [Streptomyces scabrisporus]. 750595635_?-><-750595694_?||648527278_?-><-648527279_?<-750595636_?<-648527280_?||522043141_ParA->750595695_ParB-HTH*->648527281_?-><-522043143_?<-522043144_?||522043145_?->522043146_?->750595696_?-><-522043148_? 783222125 ParB-HTH*->?->P-loop-> ParB-HTH - VR65_RS19825 87 bacteria>proteobacteria>deltaproteobacteria Desulfobulbaceae bacterium BRH_c16a hypothetical protein, partial [Desulfobulbaceae bacterium BRH_c16a]. 
<-783221951_?<-783221953_?||783221955_?->783221963_?->783221965_?->783222121_?->783221967_?->783222125_ParB-HTH*->783221970_?->783221973_P-loop->783221975_?->783221977_?->783221979_?->783221981_?->783221983_?-> 750503395 ParA->ParB-HTH*->NLPC-><-?<-?||?->ABC-> ParB-HTH - ON34_RS28430 84 bacteria>actinobacteria Nocardia pneumoniae hypothetical protein, partial [Nocardia pneumoniae]. <-750503299_?||750503301_?->750503303_?->750503393_?->750503304_?->750503307_?->750503310_ParA->750503395_ParB-HTH*->750503400_NLPC-><-750503403_?<-750503313_?||750503314_?->750503407_ABC->750503409_?->750503411_?-> 749816938 ParB-HTH*->XerD->?-><-?||?-><-ParA ParB-HTH - BEGALDRAFT_RS17130 82 bacteria>proteobacteria>gammaproteobacteria Beggiatoa alba hypothetical protein, partial [Beggiatoa alba]. <-488779792_?<-488779796_?<-488779800_?||488779802_?->488779804_?->749816937_?->488779809_?->749816938_ParB-HTH*->749816939_XerD->488779813_?-><-488779816_?||488779820_?-><-488779822_ParA<-488779825_?<-488779827_? 738659689 <-NLPC<-?*<-ParA - SP D892_RS40665 80 bacteria>actinobacteria Nocardia sp. BMG51109 hypothetical protein, partial [Nocardia sp. BMG51109]. <-640147789_?||640147792_?-><-640147795_?||640147797_?->738659684_?->640147803_?-><-738659687_NLPC<-738659689_?*<-640147810_ParA||640147813_?-><-738659692_?<-738658134_?||738659694_?->640147822_?->738659696_?-> 750304467 ParA->ParB-HTH*-> ParB-HTH - B156_RS30075 79 bacteria>bacteroidetes Spirosoma luteum hypothetical protein, partial [Spirosoma luteum]. 517452061_?->517452062_?->517452063_?-><-648569338_?||750304465_?->517452066_?->517452067_ParA->750304467_ParB-HTH*->517452069_?->517452070_?->517452071_?->517452072_?->517452073_?->517452074_?->517452075_?-> 737785149 <-ParB-HTH*<-ParA<-?||?->?->?->Mrr_cat-REase-> ParB-HTH - B056_RS38110 78 bacteria>actinobacteria Frankia sp. BCU110501 hypothetical protein, partial [Frankia sp. BCU110501]. 
<-517329402_?<-517329403_?<-517329404_?<-517329405_?<-737785146_?||517329407_?->517329408_?-><-737785149_ParB-HTH*<-648548434_ParA<-737785138_?||517329414_?->737785152_?->737785154_?->517329417_Mrr_cat-REase->517329418_?->
750404616 ParA->ParB-HTH*->NLPC-> ParB-HTH - ON44_RS04455 74 bacteria>actinobacteria Nocardia vinacea hypothetical protein, partial [Nocardia vinacea]. <-750404615_?||750404607_?->750404608_ParA->750404616_ParB-HTH*->750404617_NLPC->750404609_?->750404618_?->750404610_?-><-750404611_?<-750404612_?<-750404619_?
750463730 ParA->?*->NLPC-> - SP ON41_RS05075 74 bacteria>actinobacteria Nocardia transvalensis hypothetical protein, partial [Nocardia transvalensis]. 750463727_?->750463728_?->750463729_?-><-750463710_?||750463712_?-><-750463714_?||750463716_ParA->750463730_?*->750463731_NLPC-><-750463717_?<-750463718_?<-750463732_?<-750463733_?
763155621 ParB-HTH*-> ParB-HTH - MICAH_RS23570 71 bacteria>cyanobacteria Microcystis aeruginosa hypothetical protein, partial [Microcystis aeruginosa]. 488887368_?->763155621_ParB-HTH*->
# 13;
389403119 zf-CHC2->ParB-HTH*-> ParB-HTH - DespoDRAFT_03587 228 bacteria>proteobacteria>deltaproteobacteria Desulfobacter postgatei 2ac9 hypothetical protein DespoDRAFT_03587 [Desulfobacter postgatei 2ac9]. 389403112_?->389403113_?->389403114_?->389403115_?->389403116_?-><-389403117_?||389403118_zf-CHC2->389403119_ParB-HTH*->389403120_?-><-389403121_?||389403122_?-><-389403123_?||389403124_?->389403125_?->389403126_?->
386428514 <-ABC<-?||?-><-?||?->?-><-XerD<-ParB-HTH* ParB-HTH - BegalDRAFT_1454 199 bacteria>proteobacteria>gammaproteobacteria Beggiatoa alba B18LD hypothetical protein BegalDRAFT_1454 [Beggiatoa alba B18LD].
<-386428507_ABC<-386428508_?||386428509_?-><-386428510_?||386428511_?->386428512_?-><-386428513_XerD<-386428514_ParB-HTH*<-386428515_?<-386428516_?<-386428517_?<-386428518_?<-386428519_?<-386428520_?||386428521_?-> 386428542 <-XerD<-ParB-HTH* ParB-HTH - BegalDRAFT_1483 199 bacteria>proteobacteria>gammaproteobacteria Beggiatoa alba B18LD hypothetical protein BegalDRAFT_1483 [Beggiatoa alba B18LD]. 386428535_?->386428536_?->386428537_?->386428538_?-><-386428539_?<-386428540_?<-386428541_XerD<-386428542_ParB-HTH*<-386428543_?<-386428544_?<-386428545_?<-386428546_?<-386428547_?||386428548_?->386428549_?-> 501880589 <-ParB-HTH*<-zf-CHC2 ParB-HTH - HRM2_RS03160 193 bacteria>proteobacteria>deltaproteobacteria Desulfobacterium autotrophicum DNA methylase [Desulfobacterium autotrophicum]. 752604261_?-><-506383132_?<-506383133_?<-752603663_?<-752604262_?<-752603629_?<-501880588_?<-501880589_ParB-HTH*<-501880590_zf-CHC2<-501880591_?<-506383135_?<-752604263_?<-752603664_?<-506383137_?||506383138_?-> 501881616 zf-CHC2->ParB-HTH*-> ParB-HTH - HRM2_RS03920 193 bacteria>proteobacteria>deltaproteobacteria Desulfobacterium autotrophicum DNA methylase [Desulfobacterium autotrophicum]. <-506386491_?<-501878029_?||752604848_?->506386493_?->752604849_?->501880591_?->506386494_zf-CHC2->501881616_ParB-HTH*->501880784_?->752603629_?->506386495_?-><-506386498_?<-752604046_?<-506386500_?<-752604850_? 506384528 zf-CHC2->ParB-HTH*-> ParB-HTH - HRM2_RS11855 193 bacteria>proteobacteria>deltaproteobacteria Desulfobacterium autotrophicum DNA methylase [Desulfobacterium autotrophicum]. 506384523_?->506384524_?->752604527_?->506384526_?->506384527_?->501880591_?->501880590_zf-CHC2->506384528_ParB-HTH*->506384529_?->752603629_?->752604528_?-><-506384531_?<-506384532_?<-506384533_?<-752604529_? 506384753 zf-CHC2->ParB-HTH*->?->?-><-?<-?<-?<-ABC ParB-HTH - HRM2_RS12975 193 bacteria>proteobacteria>deltaproteobacteria Desulfobacterium autotrophicum DNA methylase [Desulfobacterium autotrophicum]. 
<-752604070_?<-506386680_?<-506386681_?<-506386682_?<-752604884_?||506386684_?->506384754_zf-CHC2->506384753_ParB-HTH*->506386685_?->752603629_?-><-752604885_?<-506386687_?<-506386688_?<-752604886_ABC<-752604887_? 748757961 <-DDE||ParB-HTH*-> ParB-HTH - DESPODRAFT_RS17490 192 bacteria>proteobacteria>deltaproteobacteria Desulfobacter postgatei DNA methylase [Desulfobacter postgatei]. 490176783_?->490176786_?->490176788_?->490176791_?->490176793_?->490176795_?-><-748757960_DDE||748757961_ParB-HTH*->748757962_?->748757963_?->490176812_?-><-748757964_?<-748757650_?<-748757828_?||748758336_?-> 527022036 <-ParB-HTH*<-zf-CHC2 ParB-HTH - DSMV_RS01030 188 bacteria>proteobacteria>deltaproteobacteria Desulfococcus multivorans hypothetical protein [Desulfococcus multivorans]. <-527022035_?<-527022036_ParB-HTH*<-527022037_zf-CHC2 654515559 zf-CHC2->ParB-HTH*-> ParB-HTH - H167_RS0106170 188 bacteria>proteobacteria>deltaproteobacteria delta proteobacterium PSCGC 5296 hypothetical protein [delta proteobacterium PSCGC 5296]. 654515557_?->654515558_zf-CHC2->654515559_ParB-HTH*->654515560_?->654515561_?->654515769_?-><-654515770_? 654517946 DDE->zf-CHC2->ParB-HTH*-> ParB-HTH - H169_RS0112525 188 bacteria>proteobacteria>deltaproteobacteria delta proteobacterium PSCGC 5451 hypothetical protein [delta proteobacterium PSCGC 5451]. 740511296_?-><-654517944_?||740511298_DDE->654515558_zf-CHC2->654517946_ParB-HTH*->654515560_?->654517947_?->654517948_?-> 571788483 ParB-HTH*-> ParB-HTH - OMM_03956 187 bacteria>proteobacteria>deltaproteobacteria Candidatus Magnetoglobus multicellularis str. Araruama DNA modification methylase family protein [Candidatus Magnetoglobus multicellularis str. Araruama]. 571788483_ParB-HTH*->571788484_?->571788485_?-><-571788486_? 506386923 zf-CHC2->ParB-HTH*-> ParB-HTH - HRM2_RS24175 183 bacteria>proteobacteria>deltaproteobacteria Desulfobacterium autotrophicum DNA methylase [Desulfobacterium autotrophicum]. 
<-506386915_?<-752604933_?<-506386917_?||506386919_?-><-506386920_?||506386921_?->506386922_zf-CHC2->506386923_ParB-HTH*->506386924_?->506386925_?-><-752604934_?<-506386928_?||752604935_?->506386930_?->752604936_?-> # 12; 428682367 ParA->ParA->?-><-?<-HTH||ParB-HTH*-> ParB-HTH - Anacy_5838 389 bacteria>cyanobacteria Anabaena cylindrica PCC 7122 hypothetical protein Anacy_5838 (plasmid) [Anabaena cylindrica PCC 7122]. <-428682360_?<-428682361_?||428682362_ParA->428682363_ParA->428682364_?-><-428682365_?<-428682366_HTH||428682367_ParB-HTH*->428682368_?->428682369_?->428682370_?-><-428682371_?<-428682372_?<-428682373_?||428682374_?-> 505141386 <-RVT+HNH<-?||?->RVT+HNH-><-HTH||ParB-HTH*->?-><-?||?->DDE-> ParB-HTH - CYLST_RS31085 386 bacteria>cyanobacteria Cylindrospermum stagnale hypothetical protein [Cylindrospermum stagnale]. 505141381_?->752562984_?-><-505141382_RVT+HNH<-505141383_?||505141384_?->505141382_RVT+HNH-><-505141385_HTH||505141386_ParB-HTH*->505141387_?-><-752562986_?||505141389_?->505141390_DDE->505141391_?->505141392_?->505141393_?-> 755115685 ParA->ParA->?-><-?<-HTH||ParB-HTH*-> ParB-HTH - ANACY_RS28675 378 bacteria>cyanobacteria Anabaena cylindrica hypothetical protein [Anabaena cylindrica]. <-755115707_?||755115708_?->505177176_ParA->505177177_ParA->505177178_?-><-505177179_?<-505177180_HTH||755115685_ParB-HTH*->505177182_?->505177183_?->505177184_?-><-505177185_?<-505177186_?<-505177187_?||505177188_?-> 515520582 <-ParB-HTH*||HTH->?-><-?<-ParA<-ParA||SNF-helicase-> ParB-HTH - ANA7108_RS0126375 356 bacteria>cyanobacteria Anabaena sp. PCC 7108 hypothetical protein [Anabaena sp. PCC 7108]. 
515520575_?-><-515520576_?<-515520577_?<-648412724_?<-515520579_?||755141323_?-><-515520581_?<-515520582_ParB-HTH*||515520583_HTH->515520584_?-><-515520585_?<-755140633_ParA<-755141324_ParA||515520588_SNF-helicase->515520589_?-> 504992580 RVT+HNH->HNH->?->RVT+HNH->?-><-ParB-HTH*||HTH->?->?->?->DDE_Tnp_ISAZ013->?-><-HISKIN ParB-HTH - OSC7112_RS32535 333 bacteria>cyanobacteria Oscillatoria nigro-viridis hypothetical protein [Oscillatoria nigro-viridis]. <-504992574_?||753868290_?->504987353_RVT+HNH->504992576_HNH->753868291_?->504992578_RVT+HNH->504992579_?-><-504992580_ParB-HTH*||753868292_HTH->753868293_?->504992583_?->753868294_?->504987920_DDE_Tnp_ISAZ013->753868295_?-><-504992584_HISKIN 740464136 TPR+CASPASE->HISKIN->HISKIN->?->?-><-HTH||ParB-HTH*-> ParB-HTH - TOL9009_RS37215 330 bacteria>cyanobacteria [Scytonema hofmanni] UTEX B 1581 hypothetical protein [[Scytonema hofmanni] UTEX B 1581]. 740464164_?->657929202_TPR+CASPASE->740464166_HISKIN->657929205_HISKIN->657929206_?->740464168_?-><-657929210_HTH||740464136_ParB-HTH*->657929212_?->657929213_?->657929215_?-><-740464169_?<-740464171_?<-657929217_?<-657929219_? 751574204 <-DDE||?-><-?<-?<-HTH||ParB-HTH*-> ParB-HTH - SD81_RS36485 330 bacteria>cyanobacteria Tolypothrix campylonemoides hypothetical protein [Tolypothrix campylonemoides]. <-751574214_?<-751574216_?<-751574198_DDE||751574218_?-><-751574200_?<-751574202_?<-751574220_HTH||751574204_ParB-HTH*->751574206_?->751574207_?-><-751568714_? 407266820 <-McrB||ParA->ParB->?->?-><-?<-HTH||ParB-HTH*->?->?-><-?<-?||?-><-AAA-ATPase ParB-HTH - FDUTEX481_04373 329 bacteria>cyanobacteria Tolypothrix sp. PCC 7601 hypothetical protein FDUTEX481_04373 [Tolypothrix sp. PCC 7601]. 
<-407266813_McrB||407266814_ParA->407266815_ParB->407266816_?->407266817_?-><-407266818_?<-407266819_HTH||407266820_ParB-HTH*->407266821_?->407266822_?-><-407266823_?<-407266824_?||407266825_?-><-407266826_AAA-ATPase||407266827_?-> 196179143 ParA->ParB-><-?||DDE_3->METHYLASE-><-HTH||ParB-HTH*-><-?||?->?->?->?->Peptidase_M10->Peptidase_M10-> ParB-HTH - MC7420_4124 326 bacteria>cyanobacteria Coleofasciculus chthonoplastes PCC 7420 hypothetical protein MC7420_4124 [Coleofasciculus chthonoplastes PCC 7420]. 196179077_?->196179180_ParA->196179206_ParB-><-196179080_?||196179138_DDE_3->196179087_METHYLASE-><-196179110_HTH||196179143_ParB-HTH*-><-196179150_?||196179219_?->196179199_?->196179072_?->196179235_?->196179093_Peptidase_M10->196179247_Peptidase_M10-> 797212730 <-ABC<-?<-?||ParA->ParB-><-?<-HTH||ParB-HTH*->?->?->?-><-AAA-ATPase ParB-HTH - FDUTEX481_RS32380 302 bacteria>cyanobacteria Tolypothrix sp. PCC 7601 hypothetical protein, partial [Tolypothrix sp. PCC 7601]. <-797212658_ABC<-797212659_?<-797212660_?||797212661_ParA->797212662_ParB-><-797212663_?<-797212729_HTH||797212730_ParB-HTH*->797212664_?->797212665_?->797212666_?-><-797212667_AAA-ATPase||797212668_?->797212731_?-><-797212669_? 737188608 <-HTH||ParB-HTH*-> ParB-HTH - CAL7103_RS0139200 297 bacteria>cyanobacteria Calothrix sp. PCC 7103 hypothetical protein, partial [Calothrix sp. PCC 7103]. <-518325591_?<-518325592_?||737188605_?-><-518325594_?||648401859_?-><-518325597_?<-737188607_HTH||737188608_ParB-HTH*-><-518325600_?||518325601_?->518325602_?-><-737188609_?<-518325604_?||518325605_?-><-737188610_? 737187623 HISKIN-><-?<-ABC<-?<-ParB-HTH*||HTH-><-?<-?<-?<-?<-?||STYKIN-> ParB-HTH - CAL7103_RS51515 278 bacteria>cyanobacteria Calothrix sp. PCC 7103 hypothetical protein, partial [Calothrix sp. PCC 7103]. 
737187620_?-><-518318456_?||518318457_?->737187621_HISKIN-><-518318459_?<-737187276_ABC<-518318461_?<-737187623_ParB-HTH*||648401019_HTH-><-648401020_?<-737187625_?<-518318467_?<-737187626_?<-518318469_?||737187628_STYKIN-> # 9; 499190814 <-ParB-HTH*||?-><-?<-?<-?<-?<-?<-HNH ParB-HTH - DR_1719 287 bacteria>deinococci Deinococcus radiodurans hypothetical protein [Deinococcus radiodurans]. <-15806715_?<-15806716_?||15806717_?->15806718_?-><-15806719_?<-15806720_?<-15806721_?<-499190814_ParB-HTH*||15806723_?-><-15806724_?<-15806725_?<-15806726_?<-15806727_?<-15806728_?<-15806729_HNH 516480931 <-ParB-HTH* ParB-HTH - Q424_RS0102995 280 bacteria>deinococci Deinococcus MULTISPECIES: hypothetical protein [Deinococcus]. <-516480924_?<-516480925_?<-646632761_?<-646632768_?||750079322_?-><-516480929_?<-516480930_?<-516480931_ParB-HTH*||516480932_?-><-760145289_?<-516480934_?<-516480935_?<-646632779_?||516480937_?->516480938_?-> 736351733 <-ParB-HTH* ParB-HTH - BS32_RS0115445 280 bacteria>deinococci Deinococcus radiodurans hypothetical protein [Deinococcus radiodurans]. <-654876149_?||499190809_?->499190810_?-><-499190811_?<-499190812_?<-736324533_?<-736351733_ParB-HTH*||499190815_?-> 736394879 DDE-><-?<-?||ParB-HTH*-> ParB-HTH - Q322_RS0113730 275 bacteria>deinococci Deinococcus frigens hypothetical protein [Deinococcus frigens]. 736394882_DDE-><-657676348_?<-657676349_?||736394879_ParB-HTH*->657676351_?-><-736394883_?||657676353_?->736394885_?->657676355_?->657676356_?->657676357_?-> 736389644 <-ParB-HTH* ParB-HTH - Q319_RS0103110 274 bacteria>deinococci Deinococcus marmoris hypothetical protein [Deinococcus marmoris]. <-657678463_?<-657678464_?<-657678466_?<-736389672_?<-657678468_?||736389675_?-><-657678470_?<-736389644_ParB-HTH*||657678472_?->657678473_?-> 746727627 ParB-HTH*-> ParB-HTH - QR90_RS12370 274 bacteria>deinococci Deinococcus swuensis hypothetical protein [Deinococcus swuensis]. 
746727614_?->746727617_?->746727619_?->746727621_?->746730019_?->746727623_?-><-746727625_?||746727627_ParB-HTH*-><-746727629_?<-746727631_?<-746727633_?<-746730022_?<-746727635_?<-746730024_?||746730026_?-> 760094872 <-ParB-HTH* ParB-HTH - H564_RS20900 271 bacteria>deinococci Deinococcus ficus hypothetical protein, partial [Deinococcus ficus]. 760094866_?-><-760094869_?<-653261436_?<-551072753_?<-760094715_?<-653261449_?<-653261454_?<-760094872_ParB-HTH*||653261460_?-><-653261465_?<-760094875_?<-653261469_?<-760094878_?<-760094881_?||653261475_?-> 760136477 ParB-HTH*-> ParB-HTH - DEINO_RS19540 271 bacteria>deinococci Deinococcus sp. 2009 hypothetical protein, partial [Deinococcus sp. 2009]. 760136474_?->760136476_?->654853319_?->760094875_?->551072738_?-><-551072740_?<-551072742_?||760136477_ParB-HTH*->551072746_?->551072748_?->654853321_?->551072751_?->551072753_?->551072755_?->760094869_?-> 653294307 ParB-HTH*-> ParB-HTH - T313_RS0115790 208 bacteria>deinococci Deinococcus radiodurans hypothetical protein, partial [Deinococcus radiodurans]. <-499190815_?||653294307_ParB-HTH*-> # 7; 499635648 ParB-HTH*->?->?->?-><-?||?-><-?<-NACHT ParB-HTH AAA_23 AVA_RS26435 548 bacteria>cyanobacteria Anabaena variabilis hypothetical protein [Anabaena variabilis]. <-499635640_?<-499635641_?<-499635644_?<-752818090_?<-499635645_?<-752818120_?||499635647_?->499635648_ParB-HTH*->752818121_?->499635650_?->499635651_?-><-499635652_?||499635654_?-><-499635655_?<-499635656_NACHT 499308918 NACHT-> - AAA_23 PCC7120DELTA_RS29045 544 bacteria>cyanobacteria Nostoc sp. PCC 7120 hypothetical protein [Nostoc sp. PCC 7120]. 499308911_NACHT-><-499308912_?<-499308913_?||499308914_?-><-499308915_?<-499308916_?<-764953466_?<-499308918_?*<-764953350_?||499308920_?->499308921_?->499308922_?->499308923_?->499308924_?->499308925_?-> 493212749 <-DDE<-?<-?<-?<-ParB-HTH* ParB-HTH AAA_23 NSP_RS09855 538 bacteria>cyanobacteria Nodularia spumigena hypothetical protein [Nodularia spumigena]. 
<-493212742_?<-493212743_?<-493212744_?<-493212745_DDE<-493212746_?<-493212747_?<-493212748_?<-493212749_ParB-HTH*<-493212750_?<-493212751_?||493212753_?->493212755_?->493212756_?->493212757_?->493212758_?-> 501381627 Nostoc - DUF2869 NPUN_RS35395 532 bacteria>cyanobacteria Nostoc punctiforme hypothetical protein [Nostoc punctiforme]. PCC 499635690 <-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon<-?||?-><-?<-?<-?<-?||?*->?-><-?<-NACHT - AAA_23 AVA_RS26650 526 bacteria>cyanobacteria Anabaena variabilis hypothetical protein [Anabaena variabilis]. <-499635683_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon<-752818124_?||499635685_?-><-499635686_?<-752818125_?<-499635688_?<-499635689_?||499635690_?*->752818126_?-><-499635692_?<-752818127_NACHT<-499635694_?<-499635695_?||499635696_?->499635697_?-> 501381520 Nostoc - Mitofilin NPUN_RS34800 526 bacteria>cyanobacteria Nostoc punctiforme hypothetical protein [Nostoc punctiforme]. PCC 501381574 Peptidase_M10-> - Prominin NPUN_RS35095 524 bacteria>cyanobacteria Nostoc punctiforme hypothetical protein [Nostoc punctiforme]. 501381565_Peptidase_M10-><-753811023_?||501381568_?->753811024_?-><-501381571_?<-501381572_?<-753811025_?<-501381574_?*<-501381575_?||501381577_?->501381578_?->753810958_?->501381580_?->501381581_?->753810959_?-> # 7; 494515216 <-ParB-HTH*||ABC-> ParB-HTH DUF3102 CWATWH0005_3991 333 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. <-494520597_?<-494515216_ParB-HTH*||546223097_ABC-> 515877202 RVT+HNH-><-?<-?<-?<-?<-?<-?||ParB-HTH*-> ParB-HTH SP+DUF3102 PCC9339_RS0102660 331 bacteria>cyanobacteria Fischerella sp. PCC 9339 hypothetical protein [Fischerella sp. PCC 9339]. 
648361600_RVT+HNH-><-515877195_?<-515877196_?<-515877197_?<-515877198_?<-515877200_?<-648361604_?||515877202_ParB-HTH*->515877203_?->515877204_?->515877205_?->515877206_?->515877207_?->515877208_?->737126249_?-> 504967303 ParB-HTH*->?->?->ABC-> ParB-HTH DUF3102 CHRO_RS11615 328 bacteria>cyanobacteria Chroococcidiopsis thermalis hypothetical protein [Chroococcidiopsis thermalis]. <-504967295_?||504967296_?->504967297_?-><-752824248_?||504967299_?-><-504967300_?<-504967301_?||504967303_ParB-HTH*->504967304_?->504967305_?->504967306_ABC->752824249_?->752824857_?->752824858_?->752824859_?-> 748141416 <-ABC<-?<-?<-ParB-HTH* ParB-HTH DUF3102 QH73_RS39240 327 bacteria>cyanobacteria Scytonema millei hypothetical protein [Scytonema millei]. <-748142456_?<-748142457_?<-748142458_?<-748142459_?<-748141414_ABC<-748141415_?<-748142460_?<-748141416_ParB-HTH*||748141417_?->748141418_?-><-748141419_?||748141420_?-><-748142461_?<-748141421_?||748141422_?-> 499306042 <-ParB-HTH*||?->ABC-> ParB-HTH SP+DUF3102 PCC7120DELTA_RS15095 326 bacteria>cyanobacteria Nostoc sp. PCC 7120 hypothetical protein [Nostoc sp. PCC 7120]. <-499306035_?<-499306036_?<-764952782_?<-764951162_?<-499306039_?||499306040_?->499306041_?-><-499306042_ParB-HTH*||499306043_?->764952785_ABC-><-499306045_?||764951163_?->499306048_?->499306049_?->499306050_?-> 497233611 <-ABC<-?<-?||?->?-><-ABC<-ABC||ParB-HTH*-> ParB-HTH DUF3102 CY51472DRAFT_RS0220405 325 bacteria>cyanobacteria Cyanothece MULTISPECIES: hypothetical protein [Cyanothece]. <-737891168_ABC<-497233604_?<-497233605_?||497233607_?->497233608_?-><-497233609_ABC<-497233610_ABC||497233611_ParB-HTH*-><-497233614_?<-497233615_?<-497233616_?<-497233617_?||497233618_?->497233619_?->501330428_?-> 505002935 ParB-HTH*-> ParB-HTH DUF3102 GLO7428_RS18185 321 bacteria>cyanobacteria Gloeocapsa sp. PCC 7428 hypothetical protein [Gloeocapsa sp. PCC 7428]. 
<-505002927_?<-754508026_?<-505002929_?<-754508640_?<-505002931_?<-754508641_?<-754508642_?||505002935_ParB-HTH*->505002936_?->505002937_?->505002938_?->505002939_?->505002940_?->505002941_?->505002942_?-> # 7; 559034988 ParA->ParB-HTH*-><-?<-ParA ParB-HTH - M878_RS91610 330 bacteria>actinobacteria Streptomyces roseochromogenus hypothetical protein [Streptomyces roseochromogenus]. 559034980_?->665860281_?->739880871_?->665860283_?->559034984_?-><-559034985_?||665860286_ParA->559034988_ParB-HTH*-><-559034989_?<-739880872_ParA<-559034991_?<-559034992_?<-559034993_?<-559034994_?<-739880873_? 514922043 SFII-helicase->?-><-?||ParA->?-><-ParB-HTH*<-ParA ParB-HTH - STRAU_RS27165 323 bacteria>actinobacteria Streptomyces aurantiacus hypothetical protein [Streptomyces aurantiacus]. 514922036_?->739810306_?->739810308_SFII-helicase->739810311_?-><-514922040_?||739810313_ParA->514922042_?-><-514922043_ParB-HTH*<-514922044_ParA 703383829 ParB-HTH*-><-ParB<-ParA ParB-HTH - H293_RS0145400 317 bacteria>actinobacteria Streptomyces canus hypothetical protein [Streptomyces canus]. <-703383826_?||522154481_?->518968523_?->703383829_ParB-HTH*-><-703383827_ParB<-518968526_ParA||703383828_?-><-703383830_?<-518968529_?||655409978_?->518968531_?-> 751920075 ParA->ParB-HTH*-><-?<-ParA<-?<-Mrr_cat-REase||?->DDE-> ParB-HTH - SVTN_RS39960 311 bacteria>actinobacteria Streptomyces vietnamensis hypothetical protein, partial [Streptomyces vietnamensis]. <-751919914_?<-751919915_?<-751920072_?<-751920073_?<-751920074_?<-751919916_?||751919917_ParA->751920075_ParB-HTH*-><-751919918_?<-751920076_ParA<-751919919_?<-751920077_Mrr_cat-REase||751920078_?->751920079_DDE-><-751919920_? 654253752 ParB-HTH*-> ParB-HTH - H298_RS0133730 307 bacteria>actinobacteria Streptomyces sp. CNQ865 hypothetical protein [Streptomyces sp. CNQ865]. 
654253746_?->739888683_?->654253747_?->654253748_?->654253749_?->654253750_?-><-654253751_?||654253752_ParB-HTH*-><-654253753_?<-654253754_?||739888709_?-><-654253755_?<-739888710_?<-739888686_?<-654253758_? 759461293 <-TrwC||HNH-><-?<-?<-?<-ParB-HTH*<-ParA<-?<-?<-?<-?<-?<-ASCH ParB-HTH - IH54_RS0129315 306 bacteria>actinobacteria Streptomyces sp. NRRL F-5123 hypothetical protein [Streptomyces sp. NRRL F-5123]. <-671538860_?||759461292_?-><-759461305_TrwC||759461307_HNH-><-671538868_?<-671538870_?<-671538872_?<-759461293_ParB-HTH*<-671538876_ParA<-671538879_?<-671538881_?<-671538883_?<-759461309_?<-671538889_?<-671538891_ASCH 663184423 Mrr_cat-REase-><-?||ParB-HTH*-><-?<-?||TrwC-><-?||TrwC-> ParB-HTH - OO47_RS31770 285 bacteria>actinobacteria Streptomyces bikiniensis hypothetical protein, partial [Streptomyces bikiniensis]. 702585104_?->702585107_?-><-663184414_?||663184417_Mrr_cat-REase-><-663184420_?||663184423_ParB-HTH*-><-663184426_?<-663184430_?||739772506_TrwC-><-663184437_?||702585117_TrwC->663184446_?-><-663184449_? # 5; 755052115 <-ParB-HTH*<-ParA ParB-HTH - TR46_RS22930 297 bacteria>actinobacteria Streptacidiphilus carbonis hypothetical protein [Streptacidiphilus carbonis]. 755052226_?->755052227_?->755052106_?->755052108_?->755052109_?->755052110_?->755052112_?-><-755052115_ParB-HTH*<-755052229_ParA<-755052118_?||755052120_?->755052122_?->755052125_?->755052126_?->755052128_?-> 702687171 ParA->ParB-HTH*-> ParB-HTH SP OO55_RS21730 278 bacteria>actinobacteria Streptomyces griseus hypothetical protein [Streptomyces griseus]. 702687145_?->702687150_?->702687153_?->702687158_?->702687500_?->702687504_?->702687166_ParA->702687171_ParB-HTH*->702687175_?->702687178_?->702687182_?->702687185_?->702687188_?->702687192_?->702687196_?-> 759768668 DDE_Tnp_1_2->?-><-?<-RVT+HNH<-?||ParA->ParB-HTH*-><-?<-ASCH ParB-HTH - BI06_RS43855 271 bacteria>actinobacteria Kitasatospora sp. MBT66 hypothetical protein [Kitasatospora sp. MBT66]. 
759752930_?->759752926_DDE_Tnp_1_2->759752923_?-><-759768662_?<-759768875_RVT+HNH<-759768665_?||759768878_ParA->759768668_ParB-HTH*-><-759768881_?<-759768883_ASCH<-759768670_?<-759768673_?||759768886_?-><-759768675_?<-759768678_? 654253933 ParA->ParB-HTH*-> ParB-HTH - H298_RS0135225 259 bacteria>actinobacteria Streptomyces sp. CNQ865 hypothetical protein [Streptomyces sp. CNQ865]. 739888835_?-><-739888841_?<-654253928_?<-654253929_?<-654253930_?<-654253931_?||654253932_ParA->654253933_ParB-HTH*-><-654253934_?<-654253935_?<-654253936_?<-654253937_?<-654253938_?<-654253939_?<-739888843_? 654240343 ParA->ParB-HTH*-> ParB-HTH - B121_RS0125930 258 bacteria>actinobacteria Streptomyces sp. CNH099 hypothetical protein [Streptomyces sp. CNH099]. 654240336_?->654240337_?->654240338_?->739933181_?-><-739933148_?||654240341_?->654240342_ParA->654240343_ParB-HTH*-><-654240410_?<-739933184_?<-739933187_?<-654240344_?<-654240345_?<-654240346_?<-739933189_? # 4; 755027075 ParB-HTH*-> ParB-HTH - TR49_RS10465 378 bacteria>actinobacteria Streptacidiphilus melanogenes hypothetical protein [Streptacidiphilus melanogenes]. <-755027065_?||755027067_?-><-755027070_?<-755027072_?||755027342_?->755027343_?->755027073_?->755027075_ParB-HTH*->755027078_?-><-755027080_?||755027083_?->755027085_?->755027095_?->755027098_?->755027347_?-> 755021339 <-DDE||?-><-?||?-><-ParB-HTH* ParB-HTH - TR48_RS34600 363 bacteria>actinobacteria Streptacidiphilus neutrinimicus hypothetical protein [Streptacidiphilus neutrinimicus]. 755021332_?-><-755021333_DDE||755021334_?-><-739750968_?||755021337_?-><-755021339_ParB-HTH*<-755021340_?<-755021357_?<-755021358_?<-755021342_?<-755021345_?||755021347_?-><-755021349_? 755026932 ParB-HTH*->?-><-?<-?<-HNH ParB-HTH SP TR49_RS10120 363 bacteria>actinobacteria Streptacidiphilus melanogenes hypothetical protein [Streptacidiphilus melanogenes]. 
755026918_?->755026921_?-><-755027301_?<-755026924_?||755027302_?->755026926_?->755026928_?->755026932_ParB-HTH*->755027305_?-><-755027308_?<-755026935_?<-755026937_HNH<-755026940_?||755027311_?-><-755026944_? 755027016 <-ABC||?->?->?->?-><-?||ParB-HTH*-> ParB-HTH - TR49_RS10340 363 bacteria>actinobacteria Streptacidiphilus melanogenes hypothetical protein [Streptacidiphilus melanogenes]. <-755027008_?<-755027331_ABC||755027010_?->755027012_?->755027334_?->755027337_?-><-755027014_?||755027016_ParB-HTH*->755027339_?->755027019_?->755027021_?->755027023_?-><-755027025_?<-755027027_?||755027029_?-> # 4; 505031549 ParB-HTH*->?->?-><-HISKIN<-HISKIN<-HISKIN ParB-HTH - CYAN10605_RS03945 312 bacteria>cyanobacteria Cyanobacterium aponinum hypothetical protein [Cyanobacterium aponinum]. 505031542_?-><-505031543_?<-754511752_?||505031545_?-><-754511999_?<-505031547_?<-505031548_?||505031549_ParB-HTH*->505031550_?->505031551_?-><-505031552_HISKIN<-505031553_HISKIN<-505031554_HISKIN<-505031555_?||754512000_?-> 770470161 <-ParB-HTH*<-?<-?<-DnaJ ParB-HTH - GM3708_3465 305 bacteria>cyanobacteria Geminocystis sp. NIES-3708 hypothetical protein GM3708_3465 [Geminocystis sp. NIES-3708]. 770470154_?->770470155_?-><-770470156_?||770470157_?->770470158_?->770470159_?-><-770470160_?<-770470161_ParB-HTH*<-770470162_?<-770470163_?<-770470164_DnaJ||770470165_?->770470166_?-><-770470167_?||770470168_?-> 515865463 STYKIN->?->?-><-ParB-HTH* ParB-HTH - SYN6308_RS19250 304 bacteria>cyanobacteria Geminocystis herdmanii hypothetical protein [Geminocystis herdmanii]. <-648410422_?<-515865456_?<-515865457_?<-515865458_?||750163354_STYKIN->515865460_?->515865462_?-><-515865463_ParB-HTH*||515865464_?->750163356_?->515865466_?-><-515865467_?||515865468_?->648410423_?->515865470_?-> 770473153 HISKIN->?->?->?->?->?-><-?||ParB-HTH*-> ParB-HTH - GM3709_2810 300 bacteria>cyanobacteria Geminocystis sp. NIES-3709 hypothetical protein GM3709_2810 [Geminocystis sp. NIES-3709]. 
770473146_HISKIN->770473147_?->770473148_?->770473149_?->770473150_?->770473151_?-><-770473152_?||770473153_ParB-HTH*-><-770473154_?||770473155_?->770473156_?->770473157_?-><-770473158_?<-770473159_?||770473160_?-> # 4; 759896111 ParB-HTH*-> ParB-HTH - OBACDRAFT_RS02835 287 bacteria>verrucomicrobia Diplosphaera colitermitum hypothetical protein [Diplosphaera colitermitum]. 759896105_?->759896193_?-><-759896106_?<-759896107_?<-759896108_?||759896109_?->759896110_?->759896111_ParB-HTH*->759896112_?->759896113_?->759896114_?->759896115_?->759896116_?->759896117_?->759896118_?-> 759901356 <-ParB-HTH* ParB-HTH - OBACDRAFT_RS17200 283 bacteria>verrucomicrobia Diplosphaera colitermitum hypothetical protein [Diplosphaera colitermitum]. <-759901354_?<-759901356_ParB-HTH*<-759901359_?||759901361_?->759901362_?->759896106_?-><-759896185_?<-759901365_?<-759901367_? 497194662 ParB-HTH*-> ParB-HTH - OPIT5_RS20290 280 bacteria>verrucomicrobia Opitutaceae bacterium TAV5 hypothetical protein [Opitutaceae bacterium TAV5]. <-494601358_?<-494601357_?<-497194668_?<-497194667_?<-497194666_?<-497194665_?||497194664_?->497194662_ParB-HTH*->497194661_?->497194660_?->497194659_?->497194658_?->497194657_?->497194656_?->497194655_?-> 645069929 <-ParB-HTH*<-?<-?<-?||?->?->ABC-> ParB-HTH - OPIT5_RS27705 280 bacteria>verrucomicrobia Opitutaceae bacterium TAV5 hypothetical protein [Opitutaceae bacterium TAV5]. <-497194655_?<-497194656_?<-497194657_?<-497194658_?<-497194659_?<-497194660_?<-497194661_?<-645069929_ParB-HTH*<-497197845_?<-497197846_?<-497197847_?||497197848_?->497197849_?->494605492_ABC->497197850_?-> # 4; 504989405 <-STYKIN||?->?-><-?<-?||?->?->ParB-HTH*-> ParB-HTH Mitofilin OSC7112_RS14000 284 bacteria>cyanobacteria Oscillatoria nigro-viridis hypothetical protein [Oscillatoria nigro-viridis]. 
<-753867507_STYKIN||504992649_?->753866729_?-><-753867508_?<-504989402_?||504989403_?->504989404_?->504989405_ParB-HTH*->504989406_?->504989407_?->504989408_?->753867509_?->504989410_?->753867511_?->504989412_?-> 427981701 ParB-HTH*->?->?->?-><-?<-?<-?||STYKIN-> ParB-HTH DUF885 Ple7327_4170 282 bacteria>cyanobacteria Pleurocapsa sp. PCC 7327 hypothetical protein Ple7327_4170 [Pleurocapsa sp. PCC 7327]. <-427981694_?<-427981695_?<-427981696_?<-427981697_?||427981698_?->427981699_?->427981700_?->427981701_ParB-HTH*->427981702_?->427981703_?->427981704_?-><-427981705_?<-427981706_?<-427981707_?||427981708_STYKIN-> 516328499 DDE_Tnp_IS1->?->?->?->?->?->?->ParB-HTH*-> ParB-HTH - OSC10802_RS0123170 257 bacteria>cyanobacteria Oscillatoria sp. PCC 10802 hypothetical protein [Oscillatoria sp. PCC 10802]. 648406977_DDE_Tnp_IS1->763312939_?->763312942_?->516328495_?->516328496_?->516328497_?->516328498_?->516328499_ParB-HTH*->516328500_?->516328501_?->516328502_?->516328503_?->516328505_?->763312944_?->763312947_?-> 752746526 ParB-HTH*->?->?->?-><-?<-?<-?||STYKIN-> ParB-HTH DUF885 PLE7327_RS20050 255 bacteria>cyanobacteria Pleurocapsa minor hypothetical protein [Pleurocapsa minor]. 504958489_?-><-504958490_?<-752745292_?<-504958492_?<-504958493_?||504958495_?->504958496_?->752746526_ParB-HTH*->504958498_?->504958499_?->752746528_?-><-504958501_?<-504958502_?<-504958503_?||752746530_STYKIN-> # 3; 558889206 <-ParB-HTH*<-ParA? ParB-HTH - M877_RS79680 346 bacteria>actinobacteria Streptomyces niveus hypothetical protein [Streptomyces niveus]. 665865665_?->558889203_?-><-665865667_?||558889205_?-><-558889206_ParB-HTH*<-665865668_ParA?<-739935100_?<-739935101_?<-558889210_?<-558889211_?<-739935102_?||558889213_?-> 498331267 <-ParB-HTH*<-ParA? ParB-HTH - STREPS4_RS33205 345 bacteria>actinobacteria Streptomyces sp. S4 hypothetical protein [Streptomyces sp. S4]. 
498331253_?->498331255_?->764840581_?->498331260_?-><-764840583_?||498331263_?-><-498331265_?<-498331267_ParB-HTH*<-764840585_ParA?<-764840587_?<-498331273_?<-498331274_?<-498331276_?<-648267387_?||498331278_?-> 663309700 ParA?->ParB-HTH*-> ParB-HTH - IF45_RS0127995 343 bacteria>actinobacteria Streptomyces albidoflavus hypothetical protein [Streptomyces albidoflavus]. <-663309687_?||663309689_?->663309691_?->663309693_?->663309695_?->663309696_?->663309698_ParA?->663309700_ParB-HTH*-><-663309702_?<-663309703_?<-663309705_?<-739786609_? # 3; 739808094 <-ParB-HTH*<-ParA<-?<-?<-?||?->?->URI-> ParB-HTH SP IG72_RS0133105 328 bacteria>actinobacteria Streptomyces MULTISPECIES: hypothetical protein [Streptomyces]. 739778362_?->662754654_?->662754655_?-><-739808094_ParB-HTH*<-739808096_ParA<-662754660_?<-662754663_?<-662754664_?||662754666_?->739808103_?->662754669_URI-> 505393262 ParB-HTH*-> ParB-HTH - F750_RS32695 326 bacteria>actinobacteria Streptomyces sp. PAMC26508 hypothetical protein [Streptomyces sp. PAMC26508]. <-505393252_?||505393253_?-><-505393255_?||753981584_?-><-505393258_?<-753981639_?||505393261_?->505393262_ParB-HTH*->505393263_?->753981587_?-><-505393265_?||753981589_?-><-753981641_?<-505393268_?<-505393269_? 499350288 DDE_Tnp_1_2->DDE_Tnp_1_2-><-ParB-HTH*<-ParA||?-><-?||?-><-?<-DDE_Tnp_1_2 ParB-HTH - SCP2.04c 325 bacteria>actinobacteria Streptomyces coelicolor hypothetical protein [Streptomyces coelicolor]. 21233965_DDE_Tnp_1_2->21233966_DDE_Tnp_1_2-><-499350288_ParB-HTH*<-21233968_ParA||21233969_?-><-21233970_?||21233971_?-><-21233972_?<-21233973_DDE_Tnp_1_2<-21233974_? # 3; 664543262 <-ParB-HTH* ParB-HTH - IH47_RS0134150 296 bacteria>actinobacteria Streptomyces sp. NRRL F-5702 hypothetical protein [Streptomyces sp. NRRL F-5702]. 
664543246_?-><-664543249_?||695837320_?-><-664543253_?<-695837322_?||664543257_?->664543260_?-><-664543262_ParB-HTH*<-664543266_?<-664543269_?||664543272_?->664543275_?-> 748778099 ParB-HTH*-> ParB-HTH - QR77_RS41180 292 bacteria>actinobacteria Streptomyces sp. 150FB hypothetical protein [Streptomyces sp. 150FB]. 748778199_?->748778097_?->748778200_?-><-748778201_?||748778202_?->748778203_?->748778098_?->748778099_ParB-HTH*-><-748778100_?||748778101_?-><-748778102_?||748778204_?->748778103_?-><-748778104_?||748778105_?-> 739996264 NLPC->?->?->?->?->?-><-ParB-HTH* ParB-HTH SP IG93_RS28555 288 bacteria>actinobacteria Streptomyces sp. NRRL F-6628 hypothetical protein [Streptomyces sp. NRRL F-6628]. 739996257_?->739996259_NLPC->557418823_?->739996260_?->739996261_?->739996262_?->739996263_?-><-739996264_ParB-HTH*<-739996269_?<-739996265_?||739996270_?-> # 3; 501518403 <-ParB-HTH* ParB-HTH - ANAEK_RS11660 204 bacteria>proteobacteria>deltaproteobacteria Anaeromyxobacter sp. K hypothetical protein [Anaeromyxobacter sp. K]. <-499739874_?||501518397_?-><-501518398_?<-501518399_?||501518400_?-><-501518401_?<-501518402_?<-501518403_ParB-HTH*<-501518404_?||501518405_?-><-501518407_?<-501518408_?<-501518409_?<-501518410_?<-501518411_? 501750516 <-ParB-HTH* ParB-HTH - A2CP1_RS12130 204 bacteria>proteobacteria>deltaproteobacteria Anaeromyxobacter dehalogenans hypothetical protein [Anaeromyxobacter dehalogenans]. <-499739874_?||501750493_?-><-501750497_?<-501750502_?||501750508_?-><-501750511_?<-501518402_?<-501750516_ParB-HTH*<-501750519_?||501750522_?-><-501518407_?<-501750538_?<-501750543_?<-501750549_?<-501750556_? 775300647 ParB-HTH*-> ParB-HTH HrpB2 PSR1_03440 204 bacteria>proteobacteria>deltaproteobacteria Anaeromyxobacter sp. PSR-1 hypothetical protein PSR1_03440 [Anaeromyxobacter sp. PSR-1]. 
775300640_?->775300641_?->775300642_?->775300643_?->775300644_?-><-775300645_?||775300646_?->775300647_ParB-HTH*->775300648_?-> # 3; 501117833 <-ParB-HTH* ParB-HTH - AM1_RS31280 140 bacteria>cyanobacteria Acaryochloris marina hypothetical protein [Acaryochloris marina]. <-501117027_?<-501117825_?<-501117826_?<-501117827_?<-501117828_?||753958425_?->501117831_?-><-501117833_ParB-HTH*||501117834_?->753958545_?->501117838_?->501117839_?->501117840_?-><-501117841_?||753958547_?-> 501118686 <-ParB-HTH* ParB-HTH - AM1_RS35145 140 bacteria>cyanobacteria Acaryochloris marina hypothetical protein [Acaryochloris marina]. <-501118694_?<-753959417_?<-501118692_?<-501118690_?<-501118689_?||753958425_?->501118687_?-><-501118686_ParB-HTH*||501118684_?->501118683_?->501118682_?-><-501118681_?||753959431_?->501118678_?-><-501118677_? 501119208 DDE-><-?||?->?-><-?||ParB-HTH*-> ParB-HTH - AM1_RS36990 140 bacteria>cyanobacteria Acaryochloris marina hypothetical protein [Acaryochloris marina]. <-753960122_?||753960222_?->753958881_DDE-><-753960126_?||501119205_?->501119207_?-><-501119206_?||501119208_ParB-HTH*-><-501119209_?<-753960137_?<-753960225_?<-753960141_?<-501119214_?||501119215_?->501119216_?-> # 2; 546230520 ASCH+ParB-HTH+Prok-TUDOR*-> ASCH+ParB-HTH+Prok-TUDOR ASCH CWATWH0005_934 498 bacteria>cyanobacteria Crocosphaera watsonii Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB [Crocosphaera watsonii]. 546230519_?->546230520_ASCH+ParB-HTH+Prok-TUDOR*-> 357263649 Primpol?->ASCH+ParB-HTH+Prok-TUDOR*-> ASCH+ParB-HTH+Prok-TUDOR ASCH CWATWH0003_2673b1 492 bacteria>cyanobacteria Crocosphaera watsonii WH 0003 hypothetical protein CWATWH0003_2673b1, partial [Crocosphaera watsonii WH 0003]. 357263647_?->357263648_Primpol?->357263649_ASCH+ParB-HTH+Prok-TUDOR*-> # 2; 664288086 <-ParB-HTH*<-ParA? ParB-HTH - IG73_RS0132720 409 bacteria>actinobacteria Streptomyces halstedii hypothetical protein [Streptomyces halstedii]. 
664288217_?->664288079_?->739855292_?->664288082_?->664288084_?-><-664288086_ParB-HTH*<-664288088_ParA?<-739855299_?<-739855301_?<-664288097_?<-664288100_?<-739855303_?||664288106_?-> 664363832 ParA?->ParB-HTH*-> ParB-HTH - IF95_RS0132470 409 bacteria>actinobacteria Streptomyces varsoviensis hypothetical protein [Streptomyces varsoviensis]. <-664363810_?||664363813_?->664363816_?->664363819_?->740118370_?->740118372_?->664363829_ParA?->664363832_ParB-HTH*-><-664363835_?<-664363837_?<-664363839_?<-664363842_?<-664363845_? # 2; 497232009 ParB-HTH+Prok-TUDOR*->?->DDE_Tnp1->?->?->?-><-RDRP ParB-HTH+Prok-TUDOR - CY51472DRAFT_RS0223475 378 bacteria>cyanobacteria Cyanothece MULTISPECIES: hypothetical protein [Cyanothece]. <-737891482_?<-639854764_?<-497232014_?<-497232013_?<-497232012_?<-497232011_?<-497232010_?||497232009_ParB-HTH+Prok-TUDOR*->497232008_?->497232007_DDE_Tnp1->497232006_?->501330974_?->501330975_?-><-497232003_RDRP||501330977_?-> 495553174 ParB-HTH+Prok-TUDOR*->?->?->DDE_Tnp1-> ParB-HTH+Prok-TUDOR SP CY0110_RS21885 371 bacteria>cyanobacteria Cyanothece sp. CCY0110 hypothetical protein [Cyanothece sp. CCY0110]. <-495553167_?<-495553168_?||495553169_?-><-495553170_?<-495553171_?<-495553172_?<-495553173_?||495553174_ParB-HTH+Prok-TUDOR*->737833010_?->495553176_?->495553177_DDE_Tnp1->495553178_?-> # 2; 727520012 ParA->ParB-HTH*-> ParB-HTH - AA75_RS00975 374 bacteria>actinobacteria Kitasatospora sp. MBT63 hypothetical protein [Kitasatospora sp. MBT63]. <-727520004_?<-727520015_?||727520007_?-><-727520018_?<-727520021_?<-727520024_?||727520029_ParA->727520012_ParB-HTH*-> 759753188 <-ParB-HTH*<-ParA ParB-HTH - BI06_RS00860 374 bacteria>actinobacteria Kitasatospora sp. MBT66 hypothetical protein [Kitasatospora sp. MBT66]. 759753168_?->759753171_?->759753592_?->759753175_?->759753178_?->759753182_?->759753185_?-><-759753188_ParB-HTH*<-759753594_ParA||759753596_?->759753598_?-><-759753193_?||759753601_?-><-759753197_?<-759753200_? 
# 2; 306986392 <-ParB-HTH*||?-><-METHYLASE ParB-HTH SP Cyan7822_6496 361 bacteria>cyanobacteria Cyanothece sp. PCC 7822 conserved hypothetical protein (plasmid) [Cyanothece sp. PCC 7822]. <-306986385_?||306986386_?->306986387_?-><-306986388_?<-306986389_?<-306986390_?<-306986391_?<-306986392_ParB-HTH*||306986393_?-><-306986394_METHYLASE<-306986395_?||306986396_?->306986397_?-><-306986398_?||306986399_?-> 306986431 Subtilisin->METHYLASE->?->?-><-?||ParB-HTH*-> ParB-HTH - Cyan7822_6546 324 bacteria>cyanobacteria Cyanothece sp. PCC 7822 conserved hypothetical protein (plasmid) [Cyanothece sp. PCC 7822]. <-306986424_?<-306986425_?||306986426_Subtilisin->306986427_METHYLASE->306986428_?->306986429_?-><-306986430_?||306986431_ParB-HTH*->306986432_?->306986433_?->306986434_?-><-306986435_?||306986436_?-><-306986437_?<-306986438_? # 2; 494514846 <-ParB-HTH* ParB-HTH - CWATDRAFT_RS06740 347 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. <-494514838_?||494514839_?-><-494514840_?<-494514841_?<-494514842_?||757157200_?-><-494513614_?<-494514846_ParB-HTH*||494520213_?-><-757157202_?<-757157203_?||494514850_?->494514851_?-><-494514852_?||494514853_?-> 494520212 <-ParB-HTH* ParB-HTH - CWATWH0003_RS05610 255 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. <-737857974_?<-494520212_ParB-HTH* # 2; 697211997 Terminase_LS->ParB-HTH*-> ParB-HTH - N545_RS29500 346 bacteria>actinobacteria Streptomyces sp. URHA0041 hypothetical protein [Streptomyces sp. URHA0041]. <-697211992_?<-697211993_?||697211994_?->697211995_?->697212008_?->697211996_Terminase_LS->697211997_ParB-HTH*->697212009_?->697211998_?->697211999_?-><-697212000_?<-697212001_?<-697212002_?<-697212010_? 740047622 ParA->ParB-HTH*-><-?||TrwC-> ParB-HTH - CF54_RS37465 343 bacteria>actinobacteria Streptomyces sp. Tu 6176 hypothetical protein [Streptomyces sp. Tu 6176]. 
740047610_?->740047611_?->740047613_?->740047614_?->740047616_?->740047619_?->740047620_ParA->740047622_ParB-HTH*-><-740047623_?||740047658_TrwC-><-740047625_?||740047628_?-><-740047631_?||740047633_?->740047660_?-> # 2; 663349955 HISKIN->?->?->?->?->?->?->ParB-HTH*-><-Mrr_cat-REase ParB-HTH SP IG05_RS0139385 329 bacteria>actinobacteria Streptomyces sp. NRRL S-1022 hypothetical protein [Streptomyces sp. NRRL S-1022]. 663349948_HISKIN->663349949_?->663349950_?->663349951_?->663349952_?->663349953_?->663349954_?->663349955_ParB-HTH*-><-663349956_Mrr_cat-REase<-663349957_?<-739965392_?||663349959_?->663349960_?->663349961_?-><-663349962_? 664445691 Mrr_cat-REase-><-ParB-HTH*<-?<-?<-?<-?<-?<-ASCH ParB-HTH - IH78_RS0156160 286 bacteria>actinobacteria Streptomyces sp. NRRL F-5140 hypothetical protein [Streptomyces sp. NRRL F-5140]. <-740023575_?<-664445688_?<-664445689_?||664445690_Mrr_cat-REase-><-664445691_ParB-HTH*<-740023578_?<-664445693_?<-664445694_?<-740023580_?<-740023582_?<-664445697_ASCH<-664445698_? # 2; 740055334 SFII-helicase->?-><-?||?->?-><-TrwC<-ParB-HTH*<-ParA<-?<-?<-?<-?<-?<-HISKIN ParB-HTH - BS72_RS00040 313 bacteria>actinobacteria Streptomyces yeochonensis hypothetical protein [Streptomyces yeochonensis]. 740055326_SFII-helicase->740055328_?-><-740055413_?||740055330_?->740055415_?-><-740055418_TrwC<-740055334_ParB-HTH*<-740055337_ParA<-740055340_?<-740055343_?<-740055346_?<-740055349_?<-740055351_?<-740055355_HISKIN 755076504 ParA->ParB-HTH*-> ParB-HTH - TR50_RS28230 313 bacteria>actinobacteria Streptacidiphilus anmyonensis hypothetical protein [Streptacidiphilus anmyonensis]. 755076484_?->755076487_?->755076604_?->755076491_?->755076494_?->755076498_?->755076501_ParA->755076504_ParB-HTH*->755076608_?->755076507_?->755076510_?->755076611_?-><-755076513_?||755076516_?->755076520_?-> # 2; 664512363 <-ParB-HTH*<-ParA ParB-HTH - IO31_RS0138135 308 bacteria>actinobacteria Streptomyces MULTISPECIES: hypothetical protein [Streptomyces]. 
739974938_?->664512824_?->664512827_?->664512830_?->739974940_?->664512367_?->739974942_?-><-664512363_ParB-HTH*<-664512361_ParA||665655926_?->664512357_?->664512355_?->664512353_?->664512351_?->664512348_?-> 663148255 <-ParB-HTH*<-ParA ParB-HTH - IE94_RS0124715 242 bacteria>actinobacteria Streptomyces violaceorubidus hypothetical protein [Streptomyces violaceorubidus]. 663148232_?->663148235_?->663148238_?->663148243_?->663148246_?->740073938_?->740073972_?-><-663148255_ParB-HTH*<-663148257_ParA||663148259_?->663148262_?->663148265_?->663148268_?->663148271_?->663148273_?-> # 2; 736303905 ParB->RNAse_T->?->ParB-HTH*-> ParB-HTH - Q371_RS01470 298 bacteria>deinococci Deinococcus misasensis hypothetical protein [Deinococcus misasensis]. <-736303875_?||736303881_?->736303885_?->736303889_?->736304540_ParB->736303894_RNAse_T->736303897_?->736303905_ParB-HTH*->736303908_?->736303915_?->736303920_?->736303924_?->736303928_?->736304544_?->736304548_?-> 736313124 <-ParB-HTH*<-?<-?<-RNAse_T ParB-HTH - Q371_RS10895 277 bacteria>deinococci Deinococcus misasensis hypothetical protein [Deinococcus misasensis]. <-736313104_?<-736313106_?<-736313109_?<-736313112_?<-736313115_?<-736313118_?<-736313121_?<-736313124_ParB-HTH*<-736313126_?<-736313127_?<-736313290_RNAse_T<-736313129_?<-736313131_?<-736313132_?<-736313133_? # 2; 671527277 <-TrwC<-ParB-HTH*<-ParA ParB-HTH - IF47_RS0126170 295 bacteria>actinobacteria Streptomyces megasporus hypothetical protein [Streptomyces megasporus]. <-739882773_?<-671527271_?||671527272_?->671527273_?-><-671527274_?||671527275_?-><-739882774_TrwC<-671527277_ParB-HTH*<-671527278_ParA||671527279_?-><-671527280_? 662754816 SFII-helicase->?-><-ParB-HTH*<-ParA<-?<-DDE ParB-HTH - IA22_RS0132355 288 bacteria>actinobacteria [Kitasatospora] papulosa hypothetical protein [[Kitasatospora] papulosa]. 
<-662754806_?<-662754808_?<-662754809_?<-662754810_?<-662754812_?||740441354_SFII-helicase->662754815_?-><-662754816_ParB-HTH*<-662754817_ParA<-662754818_?<-505392728_DDE<-662754819_?<-662754820_?<-662754821_?<-662754822_? # 2; 291531542 <-SFII-helicase<-?<-?<-?<-ParB-HTH* ParB-HTH DUF3102 EUS_21090 291 bacteria>firmicutes [Eubacterium] siraeum 70/3 Protein of unknown function (DUF3102) [[Eubacterium] siraeum 70/3]. <-291531535_?<-291531536_?<-291531537_?<-291531538_SFII-helicase<-291531539_?<-291531540_?<-291531541_?<-291531542_ParB-HTH*<-291531543_?<-291531544_?<-291531545_?<-291531546_?<-291531547_?<-291531548_?<-291531549_? 491496822 <-SFII-helicase<-?<-?<-?<-ParB-HTH* ParB-HTH DUF3102 G397_RS0111230 291 bacteria>firmicutes [Eubacterium] siraeum hypothetical protein [[Eubacterium] siraeum]. <-518492507_?<-491496813_?<-491496815_?<-491496816_SFII-helicase<-491496818_?<-769258155_?<-491496821_?<-491496822_ParB-HTH*<-491496825_?<-491496826_?<-491496832_?<-491496834_?<-491496840_?||491496842_?->491496845_?-> # 2; 490596705 <-ParB-HTH* ParB-HTH - LEP1GSC048_RS07865 286 bacteria>spirochaetes Leptospira santarosai hypothetical protein [Leptospira santarosai]. <-490596713_?<-490596712_?<-490596711_?<-490596709_?<-490596708_?<-490596707_?<-490596706_?<-490596705_ParB-HTH*<-490596704_?<-490596703_?<-490596702_?<-696346182_?<-490596699_?<-490596698_?||696346167_?-> 446063157 <-ParB-HTH* ParB-HTH - LEP1GSC034_RS113505 285 bacteria>spirochaetes Leptospira interrogans hypothetical protein [Leptospira interrogans]. <-446521022_?<-447117837_?<-696579207_?<-447008490_?<-447185333_?<-446587050_?<-487902815_?<-446063157_ParB-HTH*<-446991473_?<-487902812_?<-447170069_?<-446586997_?<-446992945_?<-446505006_?||446577653_?-> # 2; 443331859 HISKIN->?-><-?<-?||ParB-HTH*-> ParB-HTH SP C789_3692 285 bacteria>cyanobacteria Microcystis aeruginosa DIANCHI905 hypothetical protein C789_3692 [Microcystis aeruginosa DIANCHI905]. 
443331853_?->443331854_?->443331855_HISKIN->443331856_?-><-443331857_?<-443331858_?||443331859_ParB-HTH*->443331860_?->443331861_?->443331862_?->443331863_?->443331864_?->443331865_?-><-443331866_? 159026604 <-ParB-HTH*||?->?-><-?||?-><-HISKIN ParB-HTH SP IPF_3218 274 bacteria>cyanobacteria Microcystis aeruginosa PCC 7806 unnamed protein product [Microcystis aeruginosa PCC 7806]. <-159026604_ParB-HTH*||159026605_?->159026606_?-><-159026607_?||159026608_?-><-159026609_HISKIN<-159026610_?<-159026611_? # 2; 428272064 <-ParB-HTH* ParB-HTH - Sta7437_4542 278 bacteria>cyanobacteria Stanieria cyanosphaera PCC 7437 hypothetical protein Sta7437_4542 (plasmid) [Stanieria cyanosphaera PCC 7437]. 428272057_?->428272058_?->428272059_?->428272060_?-><-428272061_?||428272062_?->428272063_?-><-428272064_ParB-HTH*||428272065_?->428272066_?->428272067_?->428272068_?->428272069_?->428272070_?->428272071_?-> 428272125 <-ParB-HTH*<-?||?->?->?-><-AAA-ATPase ParB-HTH - Sta7437_4607 228 bacteria>cyanobacteria Stanieria cyanosphaera PCC 7437 hypothetical protein Sta7437_4607 (plasmid) [Stanieria cyanosphaera PCC 7437]. <-428272118_?<-428272119_?<-428272120_?<-428272121_?<-428272122_?<-428272123_?<-428272124_?<-428272125_ParB-HTH*<-428272126_?||428272127_?->428272128_?->428272129_?-><-428272130_AAA-ATPase||428272131_?->428272132_?-> # 2; 518337503 TPR+CASPASE-><-?||?->?->?-><-?||?->ParB-HTH*-><-?<-?||?->Peptidase_M10-> ParB-HTH - PLEUR7319_RS0123660 254 bacteria>cyanobacteria Pleurocapsa sp. PCC 7319 hypothetical protein [Pleurocapsa sp. PCC 7319]. 738913802_TPR+CASPASE-><-518337497_?||518337498_?->648410984_?->648410985_?-><-518337501_?||518337502_?->518337503_ParB-HTH*-><-518337504_?<-738913804_?||518337506_?->738913807_Peptidase_M10->648410986_?->518337509_?->738913809_?-> 493559029 <-ParB-HTH*||?-><-?||?->?-><-?<-?<-HISKIN ParB-HTH - XEN7305_RS25675 252 bacteria>cyanobacteria Xenococcus sp. PCC 7305 hypothetical protein [Xenococcus sp. PCC 7305]. 
493559020_?->493559022_?->750617818_?-><-493559024_?<-493559026_?<-493559027_?||493559028_?-><-493559029_ParB-HTH*||750617821_?-><-493559031_?||750617853_?->750617856_?-><-493559034_?<-493559035_?<-493559036_HISKIN # 2; 664188154 ParB-HTH*->?->?-><-?||?->?-><-?<-HISKIN ParB-HTH - IF88_RS0135010 228 bacteria>actinobacteria Streptomyces sp. NRRL F-2580 hypothetical protein [Streptomyces sp. NRRL F-2580]. 664188133_?-><-664188136_?<-664188139_?||664188142_?-><-664188145_?<-664188148_?||664188151_?->664188154_ParB-HTH*->664188157_?->664188160_?-><-664188162_?||664188165_?->664188168_?-><-664188171_?<-664188174_HISKIN 759522371 ParB-HTH*-> ParB-HTH - STRVI_RS46070 214 bacteria>actinobacteria Streptomyces violaceusniger hypothetical protein, partial [Streptomyces violaceusniger]. <-503809945_?<-503809946_?<-503809947_?<-503809948_?<-503809949_?||503809950_?->503809951_?->759522371_ParB-HTH*->759522303_?-><-759522305_?<-503809955_?<-759522373_?||503809958_?->503809959_?->759522375_?-> # 2; 218762801 <-MU-transposase<-ParB-HTH*<-?<-?||?-><-?||?->HISKIN-> ParB-HTH - Dalk_3579 224 bacteria>proteobacteria>deltaproteobacteria Desulfatibacillum alkenivorans AK-01 hypothetical protein Dalk_3579 [Desulfatibacillum alkenivorans AK-01]. <-218762794_?<-218762795_?<-218762796_?<-218762797_?<-218762798_?<-218762799_?<-218762800_MU-transposase<-218762801_ParB-HTH*<-218762802_?<-218762803_?||218762804_?-><-218762805_?||218762806_?->218762807_HISKIN-><-218762808_? 654862385 <-P-loop<-?<-ParB-HTH* ParB-HTH - G491_RS0111365 223 bacteria>proteobacteria>deltaproteobacteria Desulfatibacillum aliphaticivorans hypothetical protein [Desulfatibacillum aliphaticivorans]. 
<-654862378_?<-654862379_?<-654862380_?<-737234295_?<-654862382_?<-654862383_P-loop<-654862384_?<-654862385_ParB-HTH*||654862386_?-><-737234244_?<-737234296_?||654862387_?->654862388_?->654862389_?->654862390_?-> # 2; 523467872 DnaJ->?->?->?-><-?<-?<-?<-ParB-HTH* ParB-HTH - dsmv_2585 201 bacteria>proteobacteria>deltaproteobacteria Desulfococcus multivorans DSM 2059 hypothetical protein dsmv_2585 [Desulfococcus multivorans DSM 2059]. 523467865_DnaJ->523467866_?->523467867_?->523467868_?-><-523467869_?<-523467870_?<-523467871_?<-523467872_ParB-HTH*<-523467873_?<-523467874_?<-523467875_?<-523467876_?<-523467877_?<-523467878_?<-523467879_? 750110637 DnaJ->?->?->?-><-?<-?<-?<-ParB-HTH*<-?<-ParB-HTH ParB-HTH - DSMV_RS10090 198 bacteria>proteobacteria>deltaproteobacteria Desulfococcus multivorans hypothetical protein [Desulfococcus multivorans]. 527025605_DnaJ->527025606_?->527025607_?->527025608_?-><-527025609_?<-750110705_?<-527025611_?<-750110637_ParB-HTH*<-527025613_?<-750110641_ParB-HTH<-527025615_?<-527025616_?<-527025617_?<-750110707_?<-527025619_? # 2; 759540387 ParA->ParB-HTH*->?->?->TrwC-> ParB-HTH - SSP08S_RS57300 154 bacteria>actinobacteria Streptomyces sp. R1-NS-10 hypothetical protein, partial [Streptomyces sp. R1-NS-10]. 517904201_?->517904202_?->517904203_?->759540386_?->517904205_?->517904206_?->517904207_ParA->759540387_ParB-HTH*->759540388_?->517904210_?->759540389_TrwC->517904212_?-><-517904213_?||648487654_?->517904215_?-> 759768796 SFII-helicase-><-?<-ParB-HTH*<-ParA ParB-HTH Ndufs5 BI06_RS43330 143 bacteria>actinobacteria Kitasatospora sp. MBT66 hypothetical protein, partial [Kitasatospora sp. MBT66]. 759768792_SFII-helicase-><-759768475_?<-759768796_ParB-HTH*<-759768805_ParA||759756574_?-><-759768478_?||759768481_?-><-759752933_?||759753483_?->759752930_?-> # 2; 748137595 RecT->HTH->HTH->ParB-HTH*->ParB-HTH+Prok-TUDOR-><-HU-IHF ParB-HTH - QH73_RS16185 146 bacteria>cyanobacteria Scytonema millei hypothetical protein, partial [Scytonema millei]. 
<-748137493_?<-748137494_?<-748137495_?||748137496_?->748137497_RecT->748137498_HTH->748137499_HTH->748137595_ParB-HTH*->748137500_ParB-HTH+Prok-TUDOR-><-748137501_HU-IHF||748137502_?-><-748137503_?<-748137504_?||748137505_?->748137596_?-> 748136711 HU-IHF->?->?-><-ParB-HTH*<-HTH<-RecT ParB-HTH - QH73_RS11420 127 bacteria>cyanobacteria Scytonema millei hypothetical protein, partial [Scytonema millei]. <-748136710_?<-748136545_?<-748136546_?||748136547_?->748136548_HU-IHF->748136549_?->748136550_?-><-748136711_ParB-HTH*<-748136551_HTH<-748136712_RecT<-748136552_?||748136553_?-><-748136554_?||748136555_?-><-748136556_? # 2; 67856109 <-RVT+HNH<-?<-DDE_Tnp_1 - SP CwatDRAFT_4471 138 bacteria>cyanobacteria Crocosphaera watsonii WH 8501 hypothetical protein CwatDRAFT_4471 [Crocosphaera watsonii WH 8501]. 67856105_?-><-67856266_RVT+HNH<-67856265_?<-67856264_DDE_Tnp_1||67856106_?->67856107_?->67856108_?->67856109_?*->67856110_?->67856111_?->67856112_?->67856113_?->67856114_?->67856115_?->67856116_?-> 757157310 <-RVT+HNH<-?||?-><-DDE_Tnp_1||?->?->?->ParB-HTH*-> ParB-HTH - CWATDRAFT_RS09845 114 bacteria>cyanobacteria Crocosphaera watsonii hypothetical protein [Crocosphaera watsonii]. <-757157323_RVT+HNH<-494515558_?||757157308_?-><-494514515_DDE_Tnp_1||757157309_?->757157324_?->494515407_?->757157310_ParB-HTH*->757157311_?->757157325_?->494515410_?->494515411_?->494515412_?->494515413_?->494515414_?-> # 2; 158310190 ParB-HTH*->?->?->?->?->?->?->HU-IHF-> ParB-HTH - AM1_B0079 138 bacteria>cyanobacteria Acaryochloris marina MBIC11017 hypothetical protein AM1_B0079 (plasmid) [Acaryochloris marina MBIC11017]. 158310183_?-><-158310184_?<-158310185_?<-158310186_?||158310187_?-><-158310188_?<-158310189_?||158310190_ParB-HTH*->158310191_?->158310192_?->158310193_?->158310194_?->158310195_?->158310196_?->158310197_HU-IHF-> 753958401 ParB-HTH*->?->?->?->?->?->HU-IHF-> ParB-HTH - AM1_RS30720 134 bacteria>cyanobacteria Acaryochloris marina hypothetical protein [Acaryochloris marina]. 
753958398_?->753958491_?-><-501117690_?<-501117691_?<-501117692_?<-753958400_?<-501117695_?||753958401_ParB-HTH*->753958494_?->501117699_?->501117700_?->501117701_?->501117702_?->501117703_HU-IHF->501117704_?-> # 2; 119462359 <-ParB-HTH+Prok-TUDOR*<-ParB-HTH<-?||DCM-> ParB-HTH+Prok-TUDOR - N9414_06389 116 bacteria>cyanobacteria Nodularia spumigena CCY9414 hypothetical protein N9414_06389 [Nodularia spumigena CCY9414]. <-119462352_?<-119462353_?<-119462354_?<-119462355_?<-119462356_?||119462357_?-><-119462358_?<-119462359_ParB-HTH+Prok-TUDOR*<-119462360_ParB-HTH<-119462361_?||119462362_DCM->119462363_?-><-119462364_?||119462365_?-><-119462366_? 585121647 <-ParB-HTH+Prok-TUDOR*<-ParB-HTH<-?||?->DCM-> ParB-HTH+Prok-TUDOR - NSP_22570 114 bacteria>cyanobacteria Nodularia spumigena CCY9414 Benzoyl-CoA reductase/2-hydroxyglutaryl-CoA dehydratase subunit, BcrC/BadD/HgdB [Nodularia spumigena CCY9414]. <-585121640_?<-585121641_?<-585121642_?<-585121643_?<-585121644_?||585121645_?-><-585121646_?<-585121647_ParB-HTH+Prok-TUDOR*<-585121648_ParB-HTH<-585121649_?||585121650_?->585121651_DCM->585121652_?-><-585121653_?<-585121654_? # 1; 306986606 DCM+ParB-HTH+Prok-TUDOR*-> DCM+ParB-HTH+Prok-TUDOR DNA_methylase Cyan7822_6833 645 bacteria>cyanobacteria Cyanothece sp. PCC 7822 DNA-cytosine methyltransferase (plasmid) [Cyanothece sp. PCC 7822]. 306986599_?-><-306986600_?<-306986601_?<-306986602_?||306986603_?-><-306986604_?<-306986605_?||306986606_DCM+ParB-HTH+Prok-TUDOR*-><-306986607_?||306986608_?->306986609_?->306986610_?-><-306986611_?<-306986612_?<-306986613_? 543538779 <-ParB-HTH+Prok-TUDOR*<-?<-?||?->?->?-><-RecD<-RecD ParB-HTH+Prok-TUDOR - CWATWH0402_1321 560 bacteria>cyanobacteria Crocosphaera watsonii WH 0402 hypothetical protein CWATWH0402_1321 [Crocosphaera watsonii WH 0402]. 
<-543538778_?<-543538779_ParB-HTH+Prok-TUDOR*<-543538780_?<-543538781_?||543538782_?->543538783_?->543538784_?-><-543538785_RecD<-543538786_RecD 480702301 HNH->?->?->?->ParB-HTH*-> ParB-HTH DUF2360 HMPREF1089_00435 551 bacteria>firmicutes [Clostridium] bolteae 90B3 hypothetical protein HMPREF1089_00435 [[Clostridium] bolteae 90B3]. 480702294_?->480702295_?->480702296_?->480702297_HNH->480702298_?->480702299_?->480702300_?->480702301_ParB-HTH*->480702302_?->480702303_?->480702304_?->480702305_?->480702306_?->480702307_?->480702308_?-> 510895729 ParA->ParB-HTH*-> ParB-HTH - C819_RS19820 527 bacteria>firmicutes Lachnospiraceae bacterium 10-1 hypothetical protein [Lachnospiraceae bacterium 10-1]. 665905575_?->510895723_?->510895724_?->510895725_?-><-510895726_?||665905576_?->510895728_ParA->510895729_ParB-HTH*-><-665905577_?<-510895731_?||510895733_?->736104461_?->550997608_?->510895737_?->510895739_?-> 344043288 ParB-HTH*-> ParB-HTH - Strvi_0238 361 bacteria>actinobacteria Streptomyces violaceusniger Tu 4113 hypothetical protein Strvi_0238 (plasmid) [Streptomyces violaceusniger Tu 4113]. <-344043281_?<-344043282_?<-344043283_?<-344043284_?<-344043285_?||344043286_?->344043287_?->344043288_ParB-HTH*->344043289_?->344043290_?-><-344043291_?<-344043292_?<-344043293_?||344043294_?->344043295_?-> 427992361 <-ParB-HTH* ParB-HTH SP Pse7367_3831 353 bacteria>cyanobacteria Pseudanabaena sp. PCC 7367 hypothetical protein Pse7367_3831 (plasmid) [Pseudanabaena sp. PCC 7367]. <-427992354_?<-427992355_?||427992356_?-><-427992357_?||427992358_?->427992359_?-><-427992360_?<-427992361_ParB-HTH*<-427992362_?||427992363_?->427992364_?->427992365_?->427992366_?->427992367_?-><-427992368_? 386428626 ParB-HTH*-> ParB-HTH Trp_dioxygenase+DUF488 BegalDRAFT_1574 337 bacteria>proteobacteria>gammaproteobacteria Beggiatoa alba B18LD hypothetical protein BegalDRAFT_1574 [Beggiatoa alba B18LD]. 
386428619_?->386428620_?->386428621_?-><-386428622_?<-386428623_?||386428624_?-><-386428625_?||386428626_ParB-HTH*->386428627_?-><-386428628_?||386428629_?->386428630_?->386428631_?->386428632_?->386428633_?-> 787066260 ParB-HTH*-> ParB-HTH - BAR36866.1 331 viruses uncultured Mediterranean phage uvMED unnamed protein product [uncultured Mediterranean phage uvMED]. <-787066253_?<-787066254_?<-787066255_?<-787066256_?||787066257_?->787066258_?->787066259_?->787066260_ParB-HTH*->787066261_?->787066262_?->787066263_?->787066264_?->787066265_?->787066266_?-><-787066267_? 521992951 ParB-HTH*-> ParB-HTH - A39O_RS0108365 328 bacteria>proteobacteria>gammaproteobacteria Lamprocystis purpurea hypothetical protein [Lamprocystis purpurea]. 750223431_?->521992951_ParB-HTH*->521992952_?->750223432_?-> 749286507 <-ParB-HTH*<-ParA||NLPC-> ParB-HTH - NS07_v2contig00189-0005 328 bacteria>actinobacteria Nocardia seriolae hypothetical protein NS07_v2contig00189-0005 [Nocardia seriolae]. 749286503_?->749286504_?->749286505_?->749286506_?-><-749286507_ParB-HTH*<-749286508_ParA||749286509_NLPC-><-749286510_?||749286511_?-><-749286512_?||749286513_?-> 428272094 <-ParB-HTH* ParB-HTH SP Sta7437_4575 326 bacteria>cyanobacteria Stanieria cyanosphaera PCC 7437 hypothetical protein Sta7437_4575 (plasmid) [Stanieria cyanosphaera PCC 7437]. 428272087_?->428272088_?-><-428272089_?||428272090_?->428272091_?->428272092_?->428272093_?-><-428272094_ParB-HTH*||428272095_?-><-428272096_?<-428272097_?<-428272098_?<-428272099_?<-428272100_?<-428272101_? 291541580 ParB-HTH*->?->?->SFII-helicase-> ParB-HTH AAA_13 RBR_02590 325 bacteria>firmicutes Ruminococcus bromii L2-63 hypothetical protein RBR_02590 [Ruminococcus bromii L2-63]. 
291541573_?->291541574_?->291541575_?->291541576_?->291541577_?->291541578_?->291541579_?->291541580_ParB-HTH*->291541581_?->291541582_?->291541583_SFII-helicase->291541584_?->291541585_?->291541586_?->291541587_?-> 759552136 ParB-HTH*-> ParB-HTH - RZ84_RS34725 317 bacteria>actinobacteria Streptomyces sp. CT34 hypothetical protein [Streptomyces sp. CT34]. <-759552125_?<-759552181_?<-759552127_?||759552183_?->759552129_?->759552131_?->759552134_?->759552136_ParB-HTH*-><-759552138_?||759552140_?->759552185_?->759552142_?->759552144_?->759552147_?->759552150_?-> 727525039 ParA->ParB-HTH*-> ParB-HTH - AA75_RS07430 309 bacteria>actinobacteria Kitasatospora sp. MBT63 hypothetical protein [Kitasatospora sp. MBT63]. 727525073_?->727525034_ParA->727525039_ParB-HTH*->727525044_?-><-727525078_?||727525047_?->727525050_?->727525055_?->727525081_?->727525058_?-> 787047096 ParB-HTH*-> ParB-HTH - BAR21317.1 309 viruses uncultured Mediterranean phage uvMED unnamed protein product [uncultured Mediterranean phage uvMED]. 787047089_?->787047090_?->787047091_?->787047092_?->787047093_?->787047094_?->787047095_?->787047096_ParB-HTH*->787047097_?->787047098_?->787047099_?->787047100_?->787047101_?->787047102_?-><-787047103_? 759952224 ParA->ParB-HTH*-> ParB-HTH - IO39_RS42050 298 bacteria>actinobacteria Nonomuraea candida hypothetical protein [Nonomuraea candida]. 759952206_?->759952209_?-><-759952212_?<-759952215_?||759952296_?->759952218_?->759952221_ParA->759952224_ParB-HTH*-><-759952228_?<-759952230_?<-759952232_?<-759952237_?<-759952240_?||759952243_?-><-759952298_? 690403288 <-ParB-HTH*<-ParA ParB-HTH - pSLA2-S.15 294 bacteria>actinobacteria Streptomyces rochei hypothetical protein [Streptomyces rochei]. 
689922287_?->689922288_?->689922289_?-><-689922290_?||689922291_?-><-689922292_?||689922293_?-><-690403288_ParB-HTH*<-689922295_ParA<-689922296_?||689922297_?-><-689922298_?||689922299_?->689922300_?->689922301_?-> 493426429 <-ParB-HTH*<-ParA<-RNAse_T ParB-HTH - STRTUCAR8_RS40225 291 bacteria>actinobacteria Streptomyces turgidiscabies hypothetical protein [Streptomyces turgidiscabies]. 764831940_?->764831942_?->764831945_?->493426431_?->493426426_?->764831947_?-><-493426429_ParB-HTH*<-764831949_ParA<-764831951_RNAse_T 503099618 NACHT-><-?||?->?-><-ParB-HTH+Prok-TUDOR* ParB-HTH+Prok-TUDOR - CYAN7822_RS28405 291 bacteria>cyanobacteria Cyanothece sp. PCC 7822 hypothetical protein [Cyanothece sp. PCC 7822]. 503099611_?-><-754535247_?<-754535459_?||754535462_NACHT-><-754535465_?||503099616_?->503099617_?-><-503099618_ParB-HTH+Prok-TUDOR*||754535249_?->503099620_?->754535467_?->754535470_?->754535252_?->503099623_?->503099625_?-> 497670003 ParB-HTH*->?->?->?->SFII-helicase-> ParB-HTH V_ATPase_I RUMFLAFD1_RS0105685 290 bacteria>firmicutes Ruminococcus flavefaciens hypothetical protein [Ruminococcus flavefaciens]. 497669993_?->497669994_?->497669996_?->497669998_?->497669999_?->739421887_?->497670001_?->497670003_ParB-HTH*->497670004_?->739421889_?->739421891_?->497670009_SFII-helicase->497670011_?->497670013_?->497670015_?-> 738539870 <-ParA<-?<-?||?->HNH-><-ParB-HTH+Prok-TUDOR*||?-><-ParB-HTH+Prok-TUDOR<-?<-?||XerD-> ParB-HTH+Prok-TUDOR - KV40_RS28175 287 bacteria>cyanobacteria Myxosarcina sp. GI1 hypothetical protein [Myxosarcina sp. GI1]. <-738539857_?<-738539859_?<-738539861_ParA<-738539864_?<-738539995_?||738540017_?->738539867_HNH-><-738539870_ParB-HTH+Prok-TUDOR*||738539872_?-><-738539875_ParB-HTH+Prok-TUDOR<-738539878_?<-738540021_?||738540025_XerD-><-738539881_?<-738539884_? 739811243 <-TrwC||?->?-><-ParB-HTH* ParB-HTH - IF34_RS35665 286 bacteria>actinobacteria Streptomyces griseofuscus hypothetical protein [Streptomyces griseofuscus]. 
<-739811291_?<-739811292_?<-739811236_?||739811293_?-><-739811294_TrwC||739811295_?->739811240_?-><-739811243_ParB-HTH*<-739811297_?<-739811299_?<-739811247_?<-739811301_?<-739811248_?<-739811250_?<-739811253_? 506264213 ParB-HTH*-> ParB-HTH - CYAN8802_RS10090 284 bacteria>cyanobacteria Cyanothece sp. PCC 8802 hypothetical protein [Cyanothece sp. PCC 8802]. <-506264220_?<-506264219_?||752567501_?-><-506264217_?<-506264216_?<-752567744_?||506264214_?->506264213_ParB-HTH*->506264212_?->506264211_?->506264210_?->752567742_?->506265425_?-><-506265426_?<-506265427_? 664375955 ParA?->ParB-HTH*-><-?<-?<-?<-?<-ASCH ParB-HTH - IG86_RS0137215 282 bacteria>actinobacteria Streptomyces virginiae hypothetical protein [Streptomyces virginiae]. 664375953_ParA?->664375955_ParB-HTH*-><-664375958_?<-664375960_?<-664375962_?<-664375964_?<-664375967_ASCH<-664375970_?<-664375973_? 502659622 ParA->ParB-HTH*-> ParB-HTH - SROS_RS45435 275 bacteria>actinobacteria Streptosporangium roseum hypothetical protein [Streptosporangium roseum]. <-502659614_?<-502659615_?<-759974859_?||502659617_?-><-759974841_?||502659620_?->759974844_ParA->502659622_ParB-HTH*-><-759974847_?||502659624_?-><-502659625_?<-502659626_?<-759974862_?||502659628_?->502659629_?-> 663245548 <-ParB-HTH*<-ParA ParB-HTH - IE98_RS0140635 274 bacteria>actinobacteria Streptomyces sp. NRRL B-24484 hypothetical protein [Streptomyces sp. NRRL B-24484]. 663245517_?->663245521_?->663245524_?->759454963_?->663245536_?-><-663245541_?<-663245545_?<-663245548_ParB-HTH*<-663245553_ParA 748135960 ParB-HTH+Prok-TUDOR*->?->HISKIN->HISKIN-> ParB-HTH+Prok-TUDOR - QH73_RS08030 272 bacteria>cyanobacteria Scytonema millei hypothetical protein, partial [Scytonema millei]. 748135956_?->748135904_?->748135905_?->748135957_?-><-748135906_?<-748135958_?||748135959_?->748135960_ParB-HTH+Prok-TUDOR*->748135961_?->748135907_HISKIN->748135908_HISKIN->748135909_?-><-748135910_?||748135962_?-><-748135963_? 
518317229 <-HISKIN<-?<-?<-?<-?<-ParB-HTH* ParB-HTH - OSCIL6407_RS0116325 270 bacteria>cyanobacteria Kamptonema formosum hypothetical protein [Kamptonema formosum]. <-494599497_?<-494599498_?<-494599499_HISKIN<-494599904_?<-518317226_?<-518317227_?<-518317228_?<-518317229_ParB-HTH*<-518317230_?<-518317231_?<-750368297_?||518317233_?->518317234_?->518317235_?->518317236_?-> 505048307 <-ParB-HTH*<-?<-?<-?<-RNAse_T ParB-HTH - DEIPE_RS07610 269 bacteria>deinococci Deinococcus peraridilitoris hypothetical protein [Deinococcus peraridilitoris]. <-505048301_?<-505048302_?<-505048303_?<-505048304_?<-505048305_?<-752559530_?<-505048306_?<-505048307_ParB-HTH*<-505048308_?<-505048309_?<-505048310_?<-752560050_RNAse_T<-505048312_?<-505048313_?<-752560051_? 427376379 P-loop->?->?->?->?->ParB-HTH*-> ParB-HTH - Syn6312_1142 266 bacteria>cyanobacteria Synechococcus sp. PCC 6312 hypothetical protein Syn6312_1142 [Synechococcus sp. PCC 6312]. 427376372_?->427376373_?->427376374_P-loop->427376375_?->427376376_?->427376377_?->427376378_?->427376379_ParB-HTH*->427376380_?->427376381_?->427376382_?->427376383_?->427376384_?->427376385_?->427376386_?-> 504973660 ParB-HTH*->?-><-HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon ParB-HTH MatP 266 bacteria>cyanobacteria Chamaesiphon minutus hypothetical protein [Chamaesiphon minutus]. <-504973653_?<-753793967_?<-504973656_?<-753792584_?||753793968_?-><-504973658_?<-753792586_?||504973660_ParB-HTH*->504973661_?-><-753793138_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon<-504971468_?||753793969_?->504973664_?->753792588_?-><-504973666_? 387857486 <-Relaxase<-?||?->?->?-><-ParB-HTH*<-ParA ParB-HTH - Emtol_0315 260 bacteria>bacteroidetes Emticicia oligotrophica DSM 17448 hypothetical protein Emtol_0315 (plasmid) [Emticicia oligotrophica DSM 17448]. 
387857479_?-><-387857480_?<-387857481_Relaxase<-387857482_?||387857483_?->387857484_?->387857485_?-><-387857486_ParB-HTH*<-387857487_ParA||387857488_?->387857489_?->387857490_?->387857491_?->387857492_?->387857493_?-> 501453636 ParA->ParB-HTH*-><-?<-?<-?<-?||?-><-SFII-helicase ParB-HTH SP 260 bacteria>actinobacteria Streptomyces sp. FR1 hypothetical protein [Streptomyces sp. FR1]. 190410681_?->190410682_?->190410683_?->190410684_?->190410685_?->190410686_?->190410687_ParA->501453636_ParB-HTH*-><-190410689_?<-190410690_?<-190410691_?<-190410692_?||190410693_?-><-190410694_SFII-helicase<-190410695_? 576415619 <-ParB-HTH+ParB*<-?||DDE_Tnp_1_2->DDE_Tnp_1_2-> ParB-HTH+ParB ParBc I546_4173 257 bacteria>actinobacteria Mycobacterium kansasii 732 parB-like nuclease domain protein [Mycobacterium kansasii 732]. <-576415477_?<-576415788_?<-576415759_?<-576415637_?<-576415706_?<-576415701_?<-576415774_?<-576415619_ParB-HTH+ParB*<-576415492_?||576415657_DDE_Tnp_1_2->576415496_DDE_Tnp_1_2-><-576415798_?<-576415497_?||576415587_?->576415793_?-> 390172790 <-ParB-HTH+ParB-HTH* ParB-HTH+ParB-HTH SP NITHO_3110002 254 bacteria>chloroflexi Nitrolancea hollandica Lb hypothetical protein NITHO_3110002 [Nitrolancea hollandica Lb]. 390172789_?-><-390172790_ParB-HTH+ParB-HTH*||390172791_?-><-390172792_?||390172793_?-><-390172794_?<-390172795_?<-390172796_?<-390172797_? 503393144 <-ParB<-?||ParB-HTH*-><-?||DDE_Tnp_1_2-> ParB-HTH - 246 bacteria>planctomycetes Planctomyces brasiliensis hypothetical protein [Planctomyces brasiliensis]. <-752752012_?<-503393138_?<-752752013_?<-503393140_?<-503393141_?<-503393142_ParB<-503393143_?||503393144_ParB-HTH*-><-752750814_?||752751618_DDE_Tnp_1_2->752750816_?->752750818_?->503393146_?-><-503393147_?<-752750820_? 652339044 <-ParB-HTH*<-?<-?<-?<-?<-?||?-><-TPR+CASPASE ParB-HTH - 240 bacteria>cyanobacteria Fischerella sp. PCC 9605 hypothetical protein [Fischerella sp. PCC 9605]. 
737153835_?->737153754_?->652339039_?->652339040_?-><-652339041_?||652339042_?-><-652339043_?<-652339044_ParB-HTH*<-652339045_?<-737153837_?<-652339047_?<-652339048_?<-652339049_?||737153839_?-><-652339052_TPR+CASPASE 373100107 ABC-> - - OR16_21363 235 bacteria>proteobacteria>betaproteobacteria Cupriavidus basilensis OR16 hypothetical protein OR16_21363 [Cupriavidus basilensis OR16]. <-373100100_?<-373100101_?||373100102_?-><-373100103_?||373100104_?->373100105_?->373100106_?-><-373100107_?*<-373100108_?||373100109_?->373100110_?->373100111_?->373100112_ABC->373100113_?->373100114_?-> 522821476 HISKIN-><-ParB-HTH* ParB-HTH - 228 bacteria>proteobacteria>deltaproteobacteria Sorangium cellulosum hypothetical protein [Sorangium cellulosum]. <-769244403_?||522821470_?->522821471_?->769244405_?-><-769244406_?<-522821474_?||522821475_HISKIN-><-522821476_ParB-HTH*||769241447_?->522821478_?->769244407_?->522821482_?-><-522821483_?<-522821484_?<-522821485_? 223896866 ParB-HTH*-> ParB-HTH - Cflav_PD5941 212 bacteria>verrucomicrobia Pedosphaera parvula Ellin514 hypothetical protein Cflav_PD5941 [Pedosphaera parvula Ellin514]. 223896859_?-><-223896860_?<-223896861_?||223896862_?->223896863_?->223896864_?->223896865_?->223896866_ParB-HTH*->223896867_?->223896868_?-><-223896869_?<-223896870_?<-223896871_?<-223896872_?<-223896873_? 522187926 <-ParB-HTH* ParB-HTH - BN60_RS12900 210 bacteria>proteobacteria>alphaproteobacteria Reyranella massiliensis hypothetical protein [Reyranella massiliensis]. <-522187918_?||759564486_?->522187920_?-><-522187921_?<-522187923_?<-759566104_?<-522187925_?<-522187926_ParB-HTH*<-522187927_?<-522187928_?<-522187929_?||522187930_?-><-522187931_?<-522187932_?<-522187933_? 494604192 ParB-HTH*->ParB-> ParB-HTH - OPIT1DRAFT_RS19430 209 bacteria>verrucomicrobia Opitutaceae bacterium TAV1 hypothetical protein [Opitutaceae bacterium TAV1]. 
<-494604186_?<-494604187_?<-494604188_?<-494604189_?||759401514_?-><-759401518_?||494604191_?->494604192_ParB-HTH*->759403425_ParB->494604194_?->494604195_?->494604196_?->494604197_?->759401523_?->759401526_?-> 748137500 RecT->HTH->HTH->ParB-HTH->ParB-HTH+Prok-TUDOR*-><-HU-IHF ParB-HTH+Prok-TUDOR - QH73_RS16190 174 bacteria>cyanobacteria Scytonema millei hypothetical protein, partial [Scytonema millei]. <-748137494_?<-748137495_?||748137496_?->748137497_RecT->748137498_HTH->748137499_HTH->748137595_ParB-HTH->748137500_ParB-HTH+Prok-TUDOR*-><-748137501_HU-IHF||748137502_?-><-748137503_?<-748137504_?||748137505_?->748137596_?->748137597_?-> 488660258 ParB-HTH*->?->?->?->?-><-?<-?<-DDE ParB-HTH - HMPREF1086_RS21385 152 bacteria>firmicutes [Clostridium] clostridioforme hypothetical protein [[Clostridium] clostridioforme]. 488660264_?->488660263_?->488660262_?->488660261_?->488660260_?->488633868_?->488660259_?->488660258_ParB-HTH*->488660257_?->488660256_?->488660255_?->488660254_?-><-488663774_?<-488648570_?<-488648571_DDE 390173641 <-ParB-HTH* ParB-HTH SP NITHO_2240002 135 bacteria>chloroflexi Nitrolancea hollandica Lb hypothetical protein NITHO_2240002 [Nitrolancea hollandica Lb]. 390173640_?-><-390173641_ParB-HTH*||390173642_?->390173643_?->390173644_?-><-390173645_?<-390173646_?||390173647_?-><-390173648_? 663210974 <-ParB-HTH*<-ParA ParB-HTH - IE97_RS0131565 134 bacteria>actinobacteria Streptomyces sp. NRRL S-455 hypothetical protein, partial [Streptomyces sp. NRRL S-455]. 663210936_?->663210951_?->663210955_?->663210959_?->663210964_?->663210968_?->663210971_?-><-663210974_ParB-HTH*<-739995163_ParA||663210981_?->663210985_?->663210989_?->663210991_?->663210994_?->663210996_?-> 760128192 <-MU-transposase<-ParB-HTH*<-?<-?||?-><-?||?->HISKIN-> ParB-HTH - DALK_RS18450 128 bacteria>proteobacteria>deltaproteobacteria Desulfatibacillum alkenivorans hypothetical protein, partial [Desulfatibacillum alkenivorans]. 
<-506428598_?<-760126304_?<-760128188_?<-506428601_?<-506428602_?<-506428603_?<-760128190_MU-transposase<-760128192_ParB-HTH*<-760126305_?<-506428607_?||506428608_?-><-506428609_?||506428610_?->760128194_HISKIN-><-506428612_? 308205593 ParB-HTH+Prok-TUDOR*-> ParB-HTH+Prok-TUDOR - Nfla_3501 127 bacteria>cyanobacteria Nostoc flagelliforme str. Sunitezuoqi hypothetical protein Nfla_3501 [Nostoc flagelliforme str. Sunitezuoqi]. 308205593_ParB-HTH+Prok-TUDOR*-> 501377550 ParB-HTH*-> ParB-HTH - NPUN_RS12905 107 bacteria>cyanobacteria Nostoc punctiforme hypothetical protein [Nostoc punctiforme]. <-501377544_?<-753809726_?<-501377546_?||753810489_?-><-501377548_?||501377549_?->753809727_?->501377550_ParB-HTH*->753809728_?->753810490_?->501377553_?->501377554_?->753809729_?->501377555_?->753810491_?-> 752563686 HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->ParB-HTH*-> ParB-HTH - CYAST_RS12110 107 bacteria>cyanobacteria Cyanobacterium stanieri hypothetical protein, partial [Cyanobacterium stanieri]. 505036590_?->505036591_?->505036592_?-><-505036593_?||505036594_?->505036595_?->505036023_HTH_OrfB_IS605+OrfB_IS605+OrfB_Zn_ribbon->752563686_ParB-HTH*->752563385_?-><-505036596_?||505036597_?->505036598_?->505036599_?->752563687_?-><-505036601_? 571784120 <-ParB-HTH*<-zf-CHC2 ParB-HTH - OMM_12844 96 bacteria>proteobacteria>deltaproteobacteria Candidatus Magnetoglobus multicellularis str. Araruama hypothetical protein OMM_12844, partial [Candidatus Magnetoglobus multicellularis str. Araruama]. <-571784120_ParB-HTH*<-571784121_zf-CHC2<-571784122_? 648257288 <-ParB-HTH* ParB-HTH - RH99_RS08025 90 bacteria>firmicutes Clostridium arbusti hypothetical protein [Clostridium arbusti]. <-497922162_?<-497922164_?<-497922165_?<-497922167_?<-497922169_?<-497922171_?<-497922174_?<-648257288_ParB-HTH*<-497922180_?<-648257289_?<-648257290_?<-497922182_?<-497922185_?<-497922187_?<-497922189_? 
543531433 <-ParB-HTH*<-XerD<-PriCT_2<-DDE_Tnp_ISAZ013<-DDE_Tnp_ISAZ013 ParB-HTH - CWATWH0402_4419 68 bacteria>cyanobacteria Crocosphaera watsonii WH 0402 hypothetical protein CWATWH0402_4419, partial [Crocosphaera watsonii WH 0402]. <-543531433_ParB-HTH*<-543531434_XerD<-543531435_PriCT_2<-543531436_DDE_Tnp_ISAZ013<-543531437_DDE_Tnp_ISAZ013 52696830 <-ParB-HTH*<-ParB-HTH ParB-HTH - BGP305 50 bacteria>spirochaetes Borrelia garinii PBi hypothetical protein BGP305 [Borrelia garinii PBi]. <-52696828_?<-52696829_?<-52696830_ParB-HTH*<-52696831_ParB-HTH<-52696832_?<-52696833_?
ALIGN ---EEEE---------------------EEEEEEE--------------EEEEEE---E--------EEEEEE----HH-HHHHHH----- HMM -HHHHHHHH---------------EE-EEEEEEEE-------E-----EEEEEEE-------------EEHHHH-HHHH------------ FREQ --EEEEE---------------------EEEEEEE--------------EEEEEE---E-E-----EEEEEEE----HH-HHHHHHH---- PSSM -HHHHHHH---------------------EEEEEE--------------EEEEEE------------EEEEEE----EE-EEE-------- FINAL -HHHHHHH--------------------EEEEEEE-------------EEEEEEE---E--------EEEEEE-----E-EEHHHH----- CAL7103_RS0100030_Calothrix_sp_PCC_7103_737187200 KDIVDKIRERTKLP--------NPYRVGEVCLILPK-DNPD-LRGKSGCWCVVTH---V-G--D--FSCTIDTW-DNEY-TVKIEHLKSLE CAL7103_RS0150440_Calothrix_sp_PCC_7103_518327692 KDIVNRIRERTKLP--------NPHRVGEVCMILPK-DNPD-LRGKSGFWCVIVG---V-G--D--FSCTVETW-DGEY-TVKIEHLKSLE SD81_RS35605_Tolypothrix_campylonemoides_751574024 KDIVQRIRERTKVP--------NPYQVGEVCRILPK-DNPE-LKGKSGCWCIVTY---V-A--D--YSCTVTTW-DCEY-VVKLEHLKSLD CYLST_RS31010_Cylindrospermum_stagnale_505141377 KSIVDKIRERTKLP--------NPYRLGEVCQILPK-DNPE-LKGKSGCWGIVTH---L-G--D--YSCTITTW-DGEY-TVKIENLKSLE CWATWH0402_1321_Crocosphaera_watsonii_WH_0402_543538779 KSIVDQIRERTPVP--------NPWRKGEVAMIMVK-DNPD-LRGKGGCWCVISE---V-H--N--FTCTVRLW-DGEY-QVKPENLKELP CWATWH0402_RS27635_Crocosphaera_watsonii_737861903 KSIVDQIRERNPVP--------NPWRVNEVAMIMVK-DNPE-LRGKGGCWCVITE---V-H--N--FSCTVRLW-DGDY-QVKPENLKELP CWATWH0402_1907_Crocosphaera_watsonii_WH_0402_543531309 KNIVDQIRERSPVP--------NPWRKGEVCMILVK-DNPD-LRGKGGCWCVITE---V-H--N--FSCTVSLW-DGDY-QVKPENLKELP _Crocosphaera_watsonii_494523440 KSIVDQIRERNPVP--------NPWRVNEVAMIMVK-DNPE-LRGKGGCWCVITE---V-H--N--FSCTVRLW-DGDY-QVKPENLKELP CWATWH0003_RS12720_Crocosphaera_watsonii_737859551 KSIVSQIRERNPIP--------NPWRKGEVAMIIVK-DNPD-LRGKGGCWCVITE---V-H--N--FSCTVRLW-DGNY-QVKPENLKDLP _Crocosphaera_watsonii_546230520 KSIVSQIRERNPIP--------NPWRKGEVAMIIVK-DNPD-LRGKGGCWCVITE---V-H--N--FSCTVRLW-DGNY-QVKPENLKDLP CWATWH0003_2673b1_Crocosphaera_watsonii_WH_0003_357263649 
KSIVSQIRERNPIP--------NPWRKGEVAMIIVK-DNPD-LRGKGGCWCVITE---V-H--N--FSCTVRLW-DGNY-QVKPENLKDLP NPUN_RS34540_Nostoc_punctiforme_501381481 KDVVQRIMERTQVP--------NTYQIGEVCQILAK-DNPE-LRGKAGCWGIVNH---V-G--E--FSCTVKTW-DGEY-TVGLQHLKSFN FIS9431_RS33115_Fischerella_sp_PCC_9431_737134277 KDIVQRIMERTKVT--------NPYRVGEICQIIAK-DNPE-LRGKGGCWCIVNK---V-N--D--FSCTVIAW-DKEY-TMRIEHLKSLD STA7437_RS23975_Stanieria_cyanosphaera_753865019 RDLVQRIKEKTKVP--------NPYRVGEVCQIVAK-DNLE-LRGKGGCWCIVSA---V-H--D--FSCTVNTW-DCEY-LVKLEHLKSLD FIS9431_RS31145_Fischerella_sp_PCC_9431_737132827 KDIVQRIMERTKVA--------NPYQLGEVCQIIAK-DNPE-LRGKGGCWCIVSQ---V-N--D--FSCTVTTW-DGEY-SIALQHLKSFD FDUTEX481_04300_Tolypothrix_sp_PCC_7601_407266570 QDIVDKIRERTKVP--------NPYRLGEVCTLLPK-DNPE-LKGRSGCWGVITH---V-G--D--YSCTLETW-DAEY-TVKIEHLKSLE FDUTEX481_RS32065_Tolypothrix_sp_PCC_7601_797212629 QDIVDKIRERTKVP--------NPYRLGEVCTLLPK-DNPE-LKGRSGCWGVITH---V-G--D--YSCTLETW-DAEY-TVKIEHLKSLE Sta7437_4876_Stanieria_cyanosphaera_PCC_7437_428272365 RDLVQRIKEKTKVP--------NPYRVGEVCQIVAK-DNLE-LRGKGGCWCIVSA---V-H--D--FSCTVNTW-DCEY-LVKLEHLKSLD DA73_0214905_Tolypothrix_bouteillei_VB521301_744450902 NDIVEQIMKRTKVP--------NPYQVGEVCQILPK-DNPE-LRGKSKCWCIVTE---V-N--D--FSCVVRAW-DGEY-VVKMDHLKSLD NPUN_RS35730_Nostoc_punctiforme_753811080 KDVVQQIMERTKVP--------NTYQIGEVCQILAK-DNPE-LRGKGGCWGIVNH---V-G--E--FSCTIKIW-DGEY-TVGLQYLKSYN AVA_RS27595_Anabaena_variabilis_499635872 KDVVQRIMESSKAP--------NPYRVGEVCQFVVK-DNPD-LRGMGSCWCIVTH---V-G--E--FSCTVTAW-NGEY-TVRTDHLKPMD Npun_BR102_Nostoc_punctiforme_PCC_73102_186469442 KDVVQQIMERTKVP--------NTYQIGEVCQILAK-DNPE-LRGKGGCWGIVNH---V-G--E--FSCTIKIW-DGEY-TVGLQYLKSYN _Microcystis_aeruginosa_763118064 KDIVQRIRERTKIP--------IPYRVGDVCEILIK-DNPE-LRGLGGCWCIVIE---V-R--E--FSCLVRTW-NGEY-TVREENLRDLQ DA73_0203765_Tolypothrix_bouteillei_VB521301_744452929 KDVVQRIMERTKVP--------NPYRVGEVCQILPK-DYPD-LRGKGKCWCIVSQ---V-N--D--LSCTVTAW-DGDY-IVKMDCLKSLD alr7299_Nostoc_sp_PCC_7120_17135837 
KDVVQRIMERTQVP--------NSYQIGEVCQILAK-DNPE-LRGKGGCWCIVVA---V-H--D--FSCTVRIW-DGEL-TAGLKHLKSFD MICAK_2860002_Microcystis_aeruginosa_PCC_9701_389882556 KDIVQRIRERTKIP--------IPYRVGDVCEILIK-DNPE-LRGLGGCWCIVIE---V-R--E--FSCLVRTW-NGEY-TVREENLRDLQ _Nostoc_sp_PCC_7120_764953510 KDVVQRIMERTQVP--------NSYQIGEVCQILAK-DNPE-LRGKGGCWCIVVA---V-H--D--FSCTVRIW-DGEL-TAGLKHLKSFD NPUN_RS34090_Nostoc_punctiforme_501381405 KDVVDRIRERTKVP--------NPYHIGEICILLPK-DDPD-LRGKAGYWGVVSH---V-G--D--YSCTVQTW-DGDY-TVKIEHLKLLE CAL7103_RS0120705_Calothrix_sp_PCC_7103_737188140 KDIVTRIRERTKLP--------NPYREGEICVLLPK-NNPD-LRGKSGCWGAITH---V-G--D--YSCTIETW-DGEY-TVKIQHLLSLN N44_RS02080_Microcystis_aeruginosa_779871805 KDIVQRIRERTKIP--------IPHRVGDVCEILIK-DNPE-LRGLGGCWCIVIE---V-R--E--FSCLVRAW-NGEY-TVREENLRDLQ PLEUR7319_RS0114150_Pleurocapsa_sp_PCC_7319_518335686 KDVVQRIKDKKRPP--------ITLRVGEVCFLIAK-DNPE-LRGKSGCWCIVSE---V-Y--K--FSCSVATW-DNEY-ILRPEHLQSLE N44_02315_Microcystis_aeruginosa_NIES-44_718251661 KDIVQRIRERTKIP--------IPHRVGDVCEILIK-DNPE-LRGLGGCWCIVIE---V-R--E--FSCLVRAW-NGEY-TVREENLRDLQ CY51472DRAFT_RS0225515_Cyanothece_497232044 QEVVQNMQQQKKIP--------NPWHVGEVAQIVVK-GNSD-LKGKGGCWAVIVA---V-N--D--FSCSVQLW-DGEY-QVKPENLKELP ANACY_RS28045_Anabaena_cylindrica_505030514 KSIVDKLREKTNLP--------NPYYLGQVCQILPK-DIPE-LKGKNGCWGIITH---V-G--S--YSCRITTW-NGEY-LVKIENLKSLD ANA7108_RS0100620_Anabaena_sp_PCC_7108_515515560 KSIVDKLREKTNLP--------NPYYLGQVCQILPK-DIPE-IKGKNGCWGIITH---V-G--N--YSCTITTW-EGEY-LVKIENLKSLD FDUTEX481_RS10740_Tolypothrix_sp_PCC_7601_797208446 KDVVQRIMERTQVL--------NSYQLGEVCQILAK-DNPE-LRGKGGCWAIVAQ---V-N--N--FSCTVRNW-DGEL-TVGLKHLKSYE STA7437_RS22850_Stanieria_cyanosphaera_505024902 KDVVRRMKEKNSAP--------ISFRVGEVCQILAK-DNPE-LRGKSGCWCIVSE---V-Y--E--FSCLVDTW-NERY-LLRGENLSSLD AVA_RS26020_Anabaena_variabilis_499635567 QDVIDRIRDRTSVP--------NPYQVGEICVLHPK-DNPD-LRGKSGYWGVVTH---V-G--E--YSCTVKCW-DGDY-TAKVEHLKSLE PCC7120DELTA_RS29565_Nostoc_sp_PCC_7120_499309017 
QDVIDRIRDRTSVP--------NPYQVGEICVLHPK-DNPD-LRGKSGYWGVVTH---V-G--E--YSCTVKCW-DGDY-TAKVEHLKSLE CWATWH0401_RS11050_Crocosphaera_watsonii_494518295 RSIVDEIRERRPVP--------NPWRVGEVAQIIIK-GNPD-LKGRSGQWCIIEE---V-L--N--FSCLVKTW-DGII-QVKLENLKDVY DA73_0201705_Tolypothrix_bouteillei_VB521301_744453553 KDIVEQIMERTKVP--------NPYQVGEVCQILPK-DNPE-LRGKSKCWCIVTE---V-N--D--FSCVVRAW-DGEY-VV--------- cce_2332_Cyanothece_sp_ATCC_51142_171698701 RSVVDEIRERRPVP--------NPWRVGEVAQIVMK-RNPE-LKGRSGQWCIIEE---V-L--N--FSCLVKTW-DGII-QVKIENLKDVY CY51472DRAFT_RS0216845_Cyanothece_497230707 RSVVDEIRERRPVP--------NPWRVGEVAQIVMK-RNPE-LKGRSGQWCIIEE---V-L--N--FSCLVKTW-DGII-QVKIENLKDVY CY0110_RS20455_Cyanothece_sp_CCY0110_495552874 ISVIDEIRERRPVP--------NPWRVGEVAQIVVK-KNPD-LKGRSGQWCIIEE---V-L--N--FSCLVKTW-DGTI-QVKIENLKDVY XEN7305_RS25800_Xenococcus_sp_PCC_7305_750617827 KEVVKRMKDNNPKP--------IPFRVGEICQIMTK-DNPE-LRGKGGCWCIVKD---I-Y--K--LSCQVSTW-NDNY-ILRAENLKSLG Xen7305DRAFT_00000510_Xenococcus_sp_PCC_7305_442790849 KEVVKRMKDNNPKP--------IPFRVGEICQIMTK-DNPE-LRGKGGCWCIVKD---I-Y--K--LSCQVSTW-NDNY-ILRAENLKSLG CAL6303_RS10710_Calothrix_parietina_505010770 LDIVQQIKDKNPLP--------NPYHIGEVCRILPS-TDPD-LKPFSGCWCIVYE---I-N--P--HSCGVKTW-KANLATVKPEYLERID CAL7103_RS55850_Calothrix_sp_PCC_7103_737188944 PDIVEQIKNKKRVP--------NPHHVGEVCRIISK-GNPD-LKPVAGAWCIVTQ---V-N--P--HSCGIKTW-KMDFPAVKPENLEITY _Cyanothece_sp_PCC_7822_754536191 EALKQRVREKSLAP--------FPYNEGDVCKILVK-GNPD-LKGKGGHWGVIVA---I-N--N--FSADIQLA-DGIY-LAKEENLEELP CAL7103_RS55635_Calothrix_sp_PCC_7103_737188848 TDIVEQIKNKQRVP--------NPHYKGQVCQIIAC-GDKE-LKKFAGCWAIIIQ---V-N--E--HSCNIQTW-REDIPTVKPENLKPLD WA1_RS0149465_Scytonema_hofmanni_703031009 LDITQQIKNKQRVP--------NPRRVGEICQIVAQ-GDPE-LKKFSKCWGIIKE---V-N--L--HSCYIQTW-KMDFPTVKPENLEPVY RIV7116_RS23620_Rivularia_sp_PCC_7116_504933746 TSIVQQIKDKQRVP--------NPRIVGEICQIISK-GDPD-LKQYSRCWCIINA---V-N--L--HSCAIKTW-KMDFPTVKPENLEPTY Cyan7822_6833_Cyanothece_sp_PCC_7822_306986606 
EALKQRVREKSLAP--------FPYNEGDVCKILVK-GNPD-LKGKGGHWGVIVA---I-N--N--FSADIQLA-DGIY-LAKEENLEELP SD81_RS27565_Tolypothrix_campylonemoides_751570983 KGIVEKLKQKPLFL-----AT-DFCQVGDVFILTRL-EGAE--RKYNGCWAIASV---L-N--D--FTVEVDVH-DTTL-NVKPENLNKID SYN7509_RS0223705_Synechocystis_sp_PCC_7509_740179759 KGIVERLKEKPLVK-----AS-DFCQIGGVFILTRL-EGNE--RKYNGCWAIASE---L-R--E--FTVVVDVY-DGEF-AVKPENLNSID SYN7509_RS0222055_Synechocystis_sp_PCC_7509_655839534 KSIVERLKEKPLVK-----AS-DFCTIGDPFILTRL-EGAE--RKYNGCWAIARE---H-R--D--FTIAVDVY-DGEL-AVKPENLNPID UH38_20050_Chroococcales_cyanobacterium_CENA595_768384071 KDIVQRLKEKPLAL-----AS-DYCSIGDVFTLTRL-EGIE--RKYNGCWAIAKE---L-R--D--YTIAVDVH-DTTL-SVKPDNLQPLD UH38_RS20080_Chroococcales_cyanobacterium_CENA595_769922127 KDIVQRLKEKPLAL-----AS-DYCSIGDVFTLTRL-EGIE--RKYNGCWAIAKE---L-R--D--YTIAVDVH-DTTL-SVKPDNLQPLD QH73_RS02585_Scytonema_millei_748134961 KGIVERLKEKPRLH-----AA-DFCCIGDVFVLTKL-EDSD--RKYNGYPCIAVE---L-K--Q--FSVDVDVH-DTTL-TVKPENLKKVD _Cyanothece_sp_PCC_7424_752567338 EELKQRVREKANVP--------FPYKVNDVCKIIVK-ENPQ-LRGKSGHWGIIVE---V-M--N--FSANIQLA-DGIY-QVKEENLEELS UYC_RS0100505_Chlorogloeopsis_fritschii_515385753 KGIVEQLKDKPLLL-----AS-DFCQIGDVFTLTRL-EGTE--RKYNGCWAIAVA---L-K--E--FSVEVDVH-DTTL-NVKPENLNKID TOL9009_RS0101730_[Scytonema_hofmanni]_UTEX_B_1581_657929542 KGIVERLKEKPLFL-----AT-EFCQIGDVFTITKL-EGVE--RKYNGCWAIAVA---L-N--D--FTLEVDVH-DTTL-NVKPENLNKID SYN7509_RS26630_Synechocystis_sp_PCC_7509_740179430 RGIVERLKEKPLVK-----AS-DFCTVGDPFILTRL-EGAE--RKYNGCWAIARE---L-R--D--FTIAVDVH-DTTL-AVKPDNLDPID PCC7424_5430_Cyanothece_sp_PCC_7424_218175274 EELKQRVREKANVP--------FPYKVNDVCKIIVK-ENPQ-LRGKSGHWGIIVE---V-M--N--FSANIQLA-DGIY-QVKEENLEELS UYG_RS0120335_Fischerella_muscicola_515347403 KGIVEQLKEKPLLL-----AS-NFCQIGDVFTLTRL-EGTE--RKYNGCWAIAVV---L-K--E--FSVEVDVY-DTTL-NVKPENLNKID _Cyanothece_sp_PCC_7424_501601085 REVVEQYKEKPE----H-----NPFELGEVVGVESK-DNPL-LRGRNGAWGIVTG---V-S--K--HHCNLQLW-DTEFEEVGVEYLKELN PCC9339_RS0106675_Fischerella_sp_PCC_9339_515877940 
KGIVEQLKEKPLLL-----AS-DFCQIGDVFTLTRL-EGTE--RKYNGCWAIAVV---L-K--E--FSVEVEVH-DTTL-NVKPENLNKID Glo7428_4930_Gloeocapsa_sp_PCC_7428_428267400 KGIVEQLQEKPLLQ-----AR-DFCTCGDVFTLVKL-EGSM--RKYNGYWAIVCS---I-N--T--FTIAVDVH-DTTI-LVKPENLQPID GLO7428_RS24200_Gloeocapsa_sp_PCC_7428_754508876 KGIVEQLQEKPLLQ-----AR-DFCTCGDVFTLVKL-EGSM--RKYNGYWAIVCS---I-N--T--FTIAVDVH-DTTI-LVKPENLQPID PLEUR7319_RS33990_Pleurocapsa_sp_PCC_7319_738911651 -IVAQAVRLHQAAE--QN-LV-NPFTSGEICRLVVR-DNSQ-LKGKGGCWCIVDQ---V-Y--L--SSCTVNTW-SDEF-EVPIENLESLG QH73_RS11110_Scytonema_millei_748136693 KGIVKQLKEKGLRY-----AT-EFCSVGDVFVLTKL-EDSE--RKYNGCPCVAIE---L-K--Q--FTVDVDVH-DTTL-TVKPENLQKLD PCC7424_5542_Cyanothece_sp_PCC_7424_218175378 REVVEQYKEKPE----H-----NPFELGEVVGVESK-DNPL-LRGRNGAWGIVTG---V-S--K--HHCHLQLW-DTEIEEVGVEYLKELD CHRO_RS28535_Chroococcidiopsis_thermalis_752825464 KTVVERMKEKQLFP-----AR-DFCAVGDVFTLTRL-HSRE--RKYNGYPCVALV---L-K--D--FTIEVDVY-DGTL-IVKPENLKPID _Cyanothece_sp_PCC_7424_752567372 REVVEQYKEKPE----H-----NPFELGEVVGVESK-DNPL-LRGRNGAWGIVTG---V-S--K--HHCHLQLW-DTEIEEVGVEYLKELD Chro_5819_Chroococcidiopsis_thermalis_PCC_7203_428013042 KTVVERMKEKQLFP-----AR-DFCAVGDVFTLTRL-HSRE--RKYNGYPCVALV---L-K--D--FTIEVDVY-DGTL-IVKPENLKPID QH73_RS16405_Scytonema_millei_748137603 KGIVEQLKEKPLVI-----AK-DFCQVGDVFTLVRL-EGKE--KKYNGCSCVAVE---S-R--D--FTVMVEVH-DTTL-TVKPENLNKID CY0110_RS14620_Cyanothece_sp_CCY0110_737832178 KQVVREMTREDA----D-----NPFELGEVVGIVAQ-DNPD-LKGKNGCWGIVTA---L-T--K--TTCNLQTW-NDELEAIEIEFLRELE CWATDRAFT_RS29615_Crocosphaera_watsonii_757158775 KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEGVEIEFLQELE CY0110_32445_Cyanothece_sp_CCY0110_126620031 KQVVREMTREDA----D-----NPFELGEVVGIVAQ-DNPD-LKGKNGCWGIVTA---L-T--K--TTCNLQTW-NDELEAIEIEFLRELE CWATWH0401_4234_Crocosphaera_watsonii_WH_0401_543428839 KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DTELEGVEIEFLQELE CWATDRAFT_RS03435_Crocosphaera_watsonii_494514224 
KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DTELEGVEIEFLQELE _Crocosphaera_watsonii_494523812 KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEEVEIEFLQELE CWATWH0005_RS08635_Crocosphaera_watsonii_737857352 KQVVREMTREEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---V-S--K--NTCDLQTW-DSELEGVEIEFLQELE _Crocosphaera_watsonii_546220971 KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DTELEGVEIEFLQELE CwatDRAFT_0109_Crocosphaera_watsonii_WH_8501_67852287 KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEGVEIEFLQELE _Crocosphaera_watsonii_494519775 KQVVREMTREEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---V-S--K--NTCDLQTW-DSELEGVEIEFLQELE _Crocosphaera_watsonii_546222413 KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEGVEIEFLQELE _Crocosphaera_watsonii_546206668 KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEGVEIEFLQELE CWATWH0005_RS11590_Crocosphaera_watsonii_737862397 KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEEVEIEFLQELE CWATWH0005_RS00905_Crocosphaera_watsonii_494523801 KQVVREMTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEGVEIEFLQELE CYAN8802_RS22020_Cyanothece_sp_PCC_8802_752568031 KQVVRQLTRIPG----S-----NPFEVGEVVGIVAK-DHPG-VRGRNGSWAIVTA---I-A--S--DTCDLQLW-DTFLEGIESEYLKEMD Cyan8802_4571_Cyanothece_sp_PCC_8802_256592473 KQVVRQLTRIPG----S-----NPFEVGEVVGIVAK-DHPG-VRGRNGSWAIVTA---I-A--S--DTCDLQLW-DTFLEGIESEYLKEMD CY51472DRAFT_RS0223830_Cyanothece_497231939 KKVVREMTRGDE----D-----NPFELGEVVGIVAQ-DNPE-LKGKNGCWGIVTA---L-T--K--TTCDLQIW-DTELEGIEIEFLRELE CY0110_RS25950_Cyanothece_sp_CCY0110_495554039 KQVVREMTREDA----D-----NPFELGEVVGIVAQ-DNPQ-LKGKNGCWGIVTA---L-T--I--TTCDLQTW-DNELEAIEIEFLRELE _Nostoc_sp_PCC_7120_764953501 ----------------------DPYRVGEICTLQPK-DNPD-LRGKSGYWGVVTH---V-G--E--YSCTIKWW-DGDY-TAKVEHLKLLE 
MICAB_RS03030_Microcystis_aeruginosa_763118968 RETVGKYRDKGK----A-----NVFEVGEVVGILAK-DNPR-LKGKNNCWAIVTA---V-H--P--RSCDLRLH-DGAIDLVKIEYLKELG MICAB_900014_Microcystis_aeruginosa_PCC_9717_389714985 RETVGKYRDKGK----A-----NVFEVGEVVGILAK-DNPR-LKGKNNCWAIVTA---V-H--P--RSCDLRLH-DGAIDLVKIEYLKELG QH73_RS07795_Scytonema_millei_748135946 KDIVVQLKQKELFP-----IA-HFCQVGDAFTLMRL-EGCE--RKYNGYPGVATK---L-K--D--FTIEVEVF-DGTM-AVKPENLRPID MAE_RS14770_Microcystis_aeruginosa_501223295 RETVGKYRDKGK----A-----NVFEVGEVVGILAK-DNPR-LKGKNNCWAIVTA---V-H--L--RSCDLRLH-DGAIDLVKIEYLKELG CWATWH0402_3406_Crocosphaera_watsonii_WH_0402_543537000 ------MTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEEVEIEFLQELE CWATWH0402_3406_Crocosphaera_watsonii_WH_0402_543531669 ------MTRDEK----D-----NPFELGEVVGIMSL-DNPD-LKGKNGCWAIVTG---L-S--K--NTCDLQTW-DSELEGVEIEFLQELE Ava_B0348_Anabaena_variabilis_ATCC_29413_75705382 -----------MLPCDRSWCNSDPYRVGEICTLQPK-DNPD-LRGKSGYWGVVTH---V-G--E--YSCTIKCW-DGDY-TAKVEHLKLLE QH73_RS10255_Scytonema_millei_748136445 KGIVEQLKEKPLRY-----AT-DFGQVGDAFTLMRL-EGCE--RKYNGYPSVAIE---L-R--D--FTILVEVF-DGTM-AVKPENLKPID UH38_RS16060_Chroococcales_cyanobacterium_CENA595_769921346 KSIVEQLKERPLRY-----AT-EFCAVGDVFTLTQL-EGEE--RRYNCYPCVAVD---L-N--E--FTVKVDVC-NTTI-AVKPENLKSVD QH73_RS16190_Scytonema_millei_748137500 KDIVVQLKQKELFP-----IA-HFCQVGDAFTLMRL-EGCE--RKYNGYPGVATK---L-K--D--FTIEVEVF-DGTM-AVKPENLRPID KV40_RS24315_Myxosarcina_sp_GI1_738538439 AEAVRLYLAKDSDR--E-----NPFVEREVCRIVVK-GNSK-LKGKGDCWCIVSQ---V-L--A--NSCMVDTWAEDNI-EVPIENLESMG KV40_RS30300_Myxosarcina_sp_GI1_738540904 KEAVKAYLKQKYPP--V-----NPFSPGEICRI-----TSE-VPGKKNCWCVVAE---L-R--K--DECVVDTW-DDRF-VVSVEDLVPLK UH38_RS09160_Chroococcales_cyanobacterium_CENA595_769920071 RAVVEQLQAKPLVL-----AQ-DYCQVGDVFILTRL-QGKD--SQYNGCWAIALK---P-T--N--STVIVDVH-DATL-TVKPNNLNKID KV40_RS29900_Myxosarcina_sp_GI1_738540774 KTAVKTYLNQKYPP--V-----NPFTEGQICRI-----SSG-IKGKLHCWCVISE---V-R--K--DKCVVDTW-DSQY-VVSVEDLQEMK 
KV40_RS28185_Myxosarcina_sp_GI1_738539875 KEAVKAYLKQKYPP--V-----NPFSPGEICRI-----TSG-VPGKKNCWCVVAE---V-R--K--DECVVNTW-DDRF-VVSVDDLLSLK CY51472DRAFT_RS0225395_Cyanothece_497232068 KKVVREMTREDA----E-----NPFELGEVVGIVAQ-DNPE-LKGKNGCWGIVTA---L-T--K--TTCNLQTW-NDELEEIEIEFLRELE PLEUR7319_RS33705_Pleurocapsa_sp_PCC_7319_738911416 KSAVKAYLKQKYPP--V-----NPFSIGEICRI-----KSG-VPGKQNCWCVVAE---V-S--Q--DECVVDTW-DDRF-VVSVDDLMPMK CYAN7822_RS28405_Cyanothece_sp_PCC_7822_503099618 GEVVKQLQNITIEQ--------NPFEVGDICLIAPG-DNPA-LRGLQGCWAIIAA---K-N--E--FYCDIKTS-LGLIRQISSSYLLHSA QH73_RS12040_Scytonema_millei_748136747 KGIVERIKEKPLHL-----AS-DYCKEGEVFFLQRL-VGIE--KHYNGCWAIAIE---VES--R--FTIKAAVY-DGTL-ELRQENMKPID QH73_RS10490_Scytonema_millei_748136457 KNAVERIRKKTLHL-----AS-DYCQDGEVFFLQRL-SGKE--KKYNGCWAIALE---VEN--R--LTVKAAVY-DGTL-ELRQEQMKSID CY0110_RS21885_Cyanothece_sp_CCY0110_495553174 REVVRSSQQKEVQT--S--SF-SKKDIGAVIEIIKV-EGNDSLRGQKGNYGIITG---V-N--N--FSVSIETA-MGKYDTIPQQCVRKMP CY51472DRAFT_RS0223475_Cyanothece_497232009 REVVRASQQKEVQA--R--SF-STEDIGAVVEITKV-EGNDNLRGQKGNYGIITG---V-N--N--FSVSIETA-IGEYDTIPQQCVRKMP PCC9339_RS0103785_Fischerella_sp_PCC_9339_648361686 EALLHVLIKHGQVQ--------NSYSIGEVCQIIAK-NNPQ-LKDKNGCWCIVKE---I-DATT--CTCTVATI-YGLC-SLSVDHLKSLN Nfla_3501_Nostoc_flagelliforme_str_Sunitezuoqi_308205593 KDVVQRMMERTQVP--------NTSQIGEVCQILVK-DNPL------TIYVVNEQGKKI-R--T--SAIPITN--DGTY-SGYKEFLLILE consensus/95% ......h.p..................Gplh.l..........+.b....slh.....l........ph.lp...p..h..h..p.L..h. consensus/90% b.hlpph.pp..............hp.G-lh.l..b..s....+.bss.aslh.....l....p...oh.lp.a.ss.h..l..c.Lp.h. consensus/85% b.lVpph.cc............s.hphG-lh.l..b.pssc..+.bsGhWslh.....l....p...oh.lp.a.ssph..lp.-.Lp.h. consensus/80% cplVpphpcc............s.hplG-Vh.l..b.css-..+.bsGhWsll.....l....p...oh.lp.a.-sph..lc.E.Lp.l. 
consensus/75% +plVpphpcc............sshplG-Vh.l..b.-ss-.l+sbsGhWslls....l....p...oh.lpsa.Dsph..Vc.E.Lpplp consensus/70% +plVpphpc+pb..........ssaplG-Vh.lh.b.-ss-.L+GbsGCWsIls....l.p..p...oCslpsa.Dsph..Vc.EpLpplp
Str-1 Str-2 Str-3 lost Str-4 Str-5 Str-6 Str-7 ** * * * * RES -------M------PASDHCETPREAYADIAPV---VSAIAEMC----------------GKTDDS-VTIYDPY-FCAGAMKQRLASC--------GF-P-NIINRNS---------DFYED---IR-T-NN-I-P-EY-DILITNPPYSTEPYNHIKR-----------------------------------LMQFL----------A--------DMGKPFFILQPVYVYTKPYYQAAR----------------------------------------------------------------------------ERLGE------------G--------------------------------------------------------------------C-F-FITPS--HR---YRF---------------------------------------ET--P-QG-MRNV---------------RQQE-LI-----------------------------------------------------------TS---PFVSL-WYCYVP-A------HM--------------------------FPKLRR---------WWC------AE-G------HR----LSQGCV--MRAQSR ALIGN -----------------------HHHHHHHHHH---HHHHHHH------------------------EEEE--------HHHHHHHH--------------EEEE-------------EEEE------------------EEEE-----------HHHH-----------------------------------HHHHH----------H--------H----EEEE----HH-----H-EH----------------------------------------------------------------------------H-----------------------------------------------------------------------------------------EEE---------EEE---------------------------------------------------------------------------------------------------------------------------------------------HE-EEEE---------------------------------------HHHHH---------HHH------HH-H-------------------------- HMM 
----------------------HHHHHHHHHHH---HHHHHHH------------------------EEEEEEE-E--HHHHHHHHH--------------EEEE--E---------EEEEE---E-----------EE-EEEEE--------HHHHHH-----------------------------------HHHHH----------H--------H---EEEEEE-HH----HHHHHHH----------------------------------------------------------------------------HH------------------------------------------------------------------------------------E-E-EEE---------EEE---------------------------------------E-----------------------------EE-EE-----------------------------------------------------------E------EEE-EEEE-----------------------------------------EEE---------EEE------E-------------------EE--EE---- FREQ -----------------------HHHHHHHHHH---HHHHHHH--------------------------EE--------HHHHHHHH------------E-EEE--------------EEEE----------------E-EEEE--------HHHHHHH-----------------------------------HHHHH----------H--------H----HHHH---HHH------HHH------------------------------------------------------------------------------------------------------------------------------------------------------------------E-E-EEE---------EEE-------------------------------------------------------------------------------------------------------------------------------------------EEEE-EEEE----------------------------------------EEEE---------EEE------------------------------------ PSSM 
-----------------------HHHHHHHHHH---HHHHHH--------------------------EEEE------HHHHHHHHH----------------EE------------------------------------EEE--------HHHHHHH-----------------------------------HHHHH----------H--------H---EEEEEE---------HHHHH----------------------------------------------------------------------------H-------------------------------------------------------------------------------------E-E-EEE-------------------------------------------------------------------------------------------------------------------------------------------------------EEEE-EEE----------------------------------------HHHHH---------HEH------------------------------------ FINAL -----------------------HHHHHHHHHH---HHHHHHH-------------------------EEEEE-----HHHHHHHHH--------------EEEE------------EEEEE----------------E-EEEE--------HHHHHHH-----------------------------------HHHHH----------H--------H----EEEEE---------HHHHH----------------------------------------------------------------------------H-------------------------------------------------------------------------------------E-E-EEE---------EEE-------------------------------------------------------------------------------------------------------------------------------------------EEEE-EEEE----------------------------------------EEEE---------EEE------------------------------------ PTSG_07527_Salpingoeca_rosetta_514687493 
-------M------PASDHCETPREAYADIAPV---VSAIAEMC----------------GKTDDS-VTIYDPY-FCAGAMKQRLASC--------GF-P-NIINRNS---------DFYED---IR-T-NN-I-P-EY-DILITNPPYSTEPYNHIKR-----------------------------------LMQFL----------A--------DMGKPFFILQPVYVYTKPYYQAAR----------------------------------------------------------------------------ERLGE------------G--------------------------------------------------------------------C-F-FITPS--HR---YRF---------------------------------------ET--P-QG-MRNV---------------RQQE-LI-----------------------------------------------------------TS---PFVSL-WYCYVP-A------HM--------------------------FPKLRR---------WWC------AE-G------HR----LSQGCV--MRAQSR PTSG_04809_Salpingoeca_rosetta_514693110 RQ--W-QC------EDADHCETPLDAVRDIEPI---LEELCQAL----------------GKERAA-LRIYDPY-YCAGGIRVHLARL--------GF-T-SVYNKPE---------DFYSV--AEK-K-AA-S-P-DF-DVVVTNPPYSSVPVNHVLK-----------------------------------LARFL--------------Y----SQPKPWLVVQPNYVYTKGPWHALV-----------------------------------------------------------------------------------R---------K-------H--------------------------------------------------------HPQ-P-F-YVIPG--TP---RAY------------V----Y-Q--------------------A--P-AT-MRGD------------VQRKKQL----K---------------------------------------------------------TS---PFVTF-WFCNLR-E------HQ--------------------------QRVFEK-----------V------QQ-G--LLS-SR----PHVGVA--VKSF-- MONBRDRAFT_26429_Monosiga_brevicollis_MX1_167525218 
FP--W-EA------DANDHCETPYLAYAHIAPL---LRWLAKSL----------------GKTSQE-LVIYDPY-FCAGSMRAHLARL--------GF-E-TVINECR---------DFYAD---VA-S-GN-V-P-DY-DVLVTNPPFSTTGQDHVAL-----------------------------------LCQFL--------------Q----KTPKPFCVVQPNYVYTKAPWAALT-----------------------------------------------------------------------------------QTTG------R-------A--------------------------------------------------------APF-P-F-YLTPR--QP---RQY------------V----Y-E--------------------T--P-PA-FRP--------------LNAKQR----K---------------------------------------------------------TS---PFVTL-WYCRLP-A------HM--------------------------QGAAFR---------WMV------TP-G------CK----LPLRLS--IVST-- MONBRDRAFT_25858_Monosiga_brevicollis_MX1_167523896 HA--F-KT------EREDHCETPFEAYKDIAPL---LHQVAGML----------------GKNADE-VQLYDPY-YCAGTTKKHLARL--------GF-P-NVYNENV---------DFYEA---VR-S-EQ-T-P-AY-DILITNPPFSNTHVNHIKR-----------------------------------LMRFV----------A--------NSGKPFFILQPNYVYTKPYYQEVR----------------------------------------------------------------------------A------------------------------------------------------------------------------------------------------------------------------------------------A--P-MP-LPSV---------------HKPR---------------------------------------------------------------Q---PF------CSVS-S------HR--------------------------ISSDDRDPADCSGPGSAA------CR-G------YT----QPPICV--RHPRGL ACA1_068850_Acanthamoeba_castellanii_str_Neff_470514790 
WY--F---------HFGRHHQRVVQWWQAQVQR---QRKASKNT-----K-TKT------KKKKNE-SESEDED-EEDEEEEEKKKAG--------GL-L-PDCVLAR---------DVNSL---P--K-GV-R-P-EA-GR-SGGPPLGG----------------------------------------------------------------------------------QKRRFEEAI--GGGRG-----------------------------------------GRGGRGGGR------------------------------------G--GR------------------------------------------------------------GGD------------GR-G-GSW-----------------R-G-------------------------GG-RGGG-------DRG--GRGGRGR-------GAPSGDG------------------------------------------------GK---RFKSH-HPQPPS-G----------------------------------GGATMR------G-----------AS--------QG----RHTRFD-------- ACA1_037140_Acanthamoeba_castellanii_str_Neff_470455333 HP--F-EA------DPRDHAETPFQAYRDIAPL---LRALAQRLYGSVVGDANDDADGGVAAASRQ-LRIYDPY-FCEGSMVAYLARL--------GF-T-NVYNRNE---------DFYGV---VA-A-GR-V-P-DF-DVLVTNPPFSG---DHMER-----------------------------------IVRWA----------E--------GCGRPWLLLMPDFVANKPYYHAFR---------------------------------------------------------------------------------------------HRCTSTSAA--------------------------------------------------------SGK-P-V-YIGPTR-QA---YVF-----------------A-A-----------------------P-LH-TPDG--------------------------VTP-LVG------------------------------------------------QR---PATAV-VPAGLK-R---R------AADKNEDDENEDDEGEG------------------------------------------------------------- VOLCADRAFT_105839_Volvox_carteri_f_nagariensis_302843234 
HG--F-AT------EPGDHAETPLQAYEHIEPL---LARLATRL----------------GTTKAA-LRIYDPF-FCEGRMLAHMASL--------GF-E-SVHNRNE---------DFYAM---RD-T-KR-T-P-EF-DVLVTNPPFSG---DHIEA-----------------------------------TFEFA----------I--------ASGRPFFILVPQYVSRKAFYLEWL---------------------------------------------------------------------------------------------N--NGCTPG--------------------------------------------------------LPP-P-V-FVGPKH-EP---YIF-----------------L-A-----------------------P-DR-ADVR--------------------------TAR-APG------------------------------------------------QE---PATAA-ATQG--------------DSVPGDDAASRACAGAGVGSADGGLTDAVE---------WST------ET---------------------------- VOLCADRAFT_120421_Volvox_carteri_f_nagariensis_302830991 ---------------------------------------------------------------------------------------------------M-NFIHRNR---------DFYQD---VK-C-GD-V-P-PY-DILVTNPPYSA---DHKER-----------------------------------ILEFC--------------L----RSGKPWALLLPNYVATKAYFGQLL---------------------------DSTTT-------------------------------------------------------------A-------A--------------------------------------------------------SQR-P-F-FLTPQ--AR---YSY-----------------E-H-----------------------P-EG-TGHL---------------------------------------------------------------------------------ES---PFFSI-WYIGLG-E-Q--------------------------------------------------------TE---------------------------- OT_ostta10g00600_Ostreococcus_tauri_693498069 
LA--FGDV------DEADHCETPFRAYRDVEPF---LFAIAKAL----------------KREKKD-LRIYDPY-YCEGSMVEHLRAL--------GF-E-SVYNRNE---------DFYEK---VA-T-KT-T-P-DF-DVLVTNPPYSG---DHFKR-----------------------------------ILTFC----------R--------DSNKPWLLLLPNFVCRKSYYEECV---------------------------------------------------------------------------------------------G-------E--------------------------------------------------------RAK-P-L-FLIPDSTKP---YRY-----------------W-A-----------------------P-GR-NEDG--------------------------VR--AKG------------------------------------------------TT---PFETF-WYIEFS-G-------------------------------TLDPEAQRA---------WWL------KK--------YK----AHSTCD--IPALDE OT_ostta14g01460_Ostreococcus_tauri_693497110 GG----MS------IEDDEWATATRTWRSLFTV---L--------GAT------------ANGFRT-KKVWAPF-VYDELAGKRMREA--------GF-E-RVVHKRW---------DFFDK---VR-D-GPFV-R-SL-DAVVDNPPYTG-K-GMKQK-----------------------------------VLKSL----------V--------DAELPFCLLLPLGVVHGAFVREML---------------------------------------------------------------------------------------------E--ER------------------------------------------------------------YVQ-----IIVPR--KC---YVF---------------------------------------------KK-NGA----------------------------------------------------------------------------------EI---PFKFLVWLCYKM-E-L--------------------------------ERDLYL-ID--------------------------------------------- Ot14g01730_Ostreococcus_tauri_308811312 
GG----MS------IEDDEWATATRTWRSLFTV---L--------GAT------------ANGFRT-KKVWAPF-VYDELAGKRMREA--------GF-E-RVVHKRW---------DFFDK---VR-D-GPFV-R-SL-DAVVDNPPYTG-K-GMKQK-----------------------------------VLKSL----------V--------DAELPFCLLLPLGVVHGAFVREML---------------------------------------------------------------------------------------------E--ER------------------------------------------------------------YVQ-----IIVPR--KC---YVF---------------------------------------------KK-NGA----------------------------------------------------------------------------------EI---PFKFLVWLCYKM-E-L--------------------------------ERDLYL-ID--------------------------------------------- Ot13g02010_Ostreococcus_tauri_308810727 HP--F-ET------DADDHCETSPEAHENVANF---LKAACGQL----------------NKSPAE-LVIYDPY-YCAGGVRRNLALI--------GF-T-NVINRNE---------DFYQV---IE-E-DR-V-P-QH-DVLLTNPPYSE---DHIVK-----------------------------------CVTFA----------AENLA----AHGRPYMLLLPSYVIHKDYYMPALLTGGTRGR-------------------------------EKAKALAAAAERDAKGSDDDDDEAEEMKIHSGS---------------------A--SR---A--------------------------------------------------------QIL-P-F-YVAPK--KR---YYY-----------------W-T-----------------------P-KA-MIKARAAKGQSEES--IKARRKR----TH-IGALGER------------------------------------------------TS---PFLSF-WYCCFG-T-M--------------------------------QRDVLA---------WHK---R--LP--------RA----DVFGYT--LARHPN Ot10g00600_Ostreococcus_tauri_308808370 
LA--FGDV------DEADHCETPFRAYRDVEPF---LFAIAKAL----------------KREKKD-LRIYDPY-YCEGSMVEHLRAL--------GF-E-SVYNRNE---------DFYEK---VA-T-KT-T-P-DF-DVLVTNPPYSG---DHFKR-----------------------------------ILTFC----------R--------DSNKPWLLLLPNFVCRKSYYEECV---------------------------------------------------------------------------------------------G-------E--------------------------------------------------------RAK-P-L---------------Y-----------------W-A-----------------------P-GR-NRR-----------------------------------------------------------------------------------R---RFGTF----EFS-G-------------------------------TLDPEAQRA---------WWL------KK--------YK----AH--C--------- OSTLU_29451_Ostreococcus_lucimarinus_CCE9901_145356641 ------MS------RADDEWATSERTWASLFGV---L----------E------------RNGYAK-KKLWAPF-VYDGDAGRRMRSA--------GF-E-RVVHRRA---------DFFER---VR-D-GVFV-R-SL-DAVVDNPPYTG-K-GMKER-----------------------------------ILKAL----------R--------DAEVPFCLLLPLGVLHAAFVRDIL---------------------------------------------------------------------------------------------E--ET------------------------------------------------------------HVQ-----VIVPR--KC---YVF---------------------------------------------KK-GGT----------------------------------------------------------------------------------EV---PFKFLCWLCYKM-E-L--------------------------------KRDLYF-ID--------------------------------------------- OSTLU_27164_Ostreococcus_lucimarinus_CCE9901_145353330 
HP--F-ET------DADDHCETSPEAHENVANF---LKAACGRL----------------NKTPAD-LVIYDPY-YCAGGTRRNLALI--------GF-T-NVINRNE---------DFYKV---IE-E-GR-V-P-EH-DVFLTNPPYSA---DHIEK-----------------------------------CVTFA----------AENLA----AHGRPYMLLLPSYVIHKDYYLPALLTGGSRGK-------------------------------EKAKSLELQRKDEENNEDDVEENDDGIKRHGGG---------------------A--AR---A--------------------------------------------------------QIL-P-F-YIAPK--KR---YYY-----------------W-T-----------------------P-KA-MVKARAAKGQSEES--AKARRKR----TH-IGALGER------------------------------------------------TS---PFLSF-WYCCFG-S-M--------------------------------QRDVLA---------WHK---K--LP--------RS----EVWGYV--VARHPN OSTLU_26028_Ostreococcus_lucimarinus_CCE9901_145351428 LA--FADA------DEADHCETPFRAYRDVEPF---LFNLAKAM----------------KKEKKT-LRIYDPY-YCEGSMVAHLNAL--------GF-E-NVYNRNE---------DFYEK---VA-T-KS-T-P-EF-DVLVTNPPYSG---DHFKR-----------------------------------ILSYC----------R--------DSNKPWLLLLPNFVCRKSYYAPSI---------------------------------------------------------------------------------------------G-------D--------------------------------------------------------AAK-P-L-FLIPDDAKP---YRY-----------------W-A-----------------------P-GR-QGHE--------------------------IR--AKG------------------------------------------------TT---PFETF-WYIEFA-G-------------------------------VLDAQQQRA---------WWL------KK--------YA----AHSSCS--VPALDE MNEG_5609_Monoraphidium_neglectum_761971839 
HP--F-ET------DPGDHAETPFEAYEHLEPL---LARLARRL----------------GTTKAA-LRVYDPF-YCEGRMVQHMARL--------GF-E-SVYNRNE---------DFYQV---RA-E-GR-C-P-EY-DVLLTNPPFSG---DHLER-----------------------------------IFRFA----------V--------NSNKPWFLLIPQYVARKAFYLEWL---------------------------------------------------------------------------------------------T--NRRPKG--------------------------------------------------------APK-P-S-FVAPTR-QP---YVF-----------------T-A-----------------------P-DR-ADVR--------------------------LMRLADG------------------------------------------------QQ---AAAAA-APAAAQ-R---QVAEGAASQEQGTAQGQVECSGLGADGEAEQHQDQLS---------QQQ------QQQQ---QQQQQ----QQQQQQ--QPALSD MICPUN_64290_Micromonas_sp_RCC299_255088714 GR----MS------VEDDEWATAPRTWAALAPY---L------------------------SDYHD-KKIWAPF-YYDGAAGKRLRDA--------GF-T-RVVHKRE---------DFFKR---VN-D-RVFV-K-SV-SAVVDNPPYTG-K-GMKER-----------------------------------VLRAL----------V--------AVDVPFCLLLPLGVLHTATVREIL---------------------------------------------------------------------------------------------D--PE------------------------------------------------------------HVQ-----ALIPR--RC---WVS---------------------------------------------KS-GQR----------------------------------------------------------------------------------EV---PFKYLVWLCYKM-R-L--------------------------------PRDLVL-MPDT------------------------------------------- MICPUN_103519_Micromonas_sp_RCC299_255086063 
HP--F-EV------DAADHCETPFQAYQDIEPF---LFRMALAL----------------KKPKDK-LRIYDPY-FCEGSVAKHLARL--------GF-T-SVYNKNE---------DFYKC---IE-E-KR-I-P-EH-DVLLTNPPYSG---DHFRR-----------------------------------ILSFC----------A--------KNKKPWLLLLPNFVCRKQYYQPCV---------------------------------------------------------------------------------------------G-------E--------------------------------------------------------DVK-A-L-FLIPDPTKP---YRY-----------------W-A-----------------------P-GR-RGFE--------------------------DRNQAKG------------------------------------------------TT---PFETF-WYVNYA-G-------------------------------LAPHEEVRA---------WWM------KK--------FA----PHSTCT--LPAPDE MICPUN_59514_Micromonas_sp_RCC299_255079482 PS--A-PR------DPADDCETPDVAYAHIAPL---LRKLAQRL----------------GKPPGA-LRIWDPY-YCAGGVKARLGAL--------GFGD-VVNDPDA---------DFYDV---VD-G-SRPP-P-PH-DICVTNPPFSG---NHARR-----------------------------------LFEYLNSRGERGEGKGTTKG----GPVARFVVLAPEYVHRKA-WF------------------------------------------------------------------------------------------------E-------A--------------------------------------------------------PRG-T-F-FMVPS--RR---YSF------------V----A-A--------------------S--G-GR-RENT------------AVDCRHW----RR-ERSCPRGETCPFVHVGPGIDPEGERVAEEARRARANAGFSGGGGGGD---GGRVTVA---PFDCY-WHCHLG-E------FT--------------------------RSVAAA---------WRQ------KH-G------RR----GRVGVR--MVDGVE MICPUN_59025_Micromonas_sp_RCC299_255078302 
HA--F-QV------DADDHCETSPEAHAHILNF---LNKTASAL----------------GKTPKT-LVIYDPY-YCAGGTKRSFAAL--------GF-P-NVINENK---------DFYAV---CQ-R-GE-V-P-EH-DVLVTNPPYSA---DHVER-----------------------------------CITFA----------AKNLY----EHGRPYFLLLPSYCVNKPYYTSALLTGGAAGKKARAAREEAEGGTKTKEEDGRDKKSGDEPRRDENEQQQEEEEEKTEEEEEEEEDGEGFKVHDGGKSRTKKI---------ATRDGG--SR---R--------------------------------------------------------QTL-P-F-YVAPV--KR---YYY-----------------W-T-----------------------P-KP-LIAARKAQQGVELG--GKTRRKK----SH-VGRLGER------------------------------------------------TS---PFLSF-WYCGMG-DDL--------------------------------QPEALR---------WHR---K--LP--------RA----AVGGYT--VARNPN MICPUCDRAFT_53312_Micromonas_pusilla_CCMP1545_303288509 GG----MS------VEDDEWATSARTWRALAPH---L------------------------SAYIA-KKVWAPF-YYDGTAGIRMREA--------GF-R-RVVHTKD---------DFFKR---VN-D-RAFV-K-SL-AAVIDNPPYTG-K-GVKER-----------------------------------VIAAL----------V--------RADVPFCLLLPIGVLHA---QELL---------------------------------------------------------------------------------------------D--AD------------------------------------------------------------KVQ-----TLIPR--RC---RVN---------------------------------------------KA-GGR----------------------------------------------------------------------------------EI---PFKYLVWLCYKM-E-L--------------------------------ERDLVL-MPDE------------------------------------------- MICPUCDRAFT_51291_Micromonas_pusilla_CCMP1545_303284947 
HP--F-EY------DAADHCETPFQAYQDVEPF---LFRVALAL----------------KKPKEK-LVIYDPY-FCEGSVVKHLARL--------GF-A-NVINRNE---------DFYQC---ID-E-KR-I-P-EH-DVLLTNPPYSG---DHFRR-----------------------------------ILSFC----------G--------KSKKPWLLLLPNFVCRKQYYEPAI---------------------------------------------------------------------------------------------G-------A--------------------------------------------------------SSK-P-L-FLIPDQLKP---YRY-----------------W-A-----------------------P-GR-KGYE--------------------------DRTRAKG------------------------------------------------TT---PFETF-WYVDFA-G-------------------------------VASHADVRA---------WWM------KK--------FS----PHSNCT--LPDVND MICPUCDRAFT_62794_Micromonas_pusilla_CCMP1545_303284171 ----M---------NPTDEYMTPPSAWEAIQKY---I--------------------------PKN-KVIWEAF-YGDGKSGDTLRTL--------GF---KVIHDEV---------DFFE---------NN-----LG-DVIVSNPPFSR-R-AAGPR-----------------------------------ASVSI-STIAEVYP-S-----D--RRRRDYSVCHEELGITPDFRRVEI------------TR-----------------------R-VNFIHVN----------------------------------W------------D--SR---P----GSPRS--------------------------------------STTPR----RRR-S-I-RNRSR--AR---YTY------RR---------R-R-----------------------P-------------------------------RC-------------------------------------------------------------------WWETAA-P-S-------------------------------------------------------------------------------------- MICPUCDRAFT_60475_Micromonas_pusilla_CCMP1545_303283096 
HA--F-DA------DGDDHCETSPEAHANVVNF---LNDVASRL----------------GKKPSE-LIIYDPY-YCAGGTERSFNAL--------GF-R-NVINRNE---------DFYAV---AK-R-NE-V-P-EH-DVLVTNPPYSA---DHVEK-----------------------------------CLTFA----------AANLA----EHGRPYFLLLPSYVIHKPYYVDALLTGGAAGRRAKEAREKRERGRAEEKEEDVEEE-------EEEEEDEEEDEEEMVDDDDDDDDDDQMVFKRASDATGERIPLAKKKSNASSSSSS--SR---R--------------------------------------------------------QTL-P-F-YVAPA--KR---YYY-----------------W-T-----------------------P-KA-LLAARRAASARDDGESASARRKK----KH-VGRLGER------------------------------------------------TS---PFPSI-WCCCLG-E-F--------------------------------QTDALR---------RHR---K--LP--------RA----FVDGYT--VSTHPN MICPUCDRAFT_57902_Micromonas_pusilla_CCMP1545_303278244 RD--R-DR------DPADDCETPAVAYAHLAPI---LRKLAQRL----------------KKPPSE-LAVYDPY-RCAGAVESRLGAL--------GF-D-AVANPAD---------DFYAA---LE-E-DR-V-P-PH-DVLVTNPPFSG---EHARR-----------------------------------LVSFL----------ASTRY----SRKRAFCFLAPEYVHRKA-WYAAM-----------------------------------------------------------------------------------T---------R-------A--------------------------------------------------------RPD-V-C-YLVPK--ER---YAF------------V----A-S--------------------S--G-GR-RENT------------AKPCRHW----AR-DGRCPRGDECPFQHGGGG--------GSGASTSSEDAPVARGGGASSSVTGTRTVVA---PFDCI-WHVHAG-E-----GRQ--------------------------RSVVAA---------WRQ------KY-GGGDGK-KE----DALGAR--LVERAE CHLREDRAFT_205675_Chlamydomonas_reinhardtii_159485216 
WP--F-EV------DYNDHFETSSAAVDDIQPV---LLALCNRL----------------KKTPAQ-LAIYDPF-FCKGGIRRHYEAR--------GF-T-NFIHRKR---------DFYAD---VE-S-GQ-L-P-EY-DVMVTNPPYSA---DHKER-----------------------------------ALDFC--------------L----RSGKPWALLLPNYVATKAYYSELV---------------------------DAAGT-------------------------------------------------------------P-------P--------------------------------------------------------QQR-P-F-YLTPI--TR---YAY-----------------E-H-----------------------P-EG-TGHA---------------------------------------------------------------------------------ES---PFYSI-WYVGLG-V-H--------------------------------------------------------TE---------------------------- Bathy01g03440_Bathycoccus_prasinos_612399992 DR----FS------IEDDEWATSQRAWNALAKH---L------------------------EKFKG-KKIWAPF-YYDGKVKTRLKQA--------GF-RGKVTHEKR---------DFFKL---MN-D-AKFL-A-NV-DAIIDNPPYTG-K-GMKEK-----------------------------------ILTKL----------I--------AKDVPFCLLFPLGVLHSKFLRDLT---------------------------------------------------------------------------------------------A--AK-----AR--------------------------------------------------RK-KVQ-----AIVPR--RV---FVH---------------------------------------------KE-FGE----------------------------------------------------------------------------------EL---PFKYLVWLCYGL-E-L--------------------------------ERDLVL-MDEE------------------------------------------- Bathy07g04630_Bathycoccus_prasinos_612393160 
HA--F-EC------NPDDHCETSLEAHKDIVNF---LNIVASQK----------------NKKPSE-LIIYDPY-YCAGATRLNFTEL--------GF-P-NVINENK---------DFYEM---VA-K-NK-V-P-EH-DVFVTNPPYSE---EHVEK-----------------------------------CVTFA----------AKNMT----QFGRPYFMLVPSYVVCKPYFVPALLTGGAQGA-------------------------------------------EERENTKEEDAEDDMNMH------------------------K--KR---G--------------------------------------------------------QVL-P-F-YIAPT--KR---YYY-----------------F-T-----------------------P-KP-LAKLR--NKNIDEN--GIEKKRR----GH-VGRRGER------------------------------------------------TS---PFLSL-WICGFG-D-D--------------------------------QIEALR---------MHK---K--LR--------RE----QVKHYV--VARNPK Bathy11g00290_Bathycoccus_prasinos_612389594 HP--F-DV------DASDHCETPFQAYKDIEPF---LFRIALSL----------------KKTKAS-LKIYDPY-FCEGSAKEHLKRL--------GF-E-SVHNVNE---------DFYEN---VK-K-NT-I-P-EY-DVLLTNPPYSS---DHFKR-----------------------------------ILNFC----------G--------ASEKPFFLLLPNFVCRKTYYANEI---------------------------------------------------------------------------------------------T-------S--------------------------------------------------------RKKEP-L-FLIPDELKP---YRY-----------------W-A-----------------------P-GR-KGFE--------------------------ER--AKG------------------------------------------------TT---PFITF-WYLEFG-D-------------------------------AIDKNEIRG---------WWL------KK--------YS----PHSRCE--LPAPEE GUITHDRAFT_46531_Guillardia_theta_CCMP2712_551675275 
----F-SV------EHQDHCETPGEAYDDIVPV---LLAIASNI----------------GKRADE-LMIYDPY-YCNGLVAQNLRDR--------GF-Q-HVYNKNE---------DFYEA---VK-Q-GT-T-P-PF-DVLVTNPPYSN---DHIER-----------------------------------LFSFC----------S---S----CEK-PWMVLVPNYVYTKDYYEKML-------------------------------------------------------------------------------------------------K---S--------------------------------------------------------GVR-P-F-YVIPP--NR---YEY-----------------I-S-----------------------P-AG-ARGS---------------REKK--------------------------------------------------------------TS---PFVSF-WFI--------------------------------------------------------------------------------------------- GUITHDRAFT_109156_Guillardia_theta_CCMP2712_551658644 HP--F-EH------DPADDCETCFQAYCDIAPF---LIKLAQRV----------------GKPKKD-LCIWDPY-YCAGKVKDHLRKL--------GF-H-NVHNNNE---------DFY-S---LK-P-EQ-F-P-PY-DVLLTSPPYSR---NHIEK-----------------------------------ILVFA----------S--------ECKKPWILLMPQYVHRKSYYSAII---------------------------------------------------------------------------------------------E-------G--------------------------------------------------------QH--P-F-YMIPP--KP---YVYHAHHGGRKDNTNVTCRHW-ARDGKCPKGDECAFVHGEVGDSAQP-AI-QSKG------------------------------ITP------------------------------------------------VT---PFKSI-WHMHFP-P-------------------------------EGMNNGIYT---------WAV------HK--------LR----KS------------ GUITHDRAFT_90353_Guillardia_theta_CCMP2712_551638519 
HS--F-ET------TDADHAETPREAYEHILPL---LHKMAEAA----------------SKKPSE-LRIYDPF-FCTGSMKRHLASL--------GF-T-NVYNKNE---------DFYEM---VK-S-KR-I-P-EH-DMVVTNPPYSL---DHIPR-----------------------------------FLRWL----------S--------VNDKPWLLLVPNYVYTKDYFSSSL---------------------------------------------------------------------------------------------R--GRL---------------------------------------------------------------P-M-FLTPPG-R----YVY-----------------E-S-----------------------P-KH-VAN-------------------------------AQG------------------------------------------------QT---APYVS-FWYVET-R---------------------------------------------------------------------------------------- GUITHDRAFT_113893_Guillardia_theta_CCMP2712_551648195 HP--F-PT------EYGDHFETSKVAIHDIAPI---LQQFAKVS----------------GKQASS-LAIYDPY-YCDGAVIEHFRQE--------GF-H-NVHNLNV---------DCYQV---WK-S-AT-TSS-DF-DIVVTNPPFSG---DHKQK-----------------------------------CLEHC--------------V----KREQAWMVLLPAYCATKNYFQELM-----------------------------SNW-------------------------------------------------------------K-------E--------------------------------------------------------RGK-V-F-YGIPK--VR---YDF-----------------E-H-----------------------P-EG-TGHA---------------------------------------------------------------------------------VS---PFFSI-WFVYLG-K-H--------------------------------------------------------TE---------------------------- GUITHDRAFT_165084_Guillardia_theta_CCMP2712_551646515 
FP--Y-EI------DDADHAETPAEAYADISHV---LEYVAGIL----------------KKDNNT-VKIYDPY-YCNGSVKKRLMRQ--------GF-P-NVYNERE---------DFYKA---IE-D-KR-I-P-SH-DILLTNPPYSG---DHPER-----------------------------------LMNFI----------S---R----TKS-PWFLLMPNWVYTKDYYKDLI----------------------------------LN---------------------------------------------------------K--AC---S--------------------------------------------------------SNP-P-F-YYIPK--KR---YTY-----------------W-T-----------------------P-PW-LHSS----------------------------QFGVS------------------------------------------------TS---PFPSF-WYIH--------------------------------------CGKHTE---------KVKGW----LE--------SN----ASDSMM--FAGGVQ THAPSDRAFT_bd1109_Thalassiosira_pseudonana_CCMP1335_224015927 HA--F-ET------NSLDHCETPLCAYENVQTV---LEMMAKHL----------------HVQPSK-LRIWDPY-YCDGTVKQHLASL--------GY-D-RVINENV---------DFYKR---VE-D-NT-I-P-EH-DVLLTNPPYSG---DHIER-----------------------------------LLKFV----------T---T----VNDKPFCLLMPNWVARKKEYKSII-----------------------------------------------------------------------------------------------------G--------------------------------------------------------KTN-L-F-YVSPI--EV---YTY-----------------A-M-----------------------P-TW-NSKP------------EHVDEET----GK--------------------------------------------------------TT---PYLSS-WYVSLR-S----------------------------------NSEATG---------RIE------NK--------LD----SIAKR--------- THAPSDRAFT_9806_Thalassiosira_pseudonana_CCMP1335_224009558 
-P--F-KA------DPDDHCESSPTSYAHIAPI---LNYVAKCI----------------GKKPRK-LEIYDPY-YCAGGMVRHMNKL--------GF-N-KVYNKAE---------DFYQV---IR-D-GN-V-P-SH-DVVVTNPPYSG---DHFDR-----------------------------------LLQF--------------LS----GNHKPALLLLPEHFSKNK---------------------------------------------------------------------------------------------------S--AR---H--------------------------------------------------------AQH-N-FCFLVPT--ER---YHY-----------------W-T-----------------------P-DG-M-------RPDDEG--DKKRKKQ----HR-NLVLGSR------------------------------------------------NS---PFPSH-WFIAME-P-IMT------------------------------NKQLIS---------LVR---D--GE--------IK----LLEGCG--LYERQE THAPS_23466_Thalassiosira_pseudonana_CCMP1335_224005064 FP--Y-PT------NPDDHCETPLQSYQDILPI---LNELRKGT-----G----------ATERET-LKIYDPY-FCNGSVVKHLASL--------GY-T-NVYNKKE---------DCYKV---WK-Q-RK-E-P-PF-DAFLTNPPYSD---DHIDK-----------------------------------LMEYL-ASP------S---F----DN-KPWLLLMPSWVHKKDYYINAT---------------------------------------TGNKKDRKKGK-------------------------------------------D-------S--------------------------------------------------------RSN-P-F-YIVPK--KR---YVY-----------------V-P-----------------------P-PD-FREK---------K--VSDVHKK--------------------------------------------------------------SS---PFTSM-WYIWGG-T----------------------------------NEKNEA---------LIK---A--FQ--------KS----NVDGCD--VARSRS THAPSDRAFT_6523_Thalassiosira_pseudonana_CCMP1335_224003919 
HA--F-ET------NSLDHCETPLCAYENVQPV---LEMMAKHL----------------HVQPSM-LRIWDPY-YCDGTVKQHLASL--------GY-D-RVINENF---------DFYKR---VE-D-NT-I-P-EH-DVLLTNPPYSG---DHIER-----------------------------------LLKFV----------T---T----VNDKPFCLLMPNWVARKKEYKSII-----------------------------------------------------------------------------------------------------G--------------------------------------------------------KTN-L-F-YVSPI--EV---YTY-----------------A-M-----------------------P-TW-NSKP------------EHVDEET----GK--------------------------------------------------------TT---PYLSS-WYVSLR-S----------------------------------NSEATS---------RIE------NK--------LD----SIAKR--------- THAPSDRAFT_21256_Thalassiosira_pseudonana_CCMP1335_223996249 YP--Y-PT------DYNDHFETPQRAYEDILPI---IGYVLKKK---I---------KR-YNSQSD-VTIYDPY-FCTGRAATLLNATFEQ--HTTGNKRHTNIRIQHEKR------DFYQD---VR-Q-NN-T-P-QY-DILVTNPPYSG---DHKER-----------------------------------CLEYV-------VD-Q-----LK-NNQRPFFLLMPNYVASKEYFRKIV--------------------------------------------------------------------------------L------------E--EK---I-----------------------------------------------------Q------I-V-FITPS--SKHP-YEY-----------------D-H-----------------------P-EG-TGHE---------------------------------------------------------------------------------TS---PFASV-WFCGLS-C----------------------------------GDTDGT---------WKK----------------NQ------------------ THAOC_11048_Thalassiosira_oceanica_397625596 
YR--A-TV------DYNDHFETPLRAYTDVFPV---IETLIQQK----------------C-KGKR-VIIYDPF-YCTGRAASLLRQC--------LQ-S-NNEKLAEKVDIQHEKRDFYRD---LR-E-NT-V-P-KF-DILVTNPPYSG---DHKER-----------------------------------CLEFA--------------V----NSSRPFFLLMPNYIATKEYFRKTV-------------------------------L-------------------------------------------------------------E-------T--------------------------------------------------------KKV-QDV-YIIPS--PGES-YEY-----------------H-H-----------------------P-EG-TGKP---------------------------------------------------------------------------------LS---PFESV-WFVGVS-R-R--------------------------------------------------------TS---L------------------------ THAOC_20767_Thalassiosira_oceanica_397605573 FP--Y-DV------NPDDHCETPPEAYRDVDPL---LSDLCRRL----------------GKSKSE-LRIYDPY-YCDGSVRRHLADI--------GY-G-DVHNERV---------DCYRV---WE-E-GR-E-P-EF-DVLVTNPPYSH--------------------------------------------IGYS----------Q------------DAFPSLPANEQNKRRSHREA---------------------------------------HEVRHLAVLRGQAVAPPNAAVGAQEGLLRGDHDG--------------------P--SR---P--------------------------------------------------------SPP-A-V-LRRAP--EA---VRL-----------------P-P-----------------------P-RG-PAGE------------EGQRHAQ--------------------------------------------------------------EE---LAVRL-HVVLLG-G----------------------------------EGGGQR---------GMD---RV-VP--------RG----GTGEGG--CDVARS PHATRDRAFT_41248_Phaeodactylum_tricornutum_CCAP_1055/1_219130185 
YP--Y-PV------NYNDHFETPLLAYKDLQPL---IDWLWSSS---I---CRKVKQGR-NAKATD-ISIYDPY-YCDGRTRSILAEL--------GY-R-NVLHEKR---------DFYKD---VM-R-NT-V-P-EY-DLLLTNPPYSD---QHKTK-----------------------------------CLEYC-------FS-Q-----LR-ESNKPFCILMPNYVASRQYFRNFL--------------------------------------------------------------------------------M------------K--EE---P-----------------------------------------------------ED-----V-V-YLIPT--LQ---YQY-----------------D-H-----------------------P-EG-TGKD---------------------------------------------------------------------------------KS---PFDSL-WFCGIG-R----------------------------------DRAKSA-VEF-----WKG----------------LG----RATFCP--KMAASL PHATRDRAFT_36383_Phaeodactylum_tricornutum_CCAP_1055/1_219120494 FP--F-VT------EADDHCESPLDAYHDIMPL---LKHL---------S----------GNETEK-FCIYDPY-YCDGGVTRNLNEL--------GF-P-NVYNRKE---------DCYAV---WS-D-VD-QCP-KF-DCLVTNPPYST---DHIER-----------------------------------LVKHV-TSS------T---F----TTGKPWFLLLPQWVHKKEFYQAAT---------------------------------------DALR-----------------------------------------------------------------------------------------------------------------------P-F-YLVPH--KR---YVY-----------------V-P-----------------------P-KD-FRES---------R--KSDVHKK--------------------------------------------------------------SS---PFVSM-WYVYGG-S----------------------------------AKQTEA---------IIR---T--YL--------QI----QNAPCD--LARSKS Naga_100023g2_Nannochloropsis_gaditana_585103216 
YP--F-PT------DYADHFESPLRAYEDLEPF---LQWLRRAL----------------RREKTS-LHIYDPY-FCRGAVVNLLKSL--------GF-P-RVTNKMR---------DFYAD---VA-A-GT-V-P-SY-DVLVTNPPYSD---DHKEK-----------------------------------ILRFC--------------L----GSDKPWCLLLPNYVANKSYYLDAI-----------------------------RPL-------------------------------------------------------------P-------Q--------------------------------------------------------DRQ-P-F-YLVPH--AK---YEY-----------------Q-H-----------------------P-EG-TGHI---------------------------------------------------------------------------------SS---PFFSI-WVCNPG-P-I--------------------------------------------------------PR---------------------------- Esi_0085_0104_Ectocarpus_siliculosus_298715150 HP--F-PT------EYGDHFETPLQAYRDIEVA---LALLAKLL----------------DKKRKH-LRIWDPY-YCAGRTPRLLGQL--------GF-P-KVEHSNQ---------DFYKV---VR-E-KR-Q-P-KH-DVLITNPPYSG---DHKKR-----------------------------------CLEYC--------------R----ASGKPWFLLVPNYVATKDYYRLAV---------------------------LGPAA-------------------------------------------------------------G-------P--------------------------------------------------------GGE-P-F-YVVPE--NK---YYF-----------------D-H-----------------------P-EG-TGHA---------------------------------------------------------------------------------DS---PFTGV-WYVHCG-S-H--------------------------------------------------------TE---------------------------- Esi_0571_0002_Ectocarpus_siliculosus_298713748 
YP--F-EV------EECDHCETSERAYSDISPL---LSALAAEL----------------GKPPED-LVIYDPY-YCQGSTVGRLASL--------GF-P-RVHNRKE---------DFYEV---VK-N-GN-I-P-QH-DVVVTNPPFSG---EHMPK-----------------------------------ILKFC----------A---R----QGAKPWFLLLPNYVYLKDYYEPSL---------------------------------GRR---------------------------------------------------------S--GQ---G--------------------------------------------------------ATR-P-F-YLTPP--KR---YMY-----------------Y-S-----------------------P-QG-SRLK------------VKSSERK--------------------------------------------------------------TS---PFNTF-WYIHLG-D----------------------------------CAVTSK---------ILQSYDAASRK--------LD----INARCC--VARTTQ AURANDRAFT_65195_Aureococcus_anophagefferens_676390061 YA--W-DT------DYGDHFETSEQAFRDVAPA---LRAL--------------------CGDGAG-AAILDPY-YCDGAAETRLRAL--------GF-R-NATNPAT---------DFYASR-AYR-EPGD-R-S-TF-DALVTNPPYSG---DHKER-----------------------------------CLAFA--------------L----ACGRPFALLLPAYVAEKKYFADAC-------------------------------A-------------------------------------------------------------E-------T--------------------------------------------------------GAA-P-F-FVSPA--RGRPPYEY-----------------A-H-----------------------P-HG-TGKA---------------------------------------------------------------------------------AA---PFASA-WVVDSG-R-G--------------------------------------------------------AA---A------------------------ EMIHUDRAFT_250132_Emiliania_huxleyi_CCMP1516_551539647 
QPLAF-SA------AEDDHCETAPEAYAHIVSL---LRLVARKR----------------GVPPEE-LRIWDPY-YCNGAVARHLAAL--------GF-P-HVHNANE---------DFYAR---LD-S-GD-L-P-EH-DVLLTNPPYTH---PHPER-----------------------------------LLAHC----------A---A----SGT-PWLALMPNWVYTKDYYWAAL---------------------------------GRS---------------------------------------------------------H--GT---A--------------------------------------------------------DTQ-P-F-YIAPR--KR---YNY-----------------W-T-----------------------P-RG-RRSD------------LTSGGAKAKTHGHTNAALGIR------------------------------------------------TS---PFVSF-WY----------------------------------------CGGFGP---------ALR------KR--------VT----PPEGCV--LCWSTE EMIHUDRAFT_212966_Emiliania_huxleyi_CCMP1516_551557088 WS--F-VT------EYNDHFETPRRAYADILPL---LAAASPL-----------------------------PP-KRDGGSAPEAEAL-AAVTAL-GVRRERVLNRNR---------DFYAD---IA-T-GQ-L-P-QY-DVLLTNPPYSG---DHKQRRGTA--------------TSPPTSPPDCNAPFPARLLRFL-ASDGD-------------MRGAPFLLLLPAW----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------A-TGNS---------------------------------------------------------------------------------AS---PFHAV-WFCGGW-A----------------------------------TEGARR------RA-MRA------LR-P------SR----RLREVE-------- EMIHUDRAFT_119528_Emiliania_huxleyi_CCMP1516_551561218 
WS--F-VT------EYNDHFETPRRAYADILPL---LAAASPL-----------------------------PP-KRDGGSAPEAEAL-AA--AL-GVRRERVLNRNR---------DFYAD---IA-T-GQ-L-P-QY-DVLLTNPPYSG---DHKQRRGVALPCSLPRRSVPLLLPSPEPEARDCNAPFPARLLRFL-ASDGD-------------MRGAPFLLLLPAWVCEKDYWNAFL---------------------------------------ERLATHRAAGGGGGGGDGGGGGGGGGDGGGGGGGAG------------------D--SS---A-----------------------------AHSSGECCKRRRRRRVGEGGLER----AAG-V-F-------------YVR-------------------------------------------P-SA-TGNS---------------------------------------------------------------------------------AS---PFHAV-WFCGGW-A----------------------------------TEGARR------RA-MRA------LR-P------SR----RLREVE-------- EMIHUDRAFT_440832_Emiliania_huxleyi_CCMP1516_551614698 GG----AG------ASDD-WQTARRSWAAIAEV---L--------G---------------PAARE-KRIWMPF-YYDGACAEHLREL--------GF-T-RVHHKRE---------DFFVQ---VR-N-PKFL-K-KV-DLILDNPPYTS-P-EMKEA-----------------------------------VLRAL----------A--------STGKPFVMLLPISVLHVGFAREVL---------------------------------------------------------------------------------------------D--TD------------------------------------------------------------KLQ-----AVVPR--RV---YVR---------------------------------------------KT-GGE----------------------------------------------------------------------------------EV---PFKYLCWLCCGV-R-L--------------------------------KRDLIL-IDDDDDA-AAA---D-------------------------------- EMIHUDRAFT_439419_Emiliania_huxleyi_CCMP1516_551626801 
GG----AG------ASDD-WQTARRSWAAIAEV---L--------G---------------PAARE-KRIWMPF-YYDGACAEHLREL--------GF-T-RVHHKRE---------DFFVQ---VR-N-PKFL-K-KV-DLILDNPPYTS-P-EMKEA-----------------------------------VLRAL----------A--------STGKPFVMLLPISVLHVGFAREVL---------------------------------------------------------------------------------------------D--TD------------------------------------------------------------KLQ-----AVVPR--RV---YVR---------------------------------------------KT-GGE----------------------------------------------------------------------------------EV---PFKYLCWLCCGV-R-L--------------------------------KRDLIL-IDDDDDA-AAA---D-------------------------------- STCU_02709_Strigomonas_culicis_528246120 HP--F-RA------NFNDHFETSIEALRDVLPV---VRELQQLL-----R----------PSTPER-FTLYDPY-YCSGTVKELWAQL--------EV-P-NVVHENV---------DFYAA---VE-A-RT-V-P-PH-DMLVTNPPFSD---DHIER-----------------------------------FLRFV----------L-----LR-NAGRPWAMLAPDYVVTKPWYRELV-------------------------------------E-RHCTKASRIAKGVLTGAA------------------------------------R--PP---P-A--------TFALPPFIQAAAAPAAA------------AAAAPPPPAVVGV------E-P-F-YIVPR--AR---YDY-----------------R-H-----------------------PVEG-AARE---------------------------------------------------------------------------------HS---HFKSM-WYVWAG-R-H--------------------------------TPEVVR------GV-RVA-V----LH--------RA----A------VPDRAPQ Pmar_PMAR017622_Perkinsus_marinus_ATCC_50983_294899807 
FP--Y-PT------DSLDHAESPAKAYGHVAPI---LELMAKKL----------------GKTKED-LKIYDPY-YCNGAVVDNLKAL--------GF-N-NVYNKCE---------DFYS----VE-T------P-DF-DVLLTNPPYSG---EHPEK-----------------------------------LAEFT----------A--------KVGKPWLWLVPNWFYMKDYYKKLI---------------------------------------------------------------------------------------------E--QP---G--------------------------------------------------------QSG-M-F-FVAPK--KR---YVY-----------------Q-T-----------------------P-KH-LRAS--------------SDEAK--------------------------------------------------------------TS---PFPSF-WFINGC-D-V--------------------------------CTPVEM------------------QS-A------MA----DAEGVL--AVTDVH TRSC58_02209_Trypanosoma_rangeli_SC58_554942173 HP--F-KA------EFNDHFETSMEALRDVAVV---VDQLRRLQ-----R----------PSAPEN-FVVYDPY-YCAGTVLHHWNAL--------GV-Q-RVIHENR---------DFYRD---IA-E-GK-V-PPDY-DMLVTNPPFSG---DHIER-----------------------------------LFDYL----------V--------TAKKPFAFLVPDYTATKDWYRAAV-------------------------------------R-RHFTAAPPSGKGDINAPR------------------------------------H--TR---P----KASAA-ALQPPPFLKMDPPADAETPLNDTINKINREKDKGGGEETVPI----GTE-P-F-YLVPR--SC---YDF-----------------R-H-----------------------P-KG-AGND---------------------------------------------------------------------------------HS---HFRSM-WFVWAG-R-H--------------------------------TTEVLR------GA-KVE-F----AR--------RH----REALTQ--RQTVLD DQ04_18661000_Trypanosoma_grayi_686647126 
-----------------------MEALQDIMVV---VEQLRQLV-----R----------PSAPEN-FAVYDPY-YCAGGIVQQWKEL--------GV-Q-RVLHDNR---------DFYKD---VA-E-GT-V-PRDY-DMLVTNPPFSG---DHIER-----------------------------------MFDYL----------L--------ASKRPFAFLVPDYTATKEWYRSAV-------------------------------------R-RHFTPAPPTGKGDINAPR------------------------------------R--AR---P----LVPAAVLLQPPPFVKTDADAASG-----NNNNTNSGSGDCGDCDVVPI----GVE-P-F-YLVPR--VR---YDF-----------------K-H-----------------------P-KG-VGNE---------------------------------------------------------------------------------HS---HFRSM-WFVWAG-R-R--------------------------------TTEVLR------GA-KVE-F----SR--------LH----RETMTAAGRRAVPD Tc00.1047053506297.320_Trypanosoma_cruzi_strain_CL_Brener_71666516 HP--F-RA------EFNDHFETSLEALRDIAVV---VDQLLLLL-----R----------PSAPEN-FVVYDPY-YCAGTVVRYWNTL--------GV-Q-RVIHANR---------DFYKD---IA-E-GD-D-PDDY-DMLVTNPPFSG---DHIER-----------------------------------LFNYL----------V--------ARKKPFAFLVPDYTATKDWYRTAV-------------------------------------R-RHFTPVPSSGKGDINASR------------------------------------H--TR---P----KLPAA-LLQPPPFLKMEQTASAEIALDDTKNKKQKEDNKCDNEETIPI----GTE-P-F-YLVPR--GR---YDF-----------------S-H-----------------------P-KG-VGKD---------------------------------------------------------------------------------HS---HFRTM-WFVWTG-R-H--------------------------------TTEVLR------GT-KVE-F----AR--------RH----REDSTQ--RQVSPD Tc00.1047053508041.40_Trypanosoma_cruzi_strain_CL_Brener_71409237 
HP--F-RA------EFNDHFETSLEALRDIAVV---VDQLRLLL-----R----------PSAPEN-FVVYDPY-YCAGTVVRYWNTL--------GV-Q-RVIHANR---------DFYKD---IA-E-GD-D-PDDY-DMLVTNPPFSG---DHIER-----------------------------------LFNYL----------V--------TRKKPFAFLVPDYTATKDWYRTAV-------------------------------------R-RHFTPAPPSGKGDINAPR------------------------------------H--TR---P----KVPAA-LLQPPPFLKMEQTTSAGIALDDTNDKKQKEDNKCDGEETIPI----GTE-P-F-YLVPR--GR---YDF-----------------S-H-----------------------P-KG-VGKD---------------------------------------------------------------------------------HS---HFRTM-WFVWTG-R-H--------------------------------TTEVLR------GT-KVE-F----AR--------RH----REDSTQ--RQVSPD TCSYLVIO_001620_Trypanosoma_cruzi_407859934 HP--F-RA------EFNDHFETSLEALRDIAVV---VDQLRLLL-----R----------PSAPEN-FVVYDPY-YCAGTVVRYWNTL--------GV-Q-RVIHANR---------DFYKD---IA-E-GD-D-PDDY-DKLVTNPPFSG---DHIER-----------------------------------LFNYL----------V--------TRKKPFAFLVPDYTATKDWYRTAV-------------------------------------R-RHFTPVPPSGKGDINASR------------------------------------H--TR---P----NVSAA-LLQPPPFLKMEQTAGAGMAFVDTNEKKQKEDNKCDSEETIPI----GTE-A-F-YLVPR--GR---YDF-----------------S-H-----------------------P-KG-VGKD---------------------------------------------------------------------------------HS---HFRTM-WFVWTG-R-H--------------------------------TTEVLR------GT-KVE-F----AR--------RH----REHSTQ--RQASPD MOQ_000466_Trypanosoma_cruzi_marinkellei_407425172 
HP--F-RA------EFNDHFETSLEALRDIAVV---VDQLPLLL-----R----------PSAPEN-FVVYDPY-YCAGTVVQHWNTL--------GV-Q-RVIHANR---------DFYKD---IA-E-GN-V-PDDY-DMLVTNPPFSG---DHIER-----------------------------------LFKFL----------V--------ARKKPFAFLVPDYTATKDWYRNAV-------------------------------------R-RQFTPAPPTGKGDINAPR------------------------------------H--TR---P----KVPAA-VLQPPPFLKLEQTPSAGIARDDSNDKKQKEDNKSDSEETLPI----GTE-P-F-YLVPR--GR---YDF-----------------S-H-----------------------P-KG-VGND---------------------------------------------------------------------------------HS---HFRSI-WFVWTG-R-H--------------------------------TTEVLR------GT-KVE-F----AR--------RH----REKSTQ--RQVSPD AGDE_06588_Angomonas_deanei_528257849 HP--F-RA------NFNDHFETSLEALRDVLAA---VQEVRQQL-----R----------PSTPEK-FTLYDPY-YCSGTVVASWAQL--------DM-P-NVINENV---------DFYAT---MA-N-HT-I-P-VH-DMLVTNPPFSD---DHIPR-----------------------------------LMKFL----------A-----DG-NDGRPWAFLAPDYVATKPWYIQFV-------------------------------------N-EHYAKATRVAKGVLRGPA------------------------------------P--TA---P-R--------SFALPPYLAAGNTAAT---------------------KVLPV------E-P-F-YIIPK--QK---YDF-----------------H-H-----------------------PVEG-VGKE---------------------------------------------------------------------------------HS---HFKSM-WYVWAG-R-H--------------------------------TNDVVR------AS-RVE-L----LR--------RH----P-------TGAAPA GSEM1_T00001947001_Phytomonas_sp_isolate_EM1_588317381 
HN--F-KA------NFNDHFETTIEALRDLLPV---VQELRRLT-----R----------PSAPER-FVLYDPY-YCAGAIPGLWRDL--------GL-P-HTLHENR---------DFYAD---IA-R-DT-V-PGPY-DLLVTNPPFSD---DHLPR-----------------------------------LLEFL----------ARGRDETRGNRQRPWAFLAPDYIAAKPWYRAWV-------------------------------------R-DHFEAA---GGGNRNPDS------------------------------------D--GA---PGALKKAQIT-RFEAPPFLKASQADEVV-----DGVPQEGANGVCHTVGGSPVCTKLGPE-P-F-YIVPK--GR---YDF-----------------K-H-----------------------P-LN-AGHE---------------------------------------------------------------------------------HS---HFKSM-WYVWMG-S-R--------------------------------TSEIIR------AA-KIE-L----LK--------TS----TSGSSA-------T D341_RS0120100_Proteobacteria_bacterium_JGI_0000113-E04_655449388 FP--Y-DA------VERDHCESPRVAYQQIEPL---LRSYASSI----------------GKMAKE-LKIYDPY-YCNGAVKKHLRFL--------GY-Q-DVYNECE---------DFYNK---IE-T-DT-V-P-AF-DVMITNPPYSG---DHMEK-----------------------------------LLKFC----------AGYCA----KKKKPFFLLLPNYVYTKEYYSDVF---------------------------------------------------------------------------------------------T--EQ---P--------------------------------------------------------DRS-I-L-FN------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- ADD95864.1_uncultured_organism_MedDCM-OCT-S12-C54_291336302 
AA--M-DA------ALAGKRKRKRKEGVPPREV---TGTVPEGS-----E--------S-EAGLES-LYVYDPY-FCQGGMVDALVEL--------GCARERVINLNR---------DFYQD---VA-D-GS-V-P-SH-DVLLTNPPYSA---DHKQK-----------------------------------LLDYL-------LG-E--------HQHRPGKGM---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- AIA83135.1_Podovirus_Lau218_643012036 ------KN------PASDEYYTPPEGIEPLVKF---L--------------------------NKE-LRYYEP---TAGKSQRIVKYL-TS--K--GF---NITGSKP-----DE--DFLK---------GD-F-S-DY-DAIITNPPFSN-KGDFIER---------------------------CY------------------------------EIGKPFALLMPVSTIQGQKRGKRF--------------------------------------------------------------------------------I------------E-------D--------------------------------------------------------GIE-----LLVLN--KR---VSF-----------------I-K-----------------------P-DN-TIAG---------------------------------------------------------------------------------SP---SFGVA-WFCHG----I--------------------------------LPEKLQ---------FHE------AR-K-------------------------- LS64_RS01615_Helicobacter_sanguini_736557575 
QN----SY------ENSDERYTKPEAIFPLLKY---I--------------------------PKD-KVIWCPCDLESSYFVRIFRLN--------GY---KVIHSHI---NLGQ--DFYHF---EP-K--------EW-DILITNPPFSN-----KKE-----------------------------------FISRV----------L--------SFKKPFCLLLPLTYLNDSTPYHLF---------------------------------------------------------------------------------------------K--DI------------------------------------------------------------DLE-----LLIFD--KR---MEF----------I------NAD-------------------------SK-----------------------------------------------------N--------------------------------RI---SFKSG-YYAWKV-F----------------------------------NKQVVF---------EKL------ES--------LHFLESNKWNEI--IENMKY AW14_RS07635_Siansivirga_zeaxanthinifaciens_765310844 PP--L-KK------GSPDDFQTPPEALTPLLPF---L--------------------------KKD-WTIWECA-SGKGNLSTYLKQQ--------GF---KVISTDI---LAGK--DFLSY---EP-K--------QY-DCIITNPPYAF-----KQE-----------------------------------FLERA----------Y--------CLGKPFAFLLPLTTFETAKRQQYF---------------------------------------------------------------------------------------------K--HC------------------------------------------------------------GLE-----VIFLD--KR---INF-----------------E-T-----------------------P-DG-SGD----------------------------------------------------------------------------------GS---WFATA-WFTNWL-K-I--------------------------------GKQMSF---------TSL------------------------------------ HMPREF1087_RS05735_[Clostridium]_clostridioforme_740438568 
YL----TS--NKK-Q--DDLFTPAYAVDPIIKY---L--------------------------SKD-KIIWCPWDCEWSAFYQRLKEE--------GF---KVVRSSL---EEGE--DFFEY---EP-D--------EW-DIVVSNPPFSI-----KDK-----------------------------------VLERL----------Y--------SFNKPFAILLPLNSLQGKTRYKYF------------------------------------------------------------------------------------------------KQ------------------------------------------------------------GIQ-----ILSFD--AR---VCY-----------------H-D-------------------------KNHMDSV-----------------------------------------------VK--------------------------------GS---PFATA-YFCRDL-L----------------------------------PKDLIV---------EKL------VT--------YE----RPLMTR-------- LS74_RS07390_Helicobacter_magdeburgensis_736576773 YL----NA--RHD-ESSDECMTPFYAVEPLLKY---I--------------------------PRN-KTVWCPFDKEWSAFVK-LLST--------RN---EVIHSHI---DDGK--DFFTY---KP-K--------HF-DIIISNPPFSC-----KDK-----------------------------------VLQRC----------Y--------ELNKPFAMLLPVSCIQGKKRVEMF---------------------------------------------------------------------------------------------M--KN------------------------------------------------------------GLQ-----ILAFD--LR---VDY-----------------H-T-------------------------RGNMQET-----------------------------------------------TK--------------------------------AT---YFGSA-FFCKDI-L----------------------------------PLSLMF---------APL------KK--------YE----QSLGEK--ASAKRE HMPREF1074_RS00110_Bacteroides_xylanisolvens_495301957 
----------MFL-NCSDEKYTPGYGVAPIIKY---L--------------------------PEG-KIIWCPFDTCHSEFVLHLQEA--------GF---QVKYSHI---NTGQ--DFFAY---EP-D--------CW-DIIVSNPPFSR-----KIE-----------------------------------VFERC----------L--------QLDKPFALLMSNFWLNSVGPCRLF---------------------------------------------------------------------------------------------K--DR------------------------------------------------------------ELQ-----LLMFD--KR---IQY-------------------D-------------------------KG-----------------------------------------------------G--------------------------------GV---PFGSS-YYCHRL-L----------------------------------PKQIVF---------EEL------AV--------CR----NDYSRM--YRDVDN VK10_RS18230_Bacteroides_acidifaciens_765333434 PS----KLSRMFL-NCSDEKYTPGYGVAPIIKY---L--------------------------PEG-KIIWCPFDTCHSEFVLHLQEA--------GF---QVKYSHI---NTGQ--DFFAY---EP-D--------CW-DIIVSNPPFSR-----KIE-----------------------------------VFERC----------L--------QLDKPFALLMSNFWLNSVGPCRLF---------------------------------------------------------------------------------------------K--DR------------------------------------------------------------ELQ-----LLMFD--KR---IQF-------------------D-------------------------KG-----------------------------------------------------G--------------------------------GV---PFGSS-YYCHRL-L----------------------------------PKQIVF---------EEL------AV--------CR----NDYSRM--HRDVDN LGA01S_RS01170_Lactococcus_garvieae_754856863 
----L-DK--VAN-SGNDEFYTPEYAIKPLLKY---I--------------------------PKN-AKVWCPFDTSDSLFVKLLLEH--------GC---EVVNTHI---SRGE--DFF-E---LS-N-SE-I-A-DWCDYIISNPPYSR-----KTE-----------------------------------VLEEL----------F--------MTEKPFAMLLGCVGLFESQKRFEM--------------------------------------------------------------------------------F------------R--DN------------------------------------------------------------SFE-----IMMFN--RR---ISY--FQS------------Y-E-EQK-------------------P-SK--------------------------------------------------------------------------------------NP---PFSS--WYLCKG-I-L--------------------------------NKPFVF---------EEV------VK---------------------------- YS40_029_Thermus_phage_phiYS40_118197649 LV----EH--VKK-EKDDEFHTPRFAVEPLLKY---I--------------------------PKD-KVIWCPFDTEESNYVKVFMEN--------GY---KVVYSHI---SMGQ--DFFFY---EP-E--------NY-DVIVSNPPFSV-----KTE-----------------------------------ILKRA----------Y--------SLGKPFAFLLPITSLEGKKRGELF---------------------------------------------------------------------------------------------R--KY------------------------------------------------------------GLQ-----LIVFD--RR---IEF------------------------------------------------MSTT-----------------------------------------------KT--------------------------------GV---WFNTS-YFCYKL-L----------------------------------PRDLIF---------EQL------EV---------------------------- TMA_029_Thermus_phage_TMA_343960410 
LV----EH--VKK-ERDDEFHTPRFAVEPLLKY---I--------------------------PKD-KVIWCPFDTEESNYVKVFMEN--------GY---KVVYSHI---SMGQ--DFFFY---EP-E--------NY-DVIVSNPPFSV-----KTE-----------------------------------ILKRA----------Y--------SLGKPFAFLLPITSLEGKKRGELF---------------------------------------------------------------------------------------------R--KY------------------------------------------------------------GLQ-----LIVFD--RR---IEF------------------------------------------------MSTT-----------------------------------------------KT--------------------------------GV---WFNTS-YFCYKL-L----------------------------------PRDLIF---------EQL------KV---------------------------- HMPREF1033_RS05355_Tannerella_sp_6_1_58FAA_CT1_496675013 YN----NW---HI-RANDERYTPRYTVLPIIKY---L--------------------------PQK-AVIWCPFDTENSEFVLTLKEN--------GF---KVTHSHI---VNGD--DFYTY---EP-E--------YW-DIIVSNPPFSN-----KRQ-----------------------------------IFERC----------L--------SFGKPFALIMSNLALNDSFPCRLF---------------------------------------------------------------------------------------------K--DK------------------------------------------------------------ELQ-----LLMFD--KR---IQY-------------------N-------------------------LL-----------------------------------------------------E--------------------------------RI---PFASS-YFCHKL-L----------------------------------PKQIIF---------ENL------DV--------VK----GQMSRM--YKDMED HMPREF1069_06304_Bacteroides_ovatus_CL02T12C04_392661135 
----------MFL-NCSDEKYTPGYGVAPIIKY---L--------------------------PEG-KIIWCPFDTCHSEFVLHLQEA--------GF---QVKYSHI---STGQ--DFFAY---EP-D--------CW-DIIVSNPPFSR-----KMD-----------------------------------VFERC----------L--------QLDKPFALLMSNFWLNSVGPCRLF---------------------------------------------------------------------------------------------K--DR------------------------------------------------------------ELQ-----LLMFD--KR---IQY-------------------D-------------------------KG-----------------------------------------------------G--------------------------------GV---PFGSS-YYCHRL-L----------------------------------PKQIVF---------EEL------AV--------CR----NDYSRM--HRDVDN HMPREF1074_RS00030_Bacteroides_xylanisolvens_495301979 ----------MFL-NCSDEKYTPGYGVAPIIKY---L--------------------------PEG-KIIWCPFDTCHSEFVLHLQEA--------GF---QVKYSHI---STGQ--DFFAY---EP-D--------CW-DIIVSNPPFSR-----KKD-----------------------------------VFERC----------L--------QLDKPFALLMSNFWLNSVGPCRLF---------------------------------------------------------------------------------------------K--DR------------------------------------------------------------ELQ-----LLMFD--KR---IQY-------------------D-------------------------KG-----------------------------------------------------G--------------------------------GV---PFGSS-YYCHRL-L----------------------------------PKQIVF---------EEL------AV--------CR----NDYSHM--HRDVDN _Bacteroides_ovatus_769142550 
PS----KLSRMFL-NCSDEKYTPGYGVAPIIKY---L--------------------------PEG-KIIWCPFDTCHSEFVLHLQEA--------GF---QVKYSHI---STGQ--DFFAY---EP-D--------CW-DIIVSNPPFSR-----KMD-----------------------------------VFERC----------L--------QLDKPFALLMSNFWLNSVGPCRLF---------------------------------------------------------------------------------------------K--DR------------------------------------------------------------ELQ-----LLMFD--KR---IQY-------------------D-------------------------KG-----------------------------------------------------G--------------------------------GV---PFGSS-YYCHRL-L----------------------------------PKQIVF---------EEL------AV--------CR----NDYSRM--HRDVDN NI20_RS0109935_Oscillibacter_sp_ER4_696600425 ------KA----R-KASDLYPTPPEVTVALMRF---L------------------------KLPAG-TDIWEPA-RGQGDMVRALADC--------GM---AVYGTDI---RDGI--DFLTT---RQ-P-GN-A-P-AA-DWIITNPPFSL-----ADE-----------------------------------FIRHA----------A--------EIGKPFAMLLKAQYWHAAKRAQLF---------------------------------------------------------------------------------------------R----------------------------------------------------------------EIP-P-S-YVLPLTWRP-D-FLF-------------------K-------------------------ER-DGKK------------------------------------------------G--------------------------------AS---PLMDV-MWCVWL------------------------------------TPQMQG-V-------QTV-F----KP-----L--MR----PEKEK--------- P694_RS05110_Entomoplasma_luminosum_737023823 
MR----NI--LGLQKSNNEFYTPEEPIIDLLDNFLNI--------------------------PKS-KIIWCPFDTEDSEFVKQLKHR--------GY---KIISSHI---ENGK--DFYEY---EPNE--------EW-DMILSNPPFSG-----KRI-----------------------------------LIERC----------E--------SFKKPFCLLYG-----ATIFSQSM----------------------------------------------------------------------------------------GN---T--LN------------------------------------------------------------RCE-----FIFIQ--RN---IKF------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ _Bacteroides_ovatus_490425600 --------------------------MAPIIKY---L--------------------------PEG-KIIWCPFDTCHSEFVLHLQEA--------GF---QVKYSHI---STGQ--DFFAY---EP-D--------CW-DIIVSNPPFSR-----KKD-----------------------------------VFERC----------L--------QLDKPFALLMSNFWLNSVGPCRLF---------------------------------------------------------------------------------------------K--DR------------------------------------------------------------ELQ-----LLMFD--KR---IQF-------------------D-------------------------KG-----------------------------------------------------G--------------------------------GV---PFGSS-YYCHRL-L----------------------------------PKQIVF---------EEL------AV--------CR----NDYSRM--HRDVDN X558_RS03965_Mycoplasma_pirum_738499893 
LG--L-QK------RENNEFYTPKETVENIVNL---V-----------------------IKKLKN-KVVWCPFDTQDSNFVKVLKEK--------NI---SVINTHI-N-IKNG--DFYK-------N-KT-I-PKKW-DLILSNPPFSK-----KRE-----------------------------------LIERC----------L--------SFNKDFCLL---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Q453_RS00595_Mycoplasma_hyorhinis_504101400 IT----KA--FNY-KNDDEWYTTREDVQFFIDN-ANI--------------------------PKD-KVIWCPFDVEDSNFVTVFKKN--------GY---KVLHSHI---WDQQ--DFYEY---EPKE--------KW-DIIISNPPFKG-----KHR-----------------------------------LLARL----------L--------EFNKPWALIFGIQALNSEKFCHEL-----------------------------------------------------------------------------------------Q---K--FK------------------------------------------------------------RVQ-----YIHLK--RR---MCF-----------------T-K-------------------------DH-LNYD-----------------------------------------------IK--------------------------------NLQRPSFASM-WIANDL-F----------------------------------DKDILV---------WNG------VN--------YK----KDGKEF--F----- SY27_07790_Flavobacterium_sp_316_759871867 
KI----DY--IKR-GAFDELYTPKEAIECILPY---I--------------------------PDTVKIIWECTAIENSEIVTVLKAN--------DF---EVIKSHI---KDGL--DFFEY---EP-P--------QY-DLIITNPPYSL-----KDQ-----------------------------------FLKRA----------F--------ELDKPFMMLLPITTLEGKKRSEMF---------------------------------------------------------------------------------------------Q--QH------------------------------------------------------------KVQ-----VLIPS--KR---FNF-----------------I-K-------------------------EK-----------------------------------------------------K--------------------------------GS---WFQTS-WFTWKLNL----------------------------------KSDLIF---------MNV------------------------------------ MOS_RS00605_Mycoplasma_hyorhinis_504896920 IT----KA--FNY-KNDDEWYTTREDVQFFIDN-ANI--------------------------PKD-KVIWCPFDVEDSNFVTVFKKN--------GY---KVLHSHI---WDQQ--DFYEY---EPKE--------KW-DIIISNPPFKG-----KHR-----------------------------------LLARL----------L--------EFNKPWALIFGIQALNSEKFCHEL-----------------------------------------------------------------------------------------Q---K--FK------------------------------------------------------------RVQ-----YVHLK--RR---MCF-----------------T-K-------------------------DH-LNYD-----------------------------------------------IK--------------------------------NLQRPSFASM-WIANDL-F----------------------------------DKDILV---------WNG------VN--------YK----KDGKEF--F----- RUMFLAFD1_RS0120915_Ruminococcus_flavefaciens_497673944 
YL----TA--NRT-SAGDEVYTPFYAIEPLLEF---L--------------------------PKD-KKIWCPFDEEWSAFYQFLSEK--------GY---EVERSSL---KEGQ--DFFRY---EP-E--------QW-DILVSNPPFSK-----KND-----------------------------------VLKRA----------F--------SFQKPFALLLPVNSIQGKARYKIF------------------------------------------------------------------------------------------------NN------------------------------------------------------------EIQ-----MLSFD--GR---VDY-----------------H-T-------------------------RQNMECT-----------------------------------------------TK--------------------------------GN---HFGSA-YFCRDL-L----------------------------------PSKLEL---------RQL------VK--------YD----RPLVTP--TIGGDE F801_RS0102175_Mycoplasma_hyorhinis_518948704 IT----KA--FNY-KNDDEWYTTREDVQFFIDN-ANI--------------------------PKD-KVIWCPFDVEDSNFVTVFKKN--------GY---KVLHSHI---WDQQ--DFYEY---EPKE--------KW-DIIISNPPFKG-----KHR-----------------------------------LLARL----------L--------EFNKPWALIFGIQALNSEKFCHEL-----------------------------------------------------------------------------------------Q---K--FK------------------------------------------------------------RVQ-----YVHLK--RR---MCF-----------------T-K-------------------------DH-LNYD-----------------------------------------------IK--------------------------------NLQRPSFASM-WIANDL-F----------------------------------DKDILV---------WNG------VD--------YK----KDGKEF--F----- M081_5001_Bacteroides_fragilis_str_3998_T(B)_4_596015177 
YS----KY--LHS-NKSDEKYTPQYAVLPIIKY---L--------------------------PRK-AVIWCPFDTENSEFVLALKEA--------GY---RVVYSHI---FTGQ--DFFEY---EP-K--------RW-DIIVSNPPFSN-----KAR-----------------------------------IFERC----------L--------AFRKPFALLMSNFWLNDSAPCRLF---------------------------------------------------------------------------------------------K--ER------------------------------------------------------------ELQ-----LLLFD--KR---VEY-------------------N-------------------------DL-----------------------------------------------------S--------------------------------RV---PFGSS-YFCHKV-L----------------------------------PKQIVF---------ENL------TK--------IK----GEKSRM--WADVEK MHR_0113_Mycoplasma_hyorhinis_HUB-1_304309105 IT----KA--FNY-KNDDEWYTTREDVQFFIDN-ANI--------------------------PKD-KVIWCPFDVEDSNFVTVFKKN--------GY---KVLHSHI---LDQQ--DFYEY---EPKE--------KW-DIIISNPPFKG-----KHR-----------------------------------LLARL----------L--------EFNKPWALIFGIQALNSEKFCHEL-----------------------------------------------------------------------------------------Q---K--FK------------------------------------------------------------RVQ-----YVHLK--RR---MCF-----------------T-K-------------------------DH-LNYD-----------------------------------------------TK--------------------------------NLQRPSFASM-WIANDL-F----------------------------------DKDILV---------WNG------VD--------YK----KDGKEF--F----- H740_RS04285_Campylobacter_showae_489041364 
HR------------SKSDFYQTPYAITRRLLEV--------EKF-----S-----------------GRILEPA-CGAGAITAILKEA--------GY-E-DVTAYDL-L-LDGK--DFLA-------E-TR-----KF-DVIITNPPFSL-----AKE-------------------------------F---ILKAC-------------------EIAPRFAFLLPLNYLHGKERLDEI--------------------------------------------------------------------------------Y---------------SR---E--------------------------------------------------------ILE-K-V-YVFAR-------YPL------------L-S--A-Q-------------IR--------P-DG----------------------------KY--------------------------------------------------------ET-G-MMVYA-WYIFDT-K-H--------K-----------------------GAPTIH---------WID------NS-E-D-VV-RK-GK--------------- _Mycoplasma_hyorhinis_752716488 MT----KA--FNY-KNDDEWYTTREDVQFFIDN-ANI--------------------------PKD-KVIWCPFDVEDSNFVTVFKKN--------GY---KVLHSHI---LDQQ--DFYEY---EPKE--------KW-DIIISNPPFKG-----KHR-----------------------------------LLARL----------L--------EFNKPWALIFGIQALNSEKFCHEL-----------------------------------------------------------------------------------------Q---K--FK------------------------------------------------------------RVQ-----YVHLK--RR---MCF-----------------T-K-------------------------DH------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- M125_5712_Bacteroides_fragilis_str_3998T(B)3_596000932 
YS----KY--LHS-NKSDEKYTPQYAVLPIIKY---L--------------------------PRK-AVIWCPFDTENREFVLALKEA--------GY---RVVYSHI---FTGQ--DFFEY---EP-K--------RW-DIIVSNPPFSN-----KAR-----------------------------------IFERC----------L--------AFRKPFALLMSNFWLNDSAPCRLF---------------------------------------------------------------------------------------------K--ER------------------------------------------------------------ELQ-----LLLFD--KR---VEY-------------------N-------------------------DL-----------------------------------------------------S--------------------------------RV---PFGSS-YFCHKV-L----------------------------------PKQIVF---------ENL------TK--------IK----GEKSRM--WADVEK M081_RS20695_Bacteroides_fragilis_695479665 YS----KY--LHS-NKSDEKYTPQYAVLPIIKY---L--------------------------PRK-AVIWCPFDTENSEFVLALKEA--------GY---RVVYSHI---FTGQ--DFFEY---EP-K--------RW-DIIVSNPPFSN-----KAR-----------------------------------IFERC----------L--------AFRKPFALLMSNFWLNDSAPCRLF---------------------------------------------------------------------------------------------K--ER------------------------------------------------------------ELQ-----LLLFD--KR---VEY-------------------N-------------------------DL-----------------------------------------------------S--------------------------------RV---PFGSS-YFCHKV-L----------------------------------PKQIVF---------ENL------TK--------IK----GEKSRM--WADVEK consensus/100% 
.....................................................................................................................Dh.....................s...ssPPh.................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................. consensus/95% .................s...p.......h......l................................hh.s...........h...........sh...ph..............Dhh....................DhhlsNPPap............................................hh..........................sahhl.................................................................................................................................................................................................................................................................................................................................................................................................................................................... 
consensus/90% .................Dc..os..sh..l......l................................la.Ph....u.....h...........Gh...pl...p..........DFa........p...........DhlloNPPao........c...................................hh..h.......................Pahhlhs....p..............................................................................................................................................................................h.............b...........................................................................................................................................h....h............................................................................................... consensus/85% ................sDc.bTs.bsh..l..h...l................................lasPa..p.u.h...h...........Gh...pV..ppb.........DFa........p........pa.DhlloNPPao........c...................................hhp.h......................bPahhLhs...hp.......h......................................................................................................................................................................hh..........h.a...........................................................................................................................................a....ahh............................................................................................. 
consensus/80% ..............p.sDchbTs.bsh.sl..h...l...........................p..b.lasPa..p.u.h...h...........Gh...pV.pppb.........DFa........p........pa.DhlloNPPaS......bpc...................................hhpbh......................+PahhLhs...hp.......h......................................................................................................................................................................hl..........h.a...........................................................................................................................................F.s..ahh............................................................................................. consensus/75% ..............p.sDchbTs.buh.slh.h...l...........................pp.b.lasPa..p.u.h...h.p.........Ga...pV.pppb.........DFa........p........pa.DhlloNPPaSs.....bpc...................................hhpbh......................+PahhLhP..hhpp.bb..hh......................................................................................................................................................................hl..p.......h.a......................................................................................................................................s....F.s..ahh............................................................................................. 
consensus/70% .s............p.sDchbTs.buh.slhsh...l...........................pp.b.IasPa.hhsu.h.p.h.ph........Ga...pVhpppb.........DFa........p........pa.DllloNPPaSs.....bp+...................................lhpbh...................p..+PahlLhP.bhhpp.bb.phh................................................................................................................................................................p.....hl..p..p....h.a.............................................p........................................................................................s...sF.oh.aah.............................................................................................Back to Contents
# 1; Eukaryotic homologs
GI | Domain-arch | Pfam arch | Gene name | Len | Taxonomy | Species | Genbank
514687493 | N6-MTase | - | PTSG_07527 | 290 | eukaryota>choanoflagellida | Salpingoeca rosetta | hypothetical protein PTSG_07527 [Salpingoeca rosetta].
514693110 | N6-MTase | - | PTSG_04809 | 713 | eukaryota>choanoflagellida | Salpingoeca rosetta | hypothetical protein PTSG_04809 [Salpingoeca rosetta].
167525218 | N6-MTase | - | MONBRDRAFT_26429 | 428 | eukaryota>choanoflagellida | Monosiga brevicollis MX1 | hypothetical protein [Monosiga brevicollis MX1].
167523896 | N6-MTase | FG-GAP_2 | MONBRDRAFT_25858 | 1215 | eukaryota>choanoflagellida | Monosiga brevicollis MX1 | hypothetical protein [Monosiga brevicollis MX1].
470514790 | N6-MTase | - | ACA1_068850 | 456 | eukaryota>amoebozoa>acanthamoebidae | Acanthamoeba castellanii str. Neff | hypothetical protein ACA1_068850 [Acanthamoeba castellanii str. Neff].
470455333 | N6-MTase | - | ACA1_037140 | 442 | eukaryota>amoebozoa>acanthamoebidae | Acanthamoeba castellanii str. Neff | hypothetical protein ACA1_037140 [Acanthamoeba castellanii str. Neff].
302843234 | N6-MTase | - | VOLCADRAFT_105839 | 730 | eukaryota>viridiplantae>chlorophyta | Volvox carteri f. nagariensis | hypothetical protein VOLCADRAFT_105839 [Volvox carteri f. nagariensis].
302830991 | N6-MTase | - | VOLCADRAFT_120421 | 198 | eukaryota>viridiplantae>chlorophyta | Volvox carteri f. nagariensis | hypothetical protein VOLCADRAFT_120421 [Volvox carteri f. nagariensis].
693498069 | N6-MTase | - | OT_ostta10g00600 | 336 | eukaryota>viridiplantae>chlorophyta | Ostreococcus tauri | DNA methylase, N-6 adenine-specific, conserved site [Ostreococcus tauri].
693497110 | N6-MTase | - | OT_ostta14g01460 | 234 | eukaryota>viridiplantae>chlorophyta | Ostreococcus tauri | unnamed product [Ostreococcus tauri].
308811312 | N6-MTase | - | Ot14g01730 | 236 | eukaryota>viridiplantae>chlorophyta | Ostreococcus tauri | unnamed protein product [Ostreococcus tauri].
308810727 | N6-MTase | - | Ot13g02010 | 525 | eukaryota>viridiplantae>chlorophyta | Ostreococcus tauri | unnamed protein product [Ostreococcus tauri].
308808370 | N6-MTase | - | Ot10g00600 | 259 | eukaryota>viridiplantae>chlorophyta | Ostreococcus tauri | unnamed protein product [Ostreococcus tauri].
145356641 | N6-MTase | - | OSTLU_29451 | 166 | eukaryota>viridiplantae>chlorophyta | Ostreococcus lucimarinus CCE9901 | predicted protein [Ostreococcus lucimarinus CCE9901].
145353330 | N6-MTase | - | OSTLU_27164 | 533 | eukaryota>viridiplantae>chlorophyta | Ostreococcus lucimarinus CCE9901 | predicted protein [Ostreococcus lucimarinus CCE9901].
145351428 | N6-MTase | - | OSTLU_26028 | 333 | eukaryota>viridiplantae>chlorophyta | Ostreococcus lucimarinus CCE9901 | predicted protein [Ostreococcus lucimarinus CCE9901].
761971839 | N6-MTase | - | MNEG_5609 | 613 | eukaryota>viridiplantae>chlorophyta | Monoraphidium neglectum | hypothetical protein MNEG_5609 [Monoraphidium neglectum].
255088714 | N6-MTase | - | MICPUN_64290 | 293 | eukaryota>viridiplantae>chlorophyta | Micromonas sp. RCC299 | predicted protein [Micromonas sp. RCC299].
255086063 | N6-MTase | - | MICPUN_103519 | 422 | eukaryota>viridiplantae>chlorophyta | Micromonas sp. RCC299 | predicted protein [Micromonas sp. RCC299].
255079482 | N6-MTase+CCCH | zf-CCCH | MICPUN_59514 | 296 | eukaryota>viridiplantae>chlorophyta | Micromonas sp. RCC299 | predicted protein [Micromonas sp. RCC299].
255078302 | N6-MTase | - | MICPUN_59025 | 641 | eukaryota>viridiplantae>chlorophyta | Micromonas sp. RCC299 | predicted protein [Micromonas sp. RCC299].
303288509 | N6-MTase | - | MICPUCDRAFT_53312 | 283 | eukaryota>viridiplantae>chlorophyta | Micromonas pusilla CCMP1545 | predicted protein [Micromonas pusilla CCMP1545].
303284947 | N6-MTase | - | MICPUCDRAFT_51291 | 413 | eukaryota>viridiplantae>chlorophyta | Micromonas pusilla CCMP1545 | predicted protein [Micromonas pusilla CCMP1545].
303284171 | N6-MTase | - | MICPUCDRAFT_62794 | 178 | eukaryota>viridiplantae>chlorophyta | Micromonas pusilla CCMP1545 | predicted protein [Micromonas pusilla CCMP1545].
303283096 | N6-MTase | - | MICPUCDRAFT_60475 | 709 | eukaryota>viridiplantae>chlorophyta | Micromonas pusilla CCMP1545 | predicted protein [Micromonas pusilla CCMP1545].
303278244 | N6-MTase+CCCH | zf-CCCH | MICPUCDRAFT_57902 CCCH | 366 | eukaryota>viridiplantae>chlorophyta | Micromonas pusilla CCMP1545 | predicted protein [Micromonas pusilla CCMP1545].
159485216 | N6-MTase | - | CHLREDRAFT_205675 | 410 | eukaryota>viridiplantae>chlorophyta | Chlamydomonas reinhardtii | predicted protein [Chlamydomonas reinhardtii].
612399992 | N6-MTase | - | Bathy01g03440 | 243 | eukaryota>viridiplantae>chlorophyta | Bathycoccus prasinos | predicted protein [Bathycoccus prasinos].
612393160 | N6-MTase | - | Bathy07g04630 | 585 | eukaryota>viridiplantae>chlorophyta | Bathycoccus prasinos | predicted protein [Bathycoccus prasinos].
612389594 | N6-MTase | MTS | Bathy11g00290 | 307 | eukaryota>viridiplantae>chlorophyta | Bathycoccus prasinos | predicted protein [Bathycoccus prasinos].
551675275 | N6-MTase | - | GUITHDRAFT_46531 | 166 | eukaryota>cryptophyta | Guillardia theta CCMP2712 | hypothetical protein GUITHDRAFT_46531, partial [Guillardia theta CCMP2712].
551658644 | N6-MTase+CCCH | zf-CCCH | GUITHDRAFT_109156 CCCH | 292 | eukaryota>cryptophyta | Guillardia theta CCMP2712 | hypothetical protein GUITHDRAFT_109156 [Guillardia theta CCMP2712].
551638519 | N6-MTase | - | GUITHDRAFT_90353 | 235 | eukaryota>cryptophyta | Guillardia theta CCMP2712 | hypothetical protein GUITHDRAFT_90353 [Guillardia theta CCMP2712].
551648195 | N6-MTase | - | GUITHDRAFT_113893 | 411 | eukaryota>cryptophyta | Guillardia theta CCMP2712 | hypothetical protein GUITHDRAFT_113893 [Guillardia theta CCMP2712].
551646515 | N6-MTase | - | GUITHDRAFT_165084 | 426 | eukaryota>cryptophyta | Guillardia theta CCMP2712 | hypothetical protein GUITHDRAFT_165084 [Guillardia theta CCMP2712].
224015927 | N6-MTase | - | THAPSDRAFT_bd1109 | 244 | eukaryota>stramenopiles | Thalassiosira pseudonana CCMP1335 | predicted protein [Thalassiosira pseudonana CCMP1335].
224009558 | N6-MTase | - | THAPSDRAFT_9806 | 337 | eukaryota>stramenopiles | Thalassiosira pseudonana CCMP1335 | predicted protein [Thalassiosira pseudonana CCMP1335].
224005064 N6-MTase - THAPS_23466 489 eukaryota>stramenopiles Thalassiosira pseudonana CCMP1335 predicted protein [Thalassiosira pseudonana CCMP1335]. 224003919 N6-MTase - THAPSDRAFT_6523 230 eukaryota>stramenopiles Thalassiosira pseudonana CCMP1335 predicted protein [Thalassiosira pseudonana CCMP1335]. 223996249 N6-MTase Methyltransf_26 THAPSDRAFT_21256 257 eukaryota>stramenopiles Thalassiosira pseudonana CCMP1335 predicted protein [Thalassiosira pseudonana CCMP1335]. 397625596 N6-MTase Methyltransf_26 THAOC_11048 320 eukaryota>stramenopiles Thalassiosira oceanica hypothetical protein THAOC_11048, partial [Thalassiosira oceanica]. 397605573 N6-MTase - THAOC_20767 347 eukaryota>stramenopiles Thalassiosira oceanica hypothetical protein THAOC_20767, partial [Thalassiosira oceanica]. 219130185 N6-MTase - PHATRDRAFT_41248 320 eukaryota>stramenopiles Phaeodactylum tricornutum CCAP 1055/1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1]. 219120494 N6-MTase - PHATRDRAFT_36383 217 eukaryota>stramenopiles Phaeodactylum tricornutum CCAP 1055/1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1]. 585103216 N6-MTase Methyltransf_26 Naga_100023g2 371 eukaryota>stramenopiles Nannochloropsis gaditana DNA methylase, N-6 adenine-specific, conserved site [Nannochloropsis gaditana]. 298715150 N6-MTase - Esi_0085_0104 334 eukaryota>stramenopiles Ectocarpus siliculosus conserved unknown protein [Ectocarpus siliculosus]. 298713748 N6-MTase - Esi_0571_0002 544 eukaryota>stramenopiles Ectocarpus siliculosus conserved unknown protein [Ectocarpus siliculosus]. 676390061 DUF501+N6-MTase+SAP DUF501 AURANDRAFT_65195 3593 eukaryota>stramenopiles Aureococcus anophagefferens hypothetical protein AURANDRAFT_65195 [Aureococcus anophagefferens]. 551539647 N6-MTase - EMIHUDRAFT_250132 371 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_250132 [Emiliania huxleyi CCMP1516]. 
551557088 KH+N6-MTase KH_3 EMIHUDRAFT_212966 KH 570 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_212966 [Emiliania huxleyi CCMP1516]. 551561218 KH+N6-MTase KH_3 EMIHUDRAFT_119528 KH 450 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_119528 [Emiliania huxleyi CCMP1516]. 551614698 N6-MTase - EMIHUDRAFT_440832 205 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_440832 [Emiliania huxleyi CCMP1516]. 551626801 N6-MTase - EMIHUDRAFT_439419 205 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_439419 [Emiliania huxleyi CCMP1516]. 528246120 N6-MTase - STCU_02709 364 eukaryota>euglenozoa>kinetoplastida Strigomonas culicis hypothetical protein STCU_02709 [Strigomonas culicis]. 294899807 N6-MTase - Pmar_PMAR017622 296 eukaryota>alveolata Perkinsus marinus ATCC 50983 hypothetical protein Pmar_PMAR017622 [Perkinsus marinus ATCC 50983]. 554942173 N6-MTase - TRSC58_02209 341 eukaryota>euglenozoa>kinetoplastida Trypanosoma rangeli SC58 hypothetical protein TRSC58_02209 [Trypanosoma rangeli SC58]. 686647126 N6-MTase - DQ04_18661000 284 eukaryota>euglenozoa>kinetoplastida Trypanosoma grayi hypothetical protein DQ04_18661000 [Trypanosoma grayi]. 71666516 N6-MTase - Tc00.1047053506297.320 340 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 71409237 N6-MTase - Tc00.1047053508041.40 340 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 407859934 N6-MTase - TCSYLVIO_001620 340 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi hypothetical protein TCSYLVIO_001620 [Trypanosoma cruzi]. 407425172 N6-MTase - MOQ_000466 340 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi marinkellei hypothetical protein MOQ_000466 [Trypanosoma cruzi marinkellei]. 
528257849 N6-MTase - AGDE_06588 301 eukaryota>euglenozoa>kinetoplastida Angomonas deanei hypothetical protein AGDE_06588 [Angomonas deanei]. 588317381 N6-MTase - GSEM1_T00001947001 369 eukaryota>euglenozoa>kinetoplastida Phytomonas sp. isolate EM1 unnamed protein product [Phytomonas sp. isolate EM1]. Smin1000013068 N6-MTase - Smin1000013068 252 eukaryota>alveolata>dinophyceae Symbiodinium minutum Mf 1.05b.01 6 GI Operon Dom-arch Pfam arch Gene name Len Taxonomy Species Genbank # 68; Type IV secretion system associated 765333434 <-N6-MTase*<-DUF3872-Ig<-TraO<-TraN N6-MTase - - 208 bacteria>bacteroidetes Bacteroides acidifaciens tRNA (adenine-N6)-methyltransferase [Bacteroides acidifaciens]. 765333433_?-><-765333434_N6-MTase*<-765333437_DUF3872-Ig<-765333435_TraO<-765333436_TraN 769142550 TraO->DUF3872-Ig->N6-MTase*-><-?||?->?->?-><-N6-MTase N6-MTase - - 207 bacteria>bacteroidetes Bacteroides ovatus tRNA (adenine-N6)-methyltransferase [Bacteroides ovatus]. 490452835_TraO->490452836_DUF3872-Ig->769142550_N6-MTase*-><-490452838_?||769142554_?->490452840_?->490452841_?-><-490452845_N6-MTase<-490452848_?<-490452850_? 736576773 N6-MTase*-> N6-MTase MTS - 202 bacteria>proteobacteria>epsilonproteobacteria Helicobacter magdeburgensis hypothetical protein [Helicobacter magdeburgensis]. 736576728_?->736576730_?->736576773_N6-MTase*->736576732_?->736576735_?->736576737_?->736576740_?->736576743_?->736576746_?->736576749_?-> 499516379 TraK->?->N6-MTase->TraM->?->N6-MTase*->TraN->?->?->?->VirD4_TraG-> N6-MTase - - 201 bacteria>bacteroidetes Bacteroides fragilis tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis]. 53714095_?->53714096_?->53714097_TraK->53714098_?->53714099_N6-MTase->53714100_TraM->53714101_?->499516379_N6-MTase*->53714103_TraN->53714104_?->53714105_?->53714106_?->53714107_VirD4_TraG-><-53714108_?<-53714109_? 
260623613 <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-cpN6-MTase<-TraM<-?<-TraK N6-MTase - BACFIN_05770 199 bacteria>bacteroidetes Bacteroides finegoldii DSM 17565 hypothetical protein BACFIN_05770 [Bacteroides finegoldii DSM 17565]. <-260623606_?<-260623607_?<-260623608_VirD4_TraG<-260623609_?<-260623610_?<-260623611_?<-260623612_TraN<-260623613_N6-MTase*<-260623614_cpN6-MTase<-260623615_TraM<-260623616_?<-260623617_TraK<-260623618_?<-260623619_?||260623620_?-> 490439159 TraK->?->N6-MTase->TraM->?->N6-MTase*->TraN->?->?->?->VirD4_TraG-> N6-MTase SP - 199 bacteria>bacteroidetes Bacteroidales MULTISPECIES: tRNA (adenine-N6)-methyltransferase [Bacteroidales]. 490439165_?->490439164_?->490439163_TraK->490439162_?->499516378_N6-MTase->490439160_TraM->490443600_?->490439159_N6-MTase*->490439158_TraN->490439157_?->496044155_?->490439155_?->655320168_VirD4_TraG-><-490439153_?<-490439152_? 490451191 <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-?<-TraM<-N6-MTase<-?<-TraK N6-MTase - - 199 bacteria>bacteroidetes Bacteroidales MULTISPECIES: tRNA (adenine-N6)-methyltransferase [Bacteroidales]. 490439152_?->490439153_?-><-490439154_VirD4_TraG<-490439155_?<-490439156_?<-695491981_?<-490439158_TraN<-490451191_N6-MTase*<-490451190_?<-490439160_TraM<-490439161_N6-MTase<-490439162_?<-490439163_TraK<-490439164_?<-695491989_? 494415009 <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-?<-TraM<-N6-MTase<-?<-TraK N6-MTase - - 199 bacteria>bacteroidetes Bacteroidales MULTISPECIES: tRNA (adenine-N6)-methyltransferase [Bacteroidales]. 490439152_?->494414996_?-><-494414999_VirD4_TraG<-494415001_?<-494415003_?<-494415005_?<-494415007_TraN<-494415009_N6-MTase*<-490451190_?<-490439160_TraM<-496051721_N6-MTase<-496051722_?<-490439163_TraK<-494415019_?<-490439165_? 496051720 <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-?<-TraM<-N6-MTase<-?<-TraK N6-MTase - - 199 bacteria>bacteroidetes Bacteroides sp. 2_2_4 tRNA (adenine-N6)-methyltransferase [Bacteroides sp. 2_2_4]. 
490439152_?->494414996_?-><-496051719_VirD4_TraG<-494415001_?<-494415003_?<-494415005_?<-494415007_TraN<-496051720_N6-MTase*<-490451190_?<-490439160_TraM<-496051721_N6-MTase<-496051722_?<-490439163_TraK<-494415019_?<-490439165_? 496308428 <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-?<-TraM<-N6-MTase<-?<-TraK N6-MTase - - 199 bacteria>bacteroidetes Parabacteroides sp. D13 tRNA (adenine-N6)-methyltransferase [Parabacteroides sp. D13]. 490439152_?->496308427_?-><-490439154_VirD4_TraG<-490439155_?<-496044155_?<-494415005_?<-494415007_TraN<-496308428_N6-MTase*<-496308429_?<-490439160_TraM<-496308430_N6-MTase<-496044157_?<-490439163_TraK<-490439164_?<-496308431_? 495916348 TraK->?->?->TraM->TraN->TraO->DUF3872-Ig->N6-MTase*->?-><-?<-?<-?<-ParB N6-MTase - - 195 bacteria>bacteroidetes Bacteroides sp. 1_1_30 tRNA (adenine-N6)-methyltransferase [Bacteroides sp. 1_1_30]. 495916341_TraK->695334858_?->495916343_?->495916344_TraM->495916345_TraN->495916346_TraO->696264670_DUF3872-Ig->495916348_N6-MTase*->696264709_?-><-490455318_?<-495916364_?<-495916370_?<-495916373_ParB<-495916380_?<-495916381_? 496422695 TraK->?->TraM->TraN->TraO->DUF3872-Ig->N6-MTase*->?->?-><-?||?-><-?<-?<-ParB N6-MTase MTS - 195 bacteria>bacteroidetes Bacteroides oleiciplenus hypothetical protein [Bacteroides oleiciplenus]. 496422688_?->496422689_TraK->496422690_?->763277032_TraM->496422692_TraN->763277085_TraO->496422694_DUF3872-Ig->496422695_N6-MTase*->763277086_?->496422697_?-><-496422699_?||496422700_?-><-496422701_?<-496422702_?<-496422703_ParB 695334862 TraK->?->TraM->TraN->TraO->DUF3872-Ig->N6-MTase*->?-><-?<-?<-?<-?<-ParB N6-MTase - - 195 bacteria>bacteroidetes Bacteroides fragilis tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis]. 695334857_?->695334929_TraK->695334858_?->695334859_TraM->495916345_TraN->695334860_TraO->695334861_DUF3872-Ig->695334862_N6-MTase*->695334930_?-><-695334863_?<-490455318_?<-695334864_?<-695334865_?<-695341265_ParB<-695334867_? 
696261854 <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-cpN6-MTase<-TraM<-?<-TraK N6-MTase SP - 191 bacteria>bacteroidetes Bacteroides finegoldii tRNA (adenine-N6)-methyltransferase [Bacteroides finegoldii]. <-495024507_?<-495024509_?<-495024511_VirD4_TraG<-495024513_?<-495024516_?<-495024519_?<-495024523_TraN<-696261854_N6-MTase*<-495024528_cpN6-MTase<-495024530_TraM<-495024532_?<-495024534_TraK<-495024536_?<-495024547_?||696261855_?-> 492357380 TraK->?->TraM->TraN->TraO->cpN6-MTase->DUF3872-Ig->N6-MTase*-> N6-MTase - - 190 bacteria>bacteroidetes Bacteroides fragilis tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis]. 492357371_TraK->492357373_?->492357374_TraM->492357375_TraN->492357376_TraO->492357377_cpN6-MTase->492357378_DUF3872-Ig->492357380_N6-MTase*->492357382_?->695340121_?-><-492357385_?<-492357387_?<-492357388_?<-695340204_?<-492357390_? 495301957 TraK->?->TraM->TraN->TraO->DUF3872-Ig->N6-MTase*-> N6-MTase - - 190 bacteria>bacteroidetes Bacteroides xylanisolvens tRNA (adenine-N6)-methyltransferase [Bacteroides xylanisolvens]. 495299922_?->490425606_TraK->495299921_?->696232227_TraM->696232252_TraN->495301955_TraO->495301956_DUF3872-Ig->495301957_N6-MTase*-><-495301958_?||696232363_?->495301960_?->696232364_?-><-495301962_? 495301979 TraK->?->TraM->TraN->TraO->DUF3872-Ig->N6-MTase*-> N6-MTase - - 190 bacteria>bacteroidetes Bacteroides xylanisolvens tRNA (adenine-N6)-methyltransferase [Bacteroides xylanisolvens]. 495299922_?->490425606_TraK->495299921_?->696232227_TraM->696232252_TraN->495301975_TraO->495301977_DUF3872-Ig->495301979_N6-MTase*-><-495301980_?<-495301962_? 695345663 TraK->?->TraM->TraN->TraO->cpN6-MTase->DUF3872-Ig->N6-MTase*-> N6-MTase - - 190 bacteria>bacteroidetes Bacteroides fragilis tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis]. 
695345658_TraK->695345659_?->695345660_TraM->492357375_TraN->695345661_TraO->492357377_cpN6-MTase->695345662_DUF3872-Ig->695345663_N6-MTase*->695345664_?->695340121_?-><-492357385_?<-695345665_?<-695345666_?<-695345685_?<-695345667_? 265525156 N6-MTase*-> N6-MTase - PCMG_00077 189 viruses>dsdna viruses, no rna stage>caudovirales Prochlorococcus phage P-SSM2 conserved hypothetical protein [Prochlorococcus phage P-SSM2]. 265525149_?->265525150_?->265525151_?->265525152_?->265525153_?->265525154_?->265525155_?->265525156_N6-MTase*->265525157_?->265525158_?->265525159_?->265525160_?->265525161_?->265525162_?->265525163_?-> 291544920 N6-MTase*->VirB6_TrbL->?->VirB4_TraE-> N6-MTase EcoRI_methylase RUM_19970 189 bacteria>firmicutes Ruminococcus champanellensis 18P13 = JCM 17042 hypothetical protein RUM_19970 [Ruminococcus champanellensis 18P13 = JCM 17042]. 291544913_?->291544914_?->291544915_?->291544916_?->291544917_?->291544918_?->291544919_?->291544920_N6-MTase*->291544921_VirB6_TrbL->291544922_?->291544923_VirB4_TraE-><-291544924_?<-291544925_?||291544926_?->291544927_?-> 392661135 TraO->DUF3872-Ig->N6-MTase*-><-?||?->?->?-><-?<-?<-N6-MTase N6-MTase - HMPREF1069_06304 189 bacteria>bacteroidetes Bacteroides ovatus CL02T12C04 hypothetical protein HMPREF1069_06304 [Bacteroides ovatus CL02T12C04]. 392661133_TraO->392661134_DUF3872-Ig->392661135_N6-MTase*-><-392661136_?||392661137_?->392661138_?->392661139_?-><-392661140_?<-392661141_?<-392661142_N6-MTase 585220856 <-VirB4_TraE<-?<-VirB6_TrbL<-N6-MTase*<-?<-VirD4_TraG<-HNH<-VirD4-TraG N6-MTase EcoRI_methylase RF007C_04375 189 bacteria>firmicutes Ruminococcus flavefaciens 007c tRNA (adenine-N6)-methyltransferase [Ruminococcus flavefaciens 007c]. <-585220860_VirB4_TraE<-585220861_?<-585220862_VirB6_TrbL<-585220856_N6-MTase*<-585220863_?<-585220864_VirD4_TraG<-585220865_HNH<-585220866_VirD4-TraG<-585220867_?<-585220868_?<-585220869_? 
815703720 <-VirB4_TraE<-?<-VirB6_TrbL<-N6-MTase*<-?<-VirD4-TraG N6-MTase EcoRI_methylase - 189 bacteria>firmicutes Ruminococcus sp. UNK.MGS-30 tRNA (adenine-N6)-methyltransferase [Ruminococcus sp. UNK.MGS-30]. <-815703705_?<-815703707_?<-815703709_?<-815703712_?<-815703713_VirB4_TraE<-815703715_?<-815703718_VirB6_TrbL<-815703720_N6-MTase*<-815704307_?<-815704309_VirD4-TraG<-547318573_?<-547318574_?<-815703723_?<-815703725_?||815703727_?-> 492311699 <-N6-MTase*<-DUF3872-Ig<-TraO<-TraN<-TraM<-?<-TraK N6-MTase MTS - 188 bacteria>bacteroidetes Bacteroides fragilis tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis]. 492311683_?->492311684_?->695340692_?->492311687_?-><-492311689_?||492311691_?->492311694_?-><-492311699_N6-MTase*<-492311701_DUF3872-Ig<-492311703_TraO<-492311706_TraN<-492311710_TraM<-695340771_?<-492311717_TraK<-492311720_? 492375163 TraK->?->TraM->TraN->TraO->DUF3872-Ig->N6-MTase*-> N6-MTase MTS - 188 bacteria>bacteroidetes Bacteroides uniformis tRNA (adenine-N6)-methyltransferase [Bacteroides uniformis]. 492375149_?->492375151_TraK->736509373_?->492375155_TraM->492375157_TraN->492375159_TraO->492375161_DUF3872-Ig->492375163_N6-MTase*-><-492375167_?<-492375168_?||492311689_?-><-492375179_?<-492375181_?<-492375183_?<-492375185_? 763415988 TraM->TraN->TraO->DUF3872-Ig->N6-MTase*-> N6-MTase - - 188 bacteria>bacteroidetes Candidatus Bacteroides timonensis tRNA (adenine-N6)-methyltransferase [Candidatus Bacteroides timonensis]. 763415981_TraM->763415983_TraN->763415984_TraO->763415985_DUF3872-Ig->763415988_N6-MTase*-><-763415990_? 752678067 VirD4-TraG->?->N6-MTase*->VirB6_TrbL->?->VirB4_TraE-> N6-MTase EcoRI_methylase - 185 bacteria>firmicutes Ruminococcus champanellensis tRNA (adenine-N6)-methyltransferase [Ruminococcus champanellensis]. 
505371826_?->505371827_?->505371828_?->752677690_?->505371830_?->752678065_VirD4-TraG->752678066_?->752678067_N6-MTase*->505371834_VirB6_TrbL->505371835_?->505371836_VirB4_TraE-><-505371837_?<-505371838_?||505371839_?->752677691_?-> 546873189 TraK-><-?||TraM->TraN->DUF3872-Ig->N6-MTase*-> N6-MTase MTS - 184 bacteria>actinobacteria Eggerthella sp. CAG:1427 hypothetical protein [Eggerthella sp. CAG:1427]. <-546873154_?||546873159_?->546873163_TraK-><-546873168_?||546873173_TraM->546873178_TraN->546873184_DUF3872-Ig->546873189_N6-MTase*-><-546873191_? 596000932 TraK->?->TraM->TraN->TraO->DNA-primase->DUF3872-Ig->N6-MTase*->?->AbiH-> N6-MTase CoA_binding_2 M125_5712 184 bacteria>bacteroidetes Bacteroides fragilis str. 3998T(B)3 hypothetical protein M125_5712 [Bacteroides fragilis str. 3998T(B)3]. 596000934_TraK->596000933_?->596000943_TraM->596000946_TraN->596000945_TraO->596000941_DNA-primase->596000940_DUF3872-Ig->596000932_N6-MTase*->596000937_?->596000949_AbiH->596000950_?-><-596000948_?<-596000931_?<-596000942_?||596000944_?-> 596015177 TraK->?->TraM->TraN->TraO->DNA-primase->DUF3872-Ig->N6-MTase*->?->AbiH-> N6-MTase Methyltransf_26 M081_5001 184 bacteria>bacteroidetes Bacteroides fragilis str. 3998 T(B) 4 hypothetical protein M081_5001 [Bacteroides fragilis str. 3998 T(B) 4]. 596015170_TraK->596015171_?->596015172_TraM->596015173_TraN->596015174_TraO->596015175_DNA-primase->596015176_DUF3872-Ig->596015177_N6-MTase*->596015178_?->596015179_AbiH->596015180_?->596015181_?-><-596015182_?<-596015183_?<-596015184_? 695479665 TraK->?->TraM->TraN->TraO->DNA-primase->DUF3872-Ig->N6-MTase*->?->AbiH-> N6-MTase Methyltransf_26 - 183 bacteria>bacteroidetes Bacteroides fragilis tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis]. 
695479568_TraK->695479571_?->695479574_TraM->695479576_TraN->695479579_TraO->695542931_DNA-primase->695542948_DUF3872-Ig->695479665_N6-MTase*->695479586_?->695479589_AbiH-><-695479596_?<-695479599_?||695479602_?->695542935_?-> 757750547 TraK->?->TraM->TraN->TraO->DNA-primase->DUF3872-Ig->N6-MTase*->?->AbiH-> N6-MTase CoA_binding_2 - 183 bacteria>bacteroidetes Bacteroides fragilis tRNA (adenine-N6)-methyltransferase [Bacteroides fragilis]. 695479568_TraK->695479571_?->757750544_TraM->695479576_TraN->695479579_TraO->757750538_DNA-primase->757750546_DUF3872-Ig->757750547_N6-MTase*->695479586_?->757750540_AbiH->695479593_?-><-695479596_?<-695479599_?||695479602_?->757750542_?-> 739436373 <-VirB4_TraE<-?<-VirB6_TrbL<-N6-MTase*<-?<-VirD4_TraG<-HNH<-VirD4-TraG N6-MTase EcoRI_methylase - 182 bacteria>firmicutes Ruminococcus flavefaciens tRNA (adenine-N6)-methyltransferase [Ruminococcus flavefaciens]. <-739436256_VirB4_TraE<-739436259_?<-739436261_VirB6_TrbL<-739436373_N6-MTase*<-739436374_?<-739436375_VirD4_TraG<-739436376_HNH<-739436377_VirD4-TraG<-739436265_?<-739436267_?<-739436269_? 156112085 N6-MTase->?->?-><-?<-N6-MTase*<-DUF3872-Ig<-TraO<-TraN<-TraM<-?<-TraK N6-MTase - BACOVA_00455 175 bacteria>bacteroidetes Bacteroides ovatus ATCC 8483 hypothetical protein BACOVA_00455 [Bacteroides ovatus ATCC 8483]. 156112078_?->156112079_?->156112080_?->156112081_N6-MTase->156112082_?->156112083_?-><-156112084_?<-156112085_N6-MTase*<-156112086_DUF3872-Ig<-156112087_TraO<-156112088_TraN<-156112089_TraM<-156112090_?<-156112091_TraK<-156112092_? 695393939 <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-?<-TraM<-N6-MTase<-?<-TraK N6-MTase - - 159 bacteria>bacteroidetes Bacteroides sp. 2_1_16 tRNA (adenine-N6)-methyltransferase, partial [Bacteroides sp. 2_1_16]. 496044153_?->490439153_?-><-490439154_VirD4_TraG<-496044154_?<-496044155_?<-490439157_?<-490439158_TraN<-695393939_N6-MTase*<-490443600_?<-490439160_TraM<-490439161_N6-MTase<-496044157_?<-490439163_TraK<-490439164_?<-490439165_? 
263256225 <-VirD4_TraG<-?<-?<-?<-TraN<-N6-MTase*<-?<-TraM<-N6-MTase<-?<-TraK N6-MTase - HMPREF0101_01185 150 bacteria>bacteroidetes Bacteroides sp. 2_1_16 hypothetical protein HMPREF0101_01185 [Bacteroides sp. 2_1_16]. 263256218_?->263256219_?-><-263256220_VirD4_TraG<-263256221_?<-263256222_?<-263256223_?<-263256224_TraN<-263256225_N6-MTase*<-263256226_?<-263256227_TraM<-263256228_N6-MTase<-263256229_?<-263256230_TraK<-263256231_?<-263256232_? 355531994 <-N6-MTase*<-DUF3872-Ig<-TraO<-TraN<-TraM<-?<-TraK N6-MTase - HMPREF9441_00695 119 bacteria>bacteroidetes Paraprevotella clara YIT 11840 hypothetical protein HMPREF9441_00695 [Paraprevotella clara YIT 11840]. 355531987_?->355531988_?->355531989_?->355531990_?->355531991_?->355531992_?->355531993_?-><-355531994_N6-MTase*<-355531995_DUF3872-Ig<-355531996_TraO<-355531997_TraN<-355531998_TraM<-355531999_?<-355532000_TraK<-355532001_? 511045000 N6-MTase*->?-><-Relaxase_TraI N6-MTase EcoRI_methylase - 179 bacteria>firmicutes Lachnospiraceae bacterium COE1 hypothetical protein [Lachnospiraceae bacterium COE1]. 737642640_?->511044992_?->665909753_?->511044994_?->511044995_?->511044996_?->511044997_?->511045000_N6-MTase*->511045001_?-><-511045002_Relaxase_TraI<-490160044_?||550995956_?->511045003_?->511045004_?->511045005_?-> # 68;DCM associated 497673944 <-VirB4_TraE<-PrgI<-VirB6_TrbL<-N6-MTase*||DUF3786->DCM+N6-MTase->HNH->?-><-Resolvase<-Relaxase_TraI N6-MTase EcoRI_methylase - 189 bacteria>firmicutes Ruminococcus flavefaciens hypothetical protein [Ruminococcus flavefaciens]. <-497673934_?<-497673935_?<-497673936_?<-497673938_?<-497673939_VirB4_TraE<-497673941_PrgI<-497673943_VirB6_TrbL<-497673944_N6-MTase*||497673945_DUF3786->497673946_DCM+N6-MTase->497673947_HNH->497673949_?-><-497673951_Resolvase<-657802676_Relaxase_TraI<-497673954_? 738971076 N6-MTase*->CMP-hydrolase->DCM->DCM->HNH->cpN6-MTase-> N6-MTase - - 180 bacteria>bacteroidetes Prevotella amnii sugar-phospahte nucleotidyltransferase [Prevotella amnii]. 
738971062_?->738971265_?->738971267_?->738971064_?->738971068_?->738971071_?->738971073_?->738971076_N6-MTase*->738971079_CMP-hydrolase->738971268_DCM->738971081_DCM->738971272_HNH->738971083_cpN6-MTase->738971085_?->738971274_?-> 739000163 <-cpN6-MTase<-N6-MTase<-DCM<-?<-?<-CMP-hydrolase<-N6-MTase* N6-MTase - - 180 bacteria>bacteroidetes Prevotella disiens sugar-phospahte nucleotidyltransferase [Prevotella disiens]. <-739000148_?<-739000151_cpN6-MTase<-739000155_N6-MTase<-739000157_DCM<-739000158_?<-739000160_?<-739000162_CMP-hydrolase<-739000163_N6-MTase*<-739000242_?<-739000165_?<-739000167_?<-739000169_?<-739000172_?<-739000174_?<-739000243_? 696600425 DCM->?->N6-MTase*-> N6-MTase EcoRI_methylase - 186 bacteria>firmicutes Oscillibacter sp. ER4 hypothetical protein [Oscillibacter sp. ER4]. <-696600348_?||696600350_?->696600352_?->696600354_?->696600356_?->696600358_DCM->696600360_?->696600425_N6-MTase*->696600362_?->696600364_?->696600367_?->696600370_?->696600427_?->696600430_?->696600372_?-> # 68; 61805950 N6-MTase*-> N6-MTase - PSSM2_078 197 viruses>dsdna viruses, no rna stage>caudovirales Prochlorococcus phage P-SSM2 hypothetical protein PSSM2_078 [Prochlorococcus phage P-SSM2]. 61805944_?->61805945_?->312281380_?->312281387_?->61805947_?->61805948_?->61805949_?->61805950_N6-MTase*->61805951_?->61805952_?->61805953_?->61805954_?->61805955_?->61805956_?->61805957_?-> 736557575 SSB->?->?->?->?->?->N6-MTase*->N6-MTase-> N6-MTase EcoRI_methylase - 189 bacteria>proteobacteria>epsilonproteobacteria Helicobacter sanguini hypothetical protein, partial [Helicobacter sanguini]. <-736557556_?||736557574_SSB->736557560_?->736557563_?->736557565_?->736557566_?->736557567_?->736557575_N6-MTase*->736557568_N6-MTase->736557571_?-> 480695228 <-N6-MTase* N6-MTase - HMPREF1097_02603 187 bacteria>firmicutes [Clostridium] bolteae 90B8 hypothetical protein HMPREF1097_02603 [[Clostridium] bolteae 90B8]. 
<-480695224_?<-480695225_?<-480695226_?<-480695227_?<-480695228_N6-MTase*<-480695229_?<-480695230_?<-480695231_?<-480695232_?<-480695233_?||480695234_?-><-480695235_? 496675013 <-N6-MTase* N6-MTase MTS - 186 bacteria>bacteroidetes Tannerella sp. 6_1_58FAA_CT1 tRNA (adenine-N6)-methyltransferase [Tannerella sp. 6_1_58FAA_CT1]. <-496675006_?<-496675007_?||748669647_?-><-496675008_?||748669840_?->496675010_?->748669842_?-><-496675013_N6-MTase*<-496675014_?<-748669844_?<-496675016_?<-748669648_?<-748669649_?<-496675020_?||496675021_?-> 736576648 N6-MTase*-> N6-MTase - - 186 bacteria>proteobacteria>epsilonproteobacteria Helicobacter magdeburgensis hypothetical protein, partial [Helicobacter magdeburgensis]. 736576613_?->736576615_?->736576616_?->736576618_?->736576619_?->736576622_?->736576624_?->736576648_N6-MTase*->736576625_?->736576628_?->736576631_?->736576633_?->736576635_?->736576638_?->736576641_?-> 655453737 N6-MTase*-> N6-MTase - - 185 bacteria>proteobacteria Proteobacteria bacterium JGI 0000113-L05 tRNA (adenine-N6)-methyltransferase [Proteobacteria bacterium JGI 0000113-L05]. 655453730_?->655453731_?->655453732_?->655453733_?->655453734_?->655453735_?->655453736_?->655453737_N6-MTase*->655453738_?->655453739_?->655453740_?->655453741_?->655453742_?-><-655453743_?<-655453744_? 740463513 <-N6-MTase* N6-MTase - - 182 bacteria>firmicutes [Clostridium] bolteae tRNA (adenine-N6)-methyltransferase [[Clostridium] bolteae]. <-488633370_?<-488635648_?<-740463510_?<-488635650_?<-740463513_N6-MTase*<-488635653_?<-488628681_?<-740463515_?||488635656_?-><-488635657_?<-740463417_?<-488635660_? 753855623 N6-MTase*-> N6-MTase - - 182 bacteria>spirochaetes Treponema primitia hypothetical protein [Treponema primitia]. 
<-505823760_?<-505823761_?||505823762_?->505823763_?->505823764_?->505823765_?->753855622_?->753855623_N6-MTase*->505823768_?->505823769_?->505823770_?->505823771_?->505823772_?->505823773_?->505823774_?-> 740438568 <-N6-MTase* N6-MTase EcoRI_methylase - 181 bacteria>firmicutes [Clostridium] clostridioforme tRNA (adenine-N6)-methyltransferase [[Clostridium] clostridioforme]. <-488659667_?<-488659666_?<-488659665_?<-488659664_?<-488659663_?<-488659662_?<-488659661_?<-740438568_N6-MTase*<-488659660_?<-488659659_?<-740438566_?<-488659657_?<-488659656_?<-488659655_?<-488659654_? 763125949 <-REase+N6-MTase<-?<-?<-?<-?<-N6-MTase*<-?<-cpN6-MTase N6-MTase EcoRI_methylase - 179 bacteria>firmicutes Lactobacillus salivarius sugar-phospahte nucleotidyltransferase [Lactobacillus salivarius]. <-763125942_?<-763125943_?<-763125944_REase+N6-MTase<-763125945_?<-763125946_?<-763125947_?<-763125948_?<-763125949_N6-MTase*<-763125950_?<-763125951_cpN6-MTase<-763125952_?<-763125953_?<-763125954_?<-763125955_?<-763125956_? 333739213 N6-MTase*-> N6-MTase - TREPR_0896 178 bacteria>spirochaetes Treponema primitia ZAS-2 sugar-phospahte nucleotidyltransferase [Treponema primitia ZAS-2]. <-333738183_?<-333739473_?||333740528_?->333738456_?->333738592_?->333739981_?->333738228_?->333739213_N6-MTase*->333738037_?->333740303_?->333739983_?->333738732_?->333741471_?->333740250_?->333739655_?-> 489159998 <-N6-MTase* N6-MTase CoA_binding_2 - 176 bacteria>firmicutes Streptococcus intermedius hypothetical protein [Streptococcus intermedius]. <-489159987_?<-739738008_?<-739738010_?<-489159991_?<-489159993_?<-489159995_?<-489159997_?<-489159998_N6-MTase*<-489160000_?<-489160003_?<-489160004_?<-489160006_?<-489160010_?||489160012_?->489160014_?-> 754856863 <-Terminase_LS<-?<-HNH<-?<-?<-N6-MTase* N6-MTase - - 176 bacteria>firmicutes Lactococcus garvieae sugar-phospahte nucleotidyltransferase [Lactococcus garvieae]. 
<-754856851_?<-754856852_?<-754856854_Terminase_LS<-754856857_?<-754856859_HNH<-754857000_?<-754856861_?<-754856863_N6-MTase*<-754856865_?<-754857002_?<-754856867_?<-754857005_?<-754856869_?<-754857007_?<-754856871_? 501311522 <-N6-MTase*<-SSB<-?<-?<-RecT N6-MTase - - 174 bacteria>firmicutes Clostridium botulinum sugar-phospahte nucleotidyltransferase [Clostridium botulinum]. <-501302841_?<-501306070_?<-501311386_?<-501311817_?<-501313272_?<-501302860_?<-501311840_?<-501311522_N6-MTase*<-501312032_SSB<-501312919_?<-501302806_?<-501306054_RecT<-501310854_?<-501312320_?<-501311584_? 643012036 N6-MTase*-> N6-MTase - - 174 viruses>dsdna viruses, no rna stage>caudovirales Podovirus Lau218 putative phage protein [Podovirus Lau218]. 643012029_?->643012030_?->643012031_?->643012032_?->643012033_?->643012034_?->643012035_?->643012036_N6-MTase*->643012037_?->643012038_?->643012039_?->643012040_?->643012041_?->643012042_?->643012043_?-> 739429399 REase->?->?->?->?->?->N6-MTase*->N6-MTase->cpN6-MTase-> N6-MTase EcoRI_methylase - 174 bacteria>firmicutes Ruminococcus albus hypothetical protein [Ruminococcus albus]. 739429387_?->739429389_REase->739429391_?->739430531_?->739429393_?->739429395_?->739429397_?->739429399_N6-MTase*->739429401_N6-MTase->739429402_cpN6-MTase->739429404_?->739429406_?->739429408_?->739429410_?->739429412_?-> 294983942 N6-MTase*-> N6-MTase - ZPR_4103 171 bacteria>bacteroidetes Zunongwangia profunda SM-A87 N-6 adenine-specific DNA methylase [Zunongwangia profunda SM-A87]. <-294983935_?<-294983936_?<-294983937_?||294983938_?->294983939_?->294983940_?->294983941_?->294983942_N6-MTase*-><-294983943_?||294983944_?->294983945_?->294983946_?->294983947_?->294983948_?-><-294983949_? 118197649 <-REase<-?||?->N6-MTase*-> N6-MTase EcoRI_methylase YS40_029 169 viruses>dsdna viruses, no rna stage>caudovirales Thermus phage phiYS40 sugar-phospahte nucleotidyltransferase [Thermus phage phiYS40]. 
<-118197642_?<-118197643_?<-118197644_?<-118197645_?<-118197646_REase<-118197647_?||118197648_?->118197649_N6-MTase*-><-118197650_?<-118197651_?<-118197652_?||118197653_?-><-118197654_?<-118197655_?<-118197656_? 343960410 <-REase<-?||?->N6-MTase*-> N6-MTase EcoRI_methylase TMA_029 169 viruses>dsdna viruses, no rna stage>caudovirales Thermus phage TMA sugar-phospahte nucleotidyltransferase [Thermus phage TMA]. <-343960403_?<-343960404_?<-343960405_?<-343960406_?<-343960407_REase<-343960408_?||343960409_?->343960410_N6-MTase*-><-343960411_?<-343960412_?<-343960413_?||343960414_?-><-343960415_?<-343960416_?<-343960417_? 652129793 N6-MTase*-> N6-MTase MTS - 168 bacteria>bacteroidetes Flavobacterium soli hypothetical protein [Flavobacterium soli]. 652128964_?->652129035_?->652129206_?->652129327_?->652129406_?->652129588_?->652129708_?->652129793_N6-MTase*-><-652129930_?<-652130157_?<-652130255_?<-652130455_?<-652130548_?<-652130719_?<-652130816_? 800940615 N6-MTase*-><-?<-?||?-><-N6-MTase<-HNH N6-MTase Methyltransf_26 - 168 bacteria>bacteroidetes Flavobacterium sp. 316 hypothetical protein [Flavobacterium sp. 316]. 800940601_?->800940603_?->800940605_?-><-800940607_?||800940609_?->800940611_?->800940613_?->800940615_N6-MTase*-><-800940617_?<-800940619_?||800940621_?-><-800940623_N6-MTase<-800940625_HNH<-800940627_?<-800940629_? 425715747 N6-MTase*-> N6-MTase SP+Dam HMPREF9282_00530 165 bacteria>firmicutes Veillonella ratti ACS-216-V-Col6b hypothetical protein HMPREF9282_00530 [Veillonella ratti ACS-216-V-Col6b]. 425715740_?->425715741_?->425715742_?->425715743_?->425715744_?->425715745_?->425715746_?->425715747_N6-MTase*->425715748_?->425715749_?->425715750_?->425715751_?->425715752_?->425715753_?->425715754_?-> 753824499 N6-MTase*-> N6-MTase - - 165 bacteria>bacteroidetes Zunongwangia profunda tRNA (adenine-N6)-methyltransferase, partial [Zunongwangia profunda]. 
<-502838502_?<-502838503_?<-502838504_?||502838505_?->502838506_?->502838507_?->502838508_?->753824499_N6-MTase*->502838511_?->753823556_?->502838513_?->502838514_?->502838515_?-><-753823558_?||502838519_?-> 765310844 SNF2->?->?->?->?->?->N6-MTase*->Methylase_S->N6-MTase->Methylase_S-> N6-MTase Methyltransf_23 - 165 bacteria>bacteroidetes Siansivirga zeaxanthinifaciens hypothetical protein [Siansivirga zeaxanthinifaciens]. 765310837_?->765310838_SNF2->765310839_?->765310840_?->765310841_?->765310842_?->765310843_?->765310844_N6-MTase*->765310845_Methylase_S->765310846_N6-MTase->765312112_Methylase_S->765310847_?->765310848_?->765310849_?->765310850_?-> # 68; R-M system 737023823 <-ParB+HNH<-N6-MTase* N6-MTase - - 136 bacteria>tenericutes Entomoplasma luminosum hypothetical protein, partial [Entomoplasma luminosum]. <-647290174_ParB+HNH<-737023823_N6-MTase* 738315783 N6-MTase*->ParB+HNH-> N6-MTase - - 120 bacteria>tenericutes Mesoplasma seiffertii hypothetical protein, partial [Mesoplasma seiffertii]. 652736106_?->652736107_?->652736108_?->652736109_?-><-652736110_?||652736111_?->652736112_?->738315783_N6-MTase*->738315769_ParB+HNH->652736114_?->738315772_?->652736117_?->652736118_?->652736120_?->652736122_?-> 738499893 N6-MTase*->ParB+HNH->cpN6-MTase-> N6-MTase EcoRI_methylase - 108 bacteria>tenericutes Mycoplasma pirum hypothetical protein, partial [Mycoplasma pirum]. <-738499704_?||652846545_?->652846547_?->652846548_?-><-738499706_?<-652846551_?<-738499707_?||738499893_N6-MTase*->652846554_ParB+HNH->738499896_cpN6-MTase->652846556_?-><-652846558_?||738499708_?->652846561_?->738499709_?-> # 3; 497336777 <-N6-MTase*<-SSB<-?<-RecT N6-MTase MTS - 189 bacteria>proteobacteria>epsilonproteobacteria Campylobacter sp. FOBRC14 hypothetical protein [Campylobacter sp. FOBRC14]. 
497336866_?->497336786_?->497336887_?-><-497336838_?<-736970004_?<-497336817_?<-497336860_?<-497336777_N6-MTase*<-497336833_SSB<-497336829_?<-497336785_RecT<-497336846_?<-497336835_?<-497336831_?<-497336775_? 500772938 RecT->?->SSB->N6-MTase*-> N6-MTase MTS - 189 bacteria>proteobacteria>epsilonproteobacteria Campylobacter curvus hypothetical protein [Campylobacter curvus]. <-500772932_?||500772933_?->754105770_?->754105772_?->500772935_RecT->754105775_?->500772937_SSB->500772938_N6-MTase*->500772939_?->500772940_?->754105777_?->754105779_?->500772942_?-><-754105874_?<-500772945_? 489041364 <-N6-MTase*<-?<-RecT N6-MTase MTS - 188 bacteria>proteobacteria>epsilonproteobacteria Campylobacter showae hypothetical protein [Campylobacter showae]. <-489041352_?<-489041353_?<-489041355_?<-489041357_?<-489041359_?<-489041360_?<-489041362_?<-489041364_N6-MTase*<-489041368_?<-489041371_RecT<-489041373_?<-489041374_?<-489041377_?<-489041378_?<-489041380_? # 2; 431004013 N6-MTase*->N6-MTase-><-PLDc N6-MTase RelB+EcoRI_methylase A15U_04136 415 bacteria>proteobacteria>gammaproteobacteria Escherichia coli KTE210 RelB/DinJ family addiction module antitoxin [Escherichia coli KTE210]. <-431004006_?<-431004007_?||431004008_?->431004009_?->431004010_?->431004011_?->431004012_?->431004013_N6-MTase*->431004014_N6-MTase-><-431004015_PLDc||431004016_?->431004017_?-><-431004018_?<-431004019_?<-431004020_? 692950787 N6-MTase*->N6-MTase-><-PLDc N6-MTase EcoRI_methylase - 344 bacteria>proteobacteria>gammaproteobacteria Escherichia coli restriction endonuclease subunit M [Escherichia coli]. <-585312902_?<-505582380_?<-446688926_?<-486190256_?<-486190259_?<-486190260_?<-486190261_?||692950787_N6-MTase*->486190280_N6-MTase-><-692950788_PLDc<-692946043_?||486190285_?->692950789_?-><-585312903_?<-486190286_? 
# 2; 323436523 <-ParB<-?<-ASCH<-?<-?<-ASCH||HTH-><-N6-MTase*<-?<-cpN6-MTase<-?||HTH-> N6-MTase SP Weevi_0265 278 bacteria>bacteroidetes Weeksella virosa DSM 16922 ParB-like nuclease [Weeksella virosa DSM 16922]. <-323436516_ParB<-323436517_?<-323436518_ASCH<-323436519_?<-323436520_?<-323436521_ASCH||323436522_HTH-><-323436523_N6-MTase*<-323436524_?<-323436525_cpN6-MTase<-323436526_?||323436527_HTH-><-323436528_?<-323436529_?<-323436530_? 754544258 <-ParB<-?<-ASCH<-?<-?<-ASCH||HTH-><-N6-MTase*<-?<-cpN6-MTase<-?||HTH-> N6-MTase - - 248 bacteria>bacteroidetes Weeksella virosa chromosome partitioning protein ParB [Weeksella virosa]. <-754544257_ParB<-503362712_?<-503362713_ASCH<-503362714_?<-503362715_?<-503362716_ASCH||503362717_HTH-><-754544258_N6-MTase*<-503362719_?<-503362720_cpN6-MTase<-503362721_?||503362722_HTH-><-503362723_?<-754544068_?<-503362725_? # 2; 748595473 <-ParB<-?<-?<-?<-?<-?<-REase<-N6-MTase* N6-MTase SP - 224 bacteria>proteobacteria>alphaproteobacteria Ochrobactrum intermedium hypothetical protein [Ochrobactrum intermedium]. <-748595461_ParB<-748595463_?<-748595465_?<-748595502_?<-748595466_?<-748595469_?<-493515914_REase<-748595473_N6-MTase*<-748595475_?<-748595477_?<-748595479_?||493515916_?->748595504_?->748595480_?->493515918_?-> 763458170 N6-MTase*->REase->?->?->?->?->?->ParB-> N6-MTase SP - 224 bacteria>proteobacteria>alphaproteobacteria Brucella abortus hypothetical protein [Brucella abortus]. <-763458159_?<-763458161_?<-748595504_?||763458163_?->763458165_?->763458167_?->748595475_?->763458170_N6-MTase*->763458172_REase->763458174_?->763458176_?->763458212_?->763458178_?->748595463_?->748595461_ParB-> # 2; 523845400 <-N6-MTase*<-?<-RecT N6-MTase - M638_00220 193 bacteria>firmicutes Listeria monocytogenes sugar-phosphate nucleotidyltransferase [Listeria monocytogenes]. 
<-523847903_?<-523847904_?<-523847905_?<-523847906_?<-523847907_?<-523847908_?<-523845399_?<-523845400_N6-MTase*<-523847909_?<-523845401_RecT<-523845402_?<-523845403_?<-523847910_?<-523847911_?<-523847912_? 752526171 N6-MTase*-> N6-MTase - - 176 bacteria>firmicutes Listeria monocytogenes sugar-phosphate nucleotidyltransferase [Listeria monocytogenes]. <-502716505_?<-644855401_?||770723372_?-><-489827227_?||497615352_?->506520144_?->559010092_?->752526171_N6-MTase*->770723442_?->558988748_?->770723481_?-> # 2; 488893482 <-N6-MTase* N6-MTase EcoRI_methylase - 180 bacteria>proteobacteria>epsilonproteobacteria Campylobacter MULTISPECIES: Sugar-phospahte nucleotidyltransferase [Campylobacter]. <-488893482_N6-MTase*<-488932284_? 488923876 <-N6-MTase* N6-MTase EcoRI_methylase - 180 bacteria>proteobacteria>epsilonproteobacteria Campylobacter coli Sugar-phospahte nucleotidyltransferase [Campylobacter coli]. <-488923876_N6-MTase*<-488932284_? # 1; 655449388 <-N6-MTase* N6-MTase DUF4238 - 238 bacteria>proteobacteria Proteobacteria bacterium JGI 0000113-E04 hypothetical protein [Proteobacteria bacterium JGI 0000113-E04]. <-655449381_?||655449382_?-><-655449383_?||655449384_?-><-655449385_?||655449386_?-><-655449387_?<-655449388_N6-MTase*||655449389_?-><-655449390_? 291336302 <-N6-MTase<-N6-MTase* N6-MTase - - 208 uncultured organism MedDCM-OCT-S12-C54 hypothetical protein, partial [uncultured organism MedDCM-OCT-S12-C54]. <-291336295_?||291336296_?-><-291336297_?||291336298_?-><-291336299_?||291336300_?-><-291336301_N6-MTase<-291336302_N6-MTase* 291336301 <-N6-MTase*<-N6-MTase N6-MTase - - 123 uncultured organism MedDCM-OCT-S12-C54 hypothetical protein [uncultured organism MedDCM-OCT-S12-C54]. <-291336295_?||291336296_?-><-291336297_?||291336298_?-><-291336299_?||291336300_?-><-291336301_N6-MTase*<-291336302_N6-MTase
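The gene-neighborhood strings listed above can be read programmatically. The following is a minimal sketch, not the authors' own tooling, and it rests on an interpretation of the notation that the listing itself does not spell out: a leading `<-` is assumed to mark a gene on the reverse strand, a trailing `->` a gene on the forward strand, `||` a boundary between divergently oriented genes, `?` an unannotated product, and a trailing `*` the query methylase. The names `Gene` and `parse_neighborhood` are hypothetical.

```python
import re
from typing import List, NamedTuple


class Gene(NamedTuple):
    gi: str         # numeric identifier preceding the underscore
    label: str      # domain annotation, "?" when unannotated
    strand: str     # "+", "-", or "?" when no arrow is attached
    is_query: bool  # True when the label carried a trailing "*"


# A label may contain hyphens (e.g. "N6-MTase") but never the arrow "->",
# so a hyphen is accepted only when it is not followed by ">".
_GENE = re.compile(r"(<-)?(\d+)_((?:[^<>|\-\s]|-(?!>))+)(->)?")


def parse_neighborhood(text: str) -> List[Gene]:
    """Split one neighborhood string into its genes (assumed notation)."""
    genes = []
    for match in _GENE.finditer(text):
        reverse, gi, label, forward = match.groups()
        strand = "-" if reverse else "+" if forward else "?"
        genes.append(Gene(gi, label.rstrip("*"), strand, label.endswith("*")))
    return genes


if __name__ == "__main__":
    # Fragment of the Thermus phage phiYS40 neighborhood listed above.
    example = ("<-118197646_REase<-118197647_?||118197648_?->"
               "118197649_N6-MTase*-><-118197650_?")
    for gene in parse_neighborhood(example):
        print(gene)
```

Under these assumptions the query gene of each record is the single entry with `is_query` set, and its neighbors on the same strand can be collected by walking outward until the strand changes.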
** Str-1 Str-2 Str-4 Str-5 Str-6 Str-7 FINAL ------------------------------------------------------HHHHHHHHHH----EEEHHHH---HHHHHHHHHHH-HH-----HH-------------------------------------------E----EEEE------------------------------------HHHHHHH-H-HH-H-------E-EEEEE-----HHHHH-HHH-H---------------------E-EEEEE------------EEE--------------------------E-EEEEE----------------------------------------------------------------------EEEE---------------------HHH---HHHH------------------------------HHHHH----------------------H--- ALIGN ------------------------------------------------------HHHHHHHHHHHH-HHHHHH-----HH-HHH--EE-----------------------------------------------------------EEE-------------------------------------HHHHHHH-H-HH-H-------E-EEEEE------HHHH-HHH-H---H---------------E-E-HEEE-------------EEE---E------------------------EEEEE----------------------------------------------------------------------EEEE-E------HH-----------HHH---HHHH------HH------H----HH---HH-HHHHHHHH-------------------------- HMM -----------------------------------E--EEEE-----------HHHHHHHHHHH---EEEE-------HHHHHHHHH-H-HH-----HHHHH-HH------------------------------E------E----EEEEE------H------H-HHHHHHH-----------HHHHHHHHH-H-HH-H-------E-EEEEE---HHHHHHH-HHH-H---H---------------E-E-EEEEE------------EEE---EE---------------------E-EEEEE----------------------------------------------------------------------EEEE-EEE-----------------HHH---HHHHHH--HHHH------H----H-----H---HHHHHH----------------------HH-- FREQ ------------------------------------------------------HHHHHHHHHHHH-HHHHHHH------HHHHHHHHH-HH--------------H-------------------------H--------------EE-------------------EEEE------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- PSSM 
------------------------------------------------------HHHHHHHHH------E-----------------------------------------------------------------------------EEEE------------------------------------HHHHHHH-H-HH-H---------EEEEE-----HHHHH-HHH-H---------------------E-EEEEE------------EEE--------------------------E-EEEE-----------------------------------------------------------------------EEE----------------------HHH---HHHHH-----HH------H---------------HHHHH----------------------H--- consensus/90% ...............................................ps.W.TP..hF..Ls..F..F.lDssA.....pN.bs..ahs.........ssL...p.a.........................................hahNPPYup...............................al.+Ah.........p......sVhLlPsc.ss.aa...h................-.........lchl...........GRl.F...................sss..bss.hlhla........................................................................................................................................................................................................ EMIHUDRAFT_111979_Emiliania_huxleyi_CCMP1516_551608163 IT-PAA--A-A-------------------D-AD-V--DILS--VQKTQQWETPPQVWEYVSARWA-VDFDACAS---PINALAPRYST-VD-----DDFLA-REDL-------------------------K--D------I----TIYCNPPYAL-D------RYGTGSCAA-----------IEPFVRRLV-E-LA-ST-R-GC-T-CIALVPVLSHQLWFH-TCV-T---G-A--A-S--G-GRAAH-E-IHWVQ----------GLLKW---NN-PF-H-E-REP-ASPYI--YPF-ALCVW------------------------------------------R-P--G-AP----P-DRA-----H----EVVA-SLPRP-SDD-----------VSR---SFHFRR--CRRR------G----CG---KV-RLLPRHVD----------------------LLRA---------------------------------- EMIHUDRAFT_240085_Emiliania_huxleyi_CCMP1516_551578908 
IA-PAA--A-A-------------------D-AD-V--DILS--VQKTQQWETPLPVWEYVSARWA-VDFDACAS---PINALAPRYST-VD-----DDFLA-REDL-------------------------K--D------I----TIYCNPPYAL-D------RYGTGSCAA-----------IEPFVRRLV-E-LA-ST-R-GC-T-CIALVPVLSHQSWFH-TCV-T---G-A--A-S--G-GRAAH-E-IHWVQ----------GLLKW---NN-PF-H-E-REP-ASPYI--YPF-ALCVW------------------------------------------R-P--G-AP----P-DLG-----D----SQYG-S-----CDA-----------HEY---VMHFT---------------------------------------------------------------------------------------------- DX12_RS0110285_Vibrio_parahaemolyticus_646896396 FS------S-A-------------------R-NG----------SSKQDKWQTPPAVFEKLNEEFN-FTLDATAE---PETALCDHYFT-ID-----DDAL--TQDW-------------------------G--N--Q--------TVYCNPPYSQ----------------------------LKDFAKKAQ---EE-AK-K-GA-T-VVMLVPARTDTKAFH-DYL-----S-H--G----E---------VRLIK----------GRLKF---L-------------MEGKE--QDA-A------------------------------------------------P--F-PS----M---------V----CVMG-------KDR-----------EQK---IGTTTQ-------------------------DALTLESK------------------------------------------------------------ Q331_RS21100_Afifella_pfennigii_736470177 ---------------------------M----VH-Q--SLY---SSRTEEWETPPALFERLDRIFG-FRLDACAS---PANRKCETWFS-AA-----DNAL--ERSW-------------------------AEHG-----------RVWLNPPYGR-R--------------------------IAGFMRKAF---EE-SQ-K-GA-L-VVALVPARTDTLWWH-EWV---N-G-K--A----D---------IVFLK----------GRLKY-------L-DEN-RRE-RSPAP--FPS-ALVVY-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- HMPREF0179_03455_Bilophila_wadsworthia_3_1_6_316921487 
MN--------P---------------------------ALF---SSAKEDWETPREFFERLDGEFH-FDLDVCAF---PHNAKCPTYFT-KE-----DDGL--ARDW-------------------------G--NR----------VCWMNPPYGK-A--------------------------IKAWMTKAL---DA-SR-R-GA-T-VVCLVPSRTDTAWWH-DTV-IAG-G-A-------E---------VRFAR----------GRLRF-------------VGA-EHPAP--FPS-AVVIF------------------------------------------R-P--P-PS--------------P----SQQKETNDENNDPQ-------------------------------------------------------------------------------------------------------------------- OAC_RS0107480_Vibrio_cyclitrophicus_515155813 FS------S-A-------------------R-TG----------NPKRDKWQTPPAVFKKLNEEFH-FTLDATAE---PETALCDHYFT-MD-----DDAL--TQDW-------------------------S--N--Q--------TVYCNPPYSQ----------------------------LKDFAKKAQ---EE-AN-K-GA-T-VVMLVPARTDTKAFH-DHL-----S-H--G----E---------VRLIK----------GRLKF---L-------------QDGEE--QDA-A------------------------------------------------P--F-PS----M---------V----CVMG-------NDV-----------EQK---IGTTTQ-------------------------DKLKLEPK------------------------------------------------------------ A148_RS0111015_Vibrio_splendidus_695353200 FS------S-A-------------------R-TG----------NPKRDKWQTPPAVFKKLNEEFH-FTLDATAE---PETALCDHYFT-MD-----DDAL--TQDW-------------------------S--N--Q--------TVYCNPPYSQ----------------------------LKDFAKKAQ---EE-AK-K-GA-T-VVMLVPARTDTKAFH-DHL-----S-H--G----E---------VRLIK----------GRLKF---L-------------QDGEE--QDA-A------------------------------------------------P--F-PS----M---------V----CV-------------------------------------------------------------------------------------------------------------------------------- HMPREF0179_RS04985_Bilophila_wadsworthia_749811142 
MN--------P---------------------------ALF---SSAKEDWETPREFFERLDGEFH-FDLDVCAF---PHNAKCPTYFT-KE-----DDGL--ARDW-------------------------G--NR----------VCWMNPPYGK-A--------------------------IKAWMTKAL---DA-SR-R-GA-T-VVCLVPSRTDTAWWH-DTV-IAG-G-A-------E---------VRFAR----------GRLRF-------------VGA-EHPAP--FPS-AVVIF------------------------------------------R-P--P-PS--------------P----SQQ------------------------------------------------------------------------------------------------------------------------------- OR63_RS06485_Clostridium_tetani_737140426 ---------------------------M--N-TA----VMF---SSETDLWATPQEFYNELNKEFN-FDLDPCAT---HENAKCPKYYT-VV-----EDGL--KQDW-------------------------Q--G--H--------KVFCNPPYGR-E--------------------------ISKWVEKAY---KE-SK-KENT-T-VVMLIPARTDTKYFH-SYI-Y---R-K--A---KE---------IRFIK----------GRLKF-------------GNA-KNSAP--FPS-MVVVF-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- VPUCM_1151_Vibrio_parahaemolyticus_UCM-V493_584469889 FS------S-A-------------------N-SG----------DKSKDKWQTPPEIFAQLNDRFG-FTLDAAAE---PETALCEKYFT-EE-----DDAL--KQDW-------------------------S--G--H--------VVFCNPPYSK----------------------------LRVFAKKAY---EE-SL-K-GT-T-VVMLVPARTDTQACH-DYL-----A-N--G----E---------MYFIR----------GRLKF---L-------------KVGEL--QDA-A------------------------------------------------P--F-PS----V---------V----CVLG-------PGV-----------ERKGGGLLTKKT-------------------------CCFGNKNLDEA----G---------------------------------------------------- HMPREF1020_RS23965_Clostridium_sp_7_3_54FAA_496656604 
-------------M------------------ND----ALL---SSKNMCWCTPPDFFAELDREFH-FELDPAST---DKSAKCAKHFT-PD-----DDGL--KQDW-------------------------G--G--Y--------RVFCNPPYGR-A--------------------------IADWVRKGY---EE-SR-KPGT-T-VVMLIPSRTDTAYFH-DWI-F---G-K--A---SE---------VRFLR----------GRLKF---------TDEDGNG-EDAAP--FPS-AVIVW------------------------------------------R----S-PE------STGRE-------FATWH-I---------------------------------------------------------------------------------------------------------------------------- CLOM621_RS14915_Clostridiales_492715347 -------------M------------------ND----ALL---SSKNMCWCTPPDFFAELDREFH-FELDPAST---DKSAKCAKHFT-PD-----DDGL--KQDW-------------------------G--G--Y--------CVFCNPPYGR-A--------------------------IADWVRKGY---EE-SR-KPGT-T-VVMLIPSRTDTAYFH-DWI-F---G-K--A---SE---------VRFLR----------GRLKF---------TDEDGNG-EDAAP--FPS-AVIVW------------------------------------------R----S-PE------STGRE-------FATWH-I---------------------------------------------------------------------------------------------------------------------------- CLOM621_08346_Clostridium_sp_M62/1_291074040 ------------------------------------------------MCWCTPPDFFAELDREFH-FELDPAST---DKSAKCAKHFT-PD-----DDGL--KQDW-------------------------G--G--Y--------CVFCNPPYGR-A--------------------------IADWVRKGY---EE-SR-KPGT-T-VVMLIPSRTDTAYFH-DWI-F---G-K--A---SE---------VRFLR----------GRLKF---------TDEDGNG-EDAAP--FPS-AVIVW------------------------------------------R----S-PE------STGRE-------FATWH-I---------------------------------------------------------------------------------------------------------------------------- VPUCM_RS05810_Vibrio_parahaemolyticus_740612810 
FS------S-A-------------------N-SG----------DKSKDKWQTPPEIFAQLNDRFG-FTLDAAAE---PETALCEKYFT-EE-----DDAL--KQDW-------------------------S--G--H--------VVFCNPPYSK----------------------------LRVFAKKAY---EE-SL-K-GT-T-VVMLVPARTDTQACH-DYL-----A-N--G----E---------MYFIR----------GRLKF---L-------------KVGEL--QDA-A------------------------------------------------P--F-PS----V---------V----CVLG------------------------------------------------------------------------------------------------------------------------------ G454_RS0114655_Desulfovirgula_thermocuniculi_654109520 M-------------------------------LN-R--GLF---SSASSEWETPQKFFETLDVEFG-FTLDVCAR---PENAKCPRYFS-PE-----EDGL--RQEW-------------------------A--PE----------VCWMNPPYGR-E--------------------------IGKWIQKAY---EE-AQ-K-GA-T-VVCLLPSRTDTAWWH-EYV-M---RAA-------E---------VRFIR----------GRLRF-------------GGA-ENGAP--FPS-CVVVF------------------------------------------R-P----GY--------------S--GLPVVK-------SMA--------AR---------------------------------------------------------------------------------------------------------- BN981_RS01320_Halobacillus_737532221 ---------------------------M--NKMD----VHY---SSKTNEWATPQDFFDELNTEFN-FTLDPCAT---PDNAKCDKYFT-EK-----DDGL--EQSW-------------------------E--G--E--------TVFCNPPYGR-G--------------------------IKHWVKKAY---QE-ST-KPNT-T-VVLLIPSRTDTRYFH-DYV-Y---H-K--S----E---------IRFLK----------GRLKF-------------GDG-SGNAP--FPS-MVAIYR------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- RH85_RS11625_Vibrio_ichthyoenteri_748690860 
YS------S-A-------------------R-TG----------EKQQDRWQTPAEIFRQLNDEFH-FTLDAAAE---PSTALCSNYFT-EQ-----DDAL--AKNW-------------------------G--S--H--------VVYCNPPYSK----------------------------LREFARKAY---EA-SL-T-GA-T-VVMLVPARTDTQAFH-HYL-----S-K--G----E---------VRFIK----------GRLKF---L-------------QAGEA--QNT-A------------------------------------------------P--F-PS----M---------I----CVLG-------AGV-----------ERK---MITVLQ-------------------------DSLHNAVV------------------------------------------------------------ TH16_RS01985_Staphylococcus_caprae_488372936 --------------------------------MS----VHF---SSKSNEWYTPQYLFDELNEKYQ-FTLDPCAS---HENAKCDKYFT-IE-----DDGL--TKDW-------------------------S--K--D--------IVFMNPPYGR-N--------------------------IKHWIKKAY---EE-SV-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-N--A---YN---------IKFLK----------GRIKF-------------GGA-VNSAP--FPS-AIVVF------------------------------------------KPKGDG-LK----------------------------------------------------------------------------------------------------------------------------------------------------- BZ26_RS0118830_Clostridium_botulinum_489480013 ---------------------------M--N-TA----VMF---SSETDLWATPQDFFDKLNKEFN-FDLDPCAT---KENAKCSKYFT-KE-----IDGL--KQDW-------------------------G--R--Y--------RVFCNPPYGR-E--------------------------IGKWVEKAY---KE-SK-KQNT-T-VVMLIPARTDTKYFH-SYI-Y---H-K--A---KE---------IRFIK----------GRLKF-------------GNA-KNSAP--FPS-MIVVFRG------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ SD74_RS18965_Clostridium_botulinum_752703286 
---------------------------M--N-TA----VMF---SSETDLWATPQDFFDELNKEFD-FDLDPCAT---HENAKCDKYYT-IV-----EDGL--KQDW-------------------------Q--G--H--------KVFCNPPYGR-G--------------------------IKDWVEKAY---KE-SK-KENT-T-VVMLIPARTDTKYFH-SYI-Y---H-K--A---KE---------IRFIK----------GRLKF-------------GDA-KNSAP--FPS-MVVVF-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- J532_4398_Acinetobacter_baumannii_691154760 -N--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWD-------KGASHE----------------------------------------------------------------------------------------------------------------- K041_RS17240_Acinetobacter_baumannii_690981431 MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWD-------KGASHE----------------------------------------------------------------------------------------------------------------- W9I_03525_Acinetobacter_nosocomialis_493629840 
MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGR-E--------------------------IVDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------ J532_4398_Acinetobacter_baumannii_940793_630464595 ---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWD-------KGASHE----------------------------------------------------------------------------------------------------------------- J517_3010_Acinetobacter_baumannii_691065210 MN--------T----------------M----TK-N--KLFGLAEERTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VNWE-------KSA-------------------------------------------------------------------------------------------------------------------- LJ44_RS16470_Acinetobacter_baumannii_447017697 
MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------ F985_01871_Acinetobacter_sp_NIPH_973_490838153 MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PGNAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGR-E--------------------------IVDWIAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------ J689_1368_Acinetobacter_calcoaceticus/baumannii_complex_645913983 MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGR-E--------------------------IVDWIAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------ J660_0735_Acinetobacter_calcoaceticus/baumannii_complex_493629922 
MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGC-E--------------------------IVDWIAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------ J523_3197_Acinetobacter_baumannii_691027491 MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VSWE-------KSA-------------------------------------------------------------------------------------------------------------------- J660_0735_Acinetobacter_baumannii_88816_593668543 ---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGC-E--------------------------IVDWIAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------ ACIN5021_2863_Acinetobacter_sp_OIFC021_444754682 
---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGR-E--------------------------IVDWIAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------ J660_1691_Acinetobacter_baumannii_691157882 MN--------S----------------M----SK-N--KLFGLAEDRTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VNWE-------KSA-------------------------------------------------------------------------------------------------------------------- K035_3853_Acinetobacter_baumannii_691039522 MN--------S----------------M----TK-N--KLFGLADDRTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWD-------KGASHE----------------------------------------------------------------------------------------------------------------- RL05_RS02180_Staphylococcus_aureus_446374007 
--------------------------------ME----VHY---SSKTNEWTTPQHLFDDLSEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ----------------------------------------------------------------------------------------------------------------------------------------------------- FL80_RS05360_Acinetobacter_baumannii_690988986 MN--------N----------------M----TK-N--KLFGLAEERTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AN-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VNWE-------KSA-------------------------------------------------------------------------------------------------------------------- PI74_RS05125_Clostridium_botulinum_500994137 ---------------------------M--N-TA----VMF---SSGTDLWATPQDFFDKLNKEFD-FDLDPCAT---HKNAKCSKYFT-KE-----IDGL--KQDW-------------------------Q--G--Y--------KVFCNPPYGR-S--------------------------IKDWVEKAY---KE-SK-KENT-T-VVMLIPARTDTRYFH-EYI-Y---N-K--A---KE---------IRFVK----------GRLKF-------------GDA-KNSAP--FPS-MVVVF-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- ABBL099_02355_Acinetobacter_baumannii_690996743 
MN--------N----------------M----TK-N--KLFGLAEERTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AN-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VSWE-------KSA-------------------------------------------------------------------------------------------------------------------- J697_3983_Acinetobacter_baumannii_691093639 MN--------S----------------M----TK-N--KLFGLADDRTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-Q--S-LI--------------D----VSWE-------KSA-------------------------------------------------------------------------------------------------------------------- RQ87_RS18135_Acinetobacter_baumannii_447010248 MN--------S----------------M----TK-N--KLFGLADDRTDVWATPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VSWE-------KSA-------------------------------------------------------------------------------------------------------------------- V006_02512_Staphylococcus_aureus_686297326 
--------------------------------ME----VHY---SSKTNEWTTPQNLFDELNGEFN-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ----------------------------------------------------------------------------------------------------------------------------------------------------- A11W_RS0107210_Staphylococcus_hominis_515743089 --------------------------------ME----VHY---SSKSNEWATPQNLFDELNEEFN-FTLDPCAT---DENAKCSKYFT-IE-----DDGL--SKDW-------------------------S--K--D--------VVFMNPPYGR-E--------------------------IKKWNKKAY---EE-SL-N-GA-T-VVCLIPARTDTTYWH-DFI-F---D-R--A---DD---------IRFLR----------GRLKF-------------GNS-KNSAP--FPS-AIVVY------------------------------------------R----G-VTT---------------------------------------------------------------------------------------------------------------------------------------------------- G454_RS0102995_Desulfovirgula_thermocuniculi_654100680 M-------------------------------FN-R--VLF---SSATSEWETPQELFARLHAEFG-FTLDVCAR---PWNAKCTRYFS-PE-----QNGL--IQEW-------------------------A--PE----------TCWMNPPYGR-E--------------------------ISRWVRKAW---EE-AQ-K-GA-T-VVCLLPSRTDTAWWH-EYV-M---RAA-------E---------IRFIR----------GRLHF-------------EGA-KNGAP--FPS-CVVVF------------------------------------------R-P----GC--------------T--GPPVIR-------SMA--------AR---------------------------------------------------------------------------------------------------------- J546_RS10975_Acinetobacter_baumannii_736663998 
---------------------------M----AN-H--QLFGLAENRTDIWATPQDFFDKLNAVFK-FDLDVCAL---PNNAKCERFFS-PE-----DDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------IIEWVAKAA---CT-AK-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KSNAP--FGC-CVVVF------------------------------------------R-P--T-LN--------------D----VEWE-------NA---GGGV-------------------------------------------------------------------------------------------------------------- AWRIB429_RS09790_Oenococcus_oeni_768719850 ---------------------------M--N-NE----LMF---SSKTDLWSTPNDFFDKLNDEFH-FTLDPCST---HENAKCYKHFT-KE-----ENGL--LQDL-------------------------G--N--E--------VVFCNPPYGR-Q--------------------------IKDWVKKSY---EE-SQ-KDNT-T-VVMLIPARTDTIYFH-EYI-Y---H-K--A----E---------IRFIK----------GRLKF-------------GNA-KNSAP--FPS-MVVIF--E----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- NZ45_03810_Clostridium_botulinum_700273311 ---------------------------M--N-TA----VMF---SSETDLWATPQDFFDKLNKEFD-FDLDPCAT---HENAKCSKYFT-KE-----IDGL--KQDW-------------------------Q--G--H--------KVFCNPPYGR-G--------------------------IKDWVEKAY---KE-SK-KENT-T-VVMLIPARTDTRYFH-EYI-Y---H-K--A---KE---------IRFVK----------GRLKF-------------GSA-KNSAP--FPS-MVVVFRGE----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- RK90_RS13240_Staphylococcus_aureus_446374006 
--------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ----------------------------------------------------------------------------------------------------------------------------------------------------- SA930_RS14870_Staphylococcus_aureus_446374005 --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G-------------------------------------------------------------------------------------------------------------------------------------------------------- VII00023_15021_Vibrio_ichthyoenteri_ATCC_700023_342803448 YS------S-A-------------------R-TG----------EKQQDRWQTPAEIFRQLNDEFH-FTLDAAAE---PSTALCSNYFT-EQ-----DDAL--AKNW-------------------------G--S--H--------VVYCNPPYSK----------------------------LREFARKAY---EA-SL-T-GA-T-VVMLVPARTDTQAFH-HYL-----S-K--G----E---------VRFIK----------GRLKF---L-------------QAGEA--QNT-A------------------------------------------------P--F-PS----M---------I----CVLG-------AGV-----------ERK---MITVLQ-------------------------DSLHNAVV------------------------------------------------------------ KU40_RS04850_Clostridium_botulinum_737823765 
---------------------------------------MF---SSKTDMWSTPQDFYNKLNQEFN-FNLDPCST---NENAKCERHYT-IA-----EDGL--KQNW-------------------------V--G--S--------TVFCNPPYGR-V--------------------------LKDWVKKCY---EE-SK-KDNT-T-VVMLIPARTDTTYFH-NYI-Y---K-K--V---KE---------IRFIR----------GRLKF-------------GDC-KNAAP--FPS-MVVVF-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- DESKU_RS03925_Desulfotomaculum_kuznetsovii_503587829 M-------------------------------LN-E--SMF---SSRTGEWETPQTFFDALDAEFH-FTLDVCAR---PENAKCARFFT-PE-----QDGL--RQSW-------------------------A--GE----------TCWMNPPYGR-E--------------------------IGRWVEKAY---NE-AR-R-GA-V-VVALLPARTDTRWWH-RYV-M---RAA-------E---------IRFVE----------GRLKF-------------GGA-ENSAP--FPS-VVVVF------------------------------------------T-P--EKAV--------------S--DGPVVR-------SMR--------VK---------------------------------------------------------------------------------------------------------- CO98_RS04645_Staphylococcus_aureus_739716594 --------------------------------MS----VHF---SSKSNEWTTPQYLFDELNEEFN-FTLDPCAT---DENAKCSKYFT-IE-----DDGL--SKDW-------------------------S--N--D--------VVFMNPPYGR-E--------------------------IKKWIKKAY---EE-SL-N-GA-T-VVCLIPARTDTTYWH-DFI-F---D-K--A---DD---------IRFLK----------GRLKF-------------GNS-KNSAP--FPS-SIVIY------------------------------------------E----C-KEAEQ-------------------------------------------------------------------------------------------------------------------------------------------------- ERS140248_02184_Staphylococcus_aureus_678260344 
--------------------------------ME----VHY---SSKTNEWATPQNLFDDLNREFN-FTLDPCST---DENAKCQKHYT-AK-----DNGL--IQDW-------------------------S--E--D--------VVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SV-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------SES-KNSAP--FPS-AIIVY------------------------------------------R----G-GR----------------------------------------------------------------------------------------------------------------------------------------------------- HMPREF9988_RS10060_Staphylococcus_epidermidis_488427723 --------------------------------ME----VHY---SSKSNEWATPQKLFDELDKEFN-FTLDPCAT---DENAKCNKHFT-IE-----DDGL--SKDW-------------------------S--K--D--------VVFMNPPYGR-E--------------------------IKKWIKKAY---EE-SL-N-GA-T-VVCLIPARTDTTYWH-DFI-F---D-K--A---DD---------IRFLR----------GRLKF-------------GNS-KNSAP--FPS-AIVVY------------------------------------------L----G-VTT---------------------------------------------------------------------------------------------------------------------------------------------------- T666_02640_Staphylococcus_aureus_686391504 --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKCWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ----------------------------------------------------------------------------------------------------------------------------------------------------- BBRE_RS02915_Bifidobacterium_breve_518557238 
MS--------DFTG------------------AG-G--AAY---MSNRMNWETPQELFDQLDAEFH-FTLDAASS---ATNHKCQKYYT-AE-----DSAF--DHEW-------------------------G--G--E--------TVFCNPPYGK-A--------------------------IAEWVRKCS---AE-AS-RKDT-L-VVMLLPARTDTRWFQ-QFI-L---N-R--A----E---------VRFLK----------GRLRF-E-TN--------GIP-GGPAP--FPS-MIVVM------------------------------------------R-T--G-ER----------------------------------------------------------------------------------------------------------------------------------------------------- AS94_12270_Staphylococcus_aureus_686449191 --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVEKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ----------------------------------------------------------------------------------------------------------------------------------------------------- Q332_RS01180_Pseudobacteroides_cellulosolvens_739064083 -------------MN-----------------TE----IMF---SSKSDEWETPQQFFDKLHKEFN-FQLDVCAT---AENAKCDKYYT-KI-----DDGL--SQSW-------------------------H--HWAQ--------RCWMNPPYGR-N--------------------------IDKWIKKAF---DE-SQ-E-GA-T-VVCLIPARTDTKYWH-TYC-M-----K--A---HE---------IRFVK----------GRLKF-------------SNS-KDCAP--FPS-AIVVF------------------------------------------K-P--T-LK--QLKVSSY-------------------------------------------------------------------------------------------------------------------------------------------- QI18_RS10395_Lactococcus_lactis_746045508 
---------------------------M--N-RE----LMF---SSKTDLWSTPWNFFEKLNDEFH-FTLDPCST---HENAKCYKHFT-IK-----EDGL--LQDW-------------------------G--N--E--------VVFCNPPYGR-K--------------------------IKDWVKKAY---EE-SQ-KDNT-T-VVMLIPARTDTIYFH-EYV-Y---H-K--A----E---------VRFIK----------GRLKF-------------GDA-KNAAP--FPS-MVVIF--RKDNQ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- APL_RS02615_Actinobacillus_pleuropneumoniae_500173972 ---------------------------M-------T--------NFDKNTWQTPPECANYVKRRWA-IKWDGAAT---VENKICDYFIT-PEI------------DF-LNFESID---R-II--------E-N--N--A--------RIFINPPYGR-----------------------GY---VEKFVKQAV---RLMNE-K-RC-F-VVMLLNADKSTEWFK-LIR-----E-H--A-T--E-------V-IDIVG----------KRVAF---IN-PI-T---KKP-VEDNP--KWQ-MFAVF------------------------------------------D-P--Y-AE--------------G----FVTS-------YVS-----------YDK---IKQVGT-----------------ND-N-NA--------------------------------------------------------------------- APPSER11_RS02705_Actinobacillus_pleuropneumoniae_491783102 ---------------------------M-------T--------DFDKNTWQTPPECANYVKRRWA-IKWDGAAT---VENKICDYFIT-PEI------------DF-LNFESID---R-II--------E-N--N--A--------RIFINPPYGR-----------------------GY---VEKFVKQAV---RLMNE-K-RC-F-VVMLLNADKSTEWFK-LIR-----E-H--A-T--E-------V-IDIVG----------KRVAF---IN-PI-T---KKP-VEDNP--KWQ-MFAVF------------------------------------------D-P--Y-AE--------------G----FVTS-------YVS-----------YDK---IKQVGT-----------------ND-N-NA--------------------------------------------------------------------- ABSDF2497_Acinetobacter_baumannii_SDF_169152788 
MN--------T----------------M----TK-N--KLFGLADDRTDVWATPQDFFEKLDRVFK-FDLDVCAL---PDNAKCERFFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---DT-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LV--------------D----VNWE-------KSA-------------------------------------------------------------------------------------------------------------------- SAZ172_RS05790_Staphylococcus_aureus_554679133 --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--LPS-AIIVY------------------------------------------R----G-AQ----------------------------------------------------------------------------------------------------------------------------------------------------- W619_00569_Staphylococcus_aureus_686419170 --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNGEFN-FTLDPCST---DENAKCQKHYT-AK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKHWVKKAY---EE-SV-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GES-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ----------------------------------------------------------------------------------------------------------------------------------------------------- Q7S_RS08715_Rahnella_aquatilis_505727589 
TS-EFA----S-------------------T-TP----------IEHKDRWQTPVEVFTALDLEFG-FYLDAAAD---YQNALCARYLT-EG-----DDAL--ATEW-------------------------E--S--Y-----G--AIWCNPPYSA----------------------------ITPWIEKAA---EQ-CRAQ-HQ-P-VVMLLPADTSTGWFS-LAL-----T-T--A-D--E---------IRFIT-----D----GRLSF---IN-AG-T---GKPGKNGNS--KGS-MLVIW------------------------------------------R-P--F-IK----P---------R----SQFT-------TVS-----------RDA---LITAGA-------------------------DYLQEVAA------------------------------------------------------------ ERIC1_1c08270_Paenibacillus_larvae_subsp_larvae_DSM_25719_567770034 MN--------K---------------------------VHY---SSKTDMWETPQNLFDRLNEEFK-FDLDVCAI---PENAKCKRYFT-PS-----EDGL--KQEW-------------------------K--G-----------ACWMNPPYGR-Q--------------------------IGKWIAKAY---ES-SL-E-GA-T-VVCLVPSRTDTKWWH-GYC-M---K-G-------E---------IRFIR----------GRLKF-------------GGS-PHNAP--FPN-AVVIF------------------------------------------R-G--R-KE-------------SL----HGQK-------RNE--------TKDDCA------------------------------------------------------------------------------------------------------ J596_3741_Acinetobacter_baumannii_691117543 MN--------S----------------M----AK-L--GLYGNAEGKTDVWATPQNLFDALDQIFN-FDLDVCAL---PENAKCERYFT-PE-----LDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------ISLWIEKAV---ET-AN-N-GH-T-VVGLLPVRTDVVWWQ-EHI-L---H-R-------E---------IHYIK----------GRLKF-------------GGS-KHNAP--FGC-ALVVF------------------------------------------R-P--S-LK--------------D----VQSD-------KSI-------------------------------------------------------------------------------------------------------------------- T259_RS08765_Clostridium_botulinum_748203410 
---------------------------M--N-TA----VMF---SSETDLWATPQDFFDKLNKEFN-FDLDPCAT---HENAKCSKYFT-KE-----IDGL--KQDW-------------------------Q--G--Y--------KVFCNPPYGR-V--------------------------LKDWVKKCY---EE-SL-KPNT-T-VVMLIPARTDTKYFH-EYI-Y---H-K--V---KE---------IRFVK----------GRLKF-------------GDA-KNSAP--FPS-MVVVF-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- U183_02276_Staphylococcus_aureus_686300364 --------------------------------ME----VHY---SSKTNEWTTPQNLFDDLNREFN-FTLDPCST---DENAKCQKHYT-EN-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKHWVKKAY---EE-SI-K-GA-T-VVCLIPARTDTTYWH-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GES-KNSAP--FPS-AIIVY------------------------------------------R----G-VR----------------------------------------------------------------------------------------------------------------------------------------------------- IH28_RS0115430_Acinetobacter_baumannii_663438128 ---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------S--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------G--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- BN927_RS09785_Lactococcus_lactis_554763517 
---------------------------M--N-KE----LMF---SSKTDLWSTPWNFFDKLNDEFH-FTLDPCST---HENAKCYKHFT-IE-----EDGL--LQDW-------------------------G--N--E--------VVFCNPPYGR-Q--------------------------IKDWVKKAY---EE-SQ-KDDT-T-VVMLIPARTDTIYFH-EYI-Y---H-K--A----E---------IRFIK----------GRLKF-------------GDA-KNAAP--FPS-MVVIF--RKDNQ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- MC49_RS06655_Morganella_morganii_738462811 MA-VHT----R---------------------------NTP---GEYKDSWQTPEWLFTALDLEFG-FYLDAAAS---DINALCSRYLT-EQ-----DDAL--KSEW-------------------------V--S--H-----G--AIWCNPPYSN----------------------------IRPWVEKAA---EQ-SRMQ-NQ-P-VVMLVPEDMSVGWFL-EAL-----K-T--V-D--E---------IRVIT-----G----GRINF---VN-PV-T---GEE-KKGNS--KGS-MLLIW------------------------------------------R-P--F-IT----P---------R----RLSS-------FAL-----------KQE---LEAIGN-------------------------QYLAEVSA------------------------------------------------------------ ERIC1_RS03940_Paenibacillus_larvae_738763505 MN--------K---------------------------VHY---SSKTDMWETPQNLFDRLNEEFK-FDLDVCAI---PENAKCKRYFT-PS-----EDGL--KQEW-------------------------K--G-----------ACWMNPPYGR-Q--------------------------IGKWIAKAY---ES-SL-E-GA-T-VVCLVPSRTDTKWWH-GYC-M---K-G-------E---------IRFIR----------GRLKF-------------GGS-PHNAP--FPN-AVVIF------------------------------------------R------------------------------------------------------------------------------------------------------------------------------------------------------------- F931_01759_Acinetobacter_pittii_507070967 
MN--------S----------------M----AK-L--GLYGNAEGKTDVWATPQNLFDAIDHIFN-FDLDVCAL---PENAKCDRYFT-PE-----LDGL--KQEW-------------------------V--G-----------TCWMNPPYGR-E--------------------------ISLWIEKAV---ET-AN-N-GH-T-VVGLLPVRTDVVWWQ-EHI-L---H-R-------E---------IHYIK----------GRLKF-------------GGC-KHNAP--FGC-ALVVF------------------------------------------R-P--S-LK--------------D----VRWE-------SSI-------------------------------------------------------------------------------------------------------------------- SAGV69_RS11740_Staphylococcus_aureus_506511035 --------------------------------ME----VHY---SSKTNEWTTPQHLFDDLNEEFS-FTLDPCST---DENAKCRKYYT-VK-----DNGL--IQDW-------------------------S--E--D--------IVFMNPPYGR-S--------------------------IKRWVKKAY---EE-SL-K-GA-T-VVCLIPARTDTTYWR-DYI-F---N-K--A---DD---------IRFLR----------GRLKF-------------GDS-KNSAP--FPS-AIIVY------------------------------------------R----G-AQ----------------------------------------------------------------------------------------------------------------------------------------------------- TT45_RS11045_Acinetobacter_baumannii_758882462 MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------IVDWISKAA---YT-AE-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWA-------MA---VNEFRKAESKEG------------------------------------------------------------------------------------------------------ ANACOL_RS13845_Anaerotruncus_colihominis_493931641 
-------------M------------------NK----ALL---SSKRLDWCTPRDFFDALDVEFH-FTLDAAAT---EKSAKCAKYYT-PE-----TDGL--SASW-------------------------A--G--E--------TVFCNPPYGR-E--------------------------IKAWIKKGF---EE-GQ-QSGT-T-VVLLIPSRTDTEYFH-KYI-L---G-K--A----E---------IRFLK----------GRLKF---------TDEEGLT-QDAAP--FPS-MLVIY------------------------------------------R----G-QG------KEQNDG----------------------------------------------------------------------------------------------------------------------------------------- Phi93_04_Lactococcus_phage_phi93_673939868 ---------------------------M--N-NE----LMF---SSKTDLWSTPNDFFDKLNDEFH-FTLDPCST---HENAKCYKHFT-KE-----ENGL--LQDW-------------------------G--N--E--------VVFCNPPYGR-Q--------------------------IKEWIKKSY---EE-SQ-KDNT-T-VVMLIPARTDTIYFH-EYI-Y---H-K--A----E---------IRFIK----------GRLKF-------------GNA-KNSAP--FPS-MVVIF--E----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- D478_RS25245_Brevibacillus_agri_748713908 ---------------------------------------MF---TSEREEWETPQDFFEKLNKEFG-FQLDVCAL---PTNAKCERYFT-PD-----EDGL--KQEW-------------------------T--G-----------VCWMNPPYGR-E--------------------------IGKWVKKAY---ES-AK-Q-GA-T-VVCLLPARTDVKWWH-DYC-M---K-G-------E---------IRLVR----------GRMKF-------------VGA-DNMAP--FPN-AVVIF------------------------------------------S-P--A-SA--------------G----CSYK-------AID--------K----------------------------------------------------------------------------------------------------------- C236_RS0118880_Brevibacillus_laterosporus_517503045 
------------------------------MAIN-E--GMF---TSSTDLWETPQDFFNQLNKEFG-FQLDVCAL---PENAKCERYFS-PD-----EDGL--QQEW-------------------------T--G-----------ICWMNPPYGR-Q--------------------------IGKWIKKAY---ES-SL-N-GA-T-VVCLIPARTDASWWH-AHC-M---K-G-------E---------IRLVK----------GRLKF-------------GGS-KWNAP--FPN-AVVIF------------------------------------------R-K--V-GS--------------Q----HSYK-------AID--------KYGYFI------------------------------------------------------------------------------------------------------ D478_26539_Brevibacillus_agri_BAB-2500_432181416 MI--------K----------------TSDNIIN-K--AMF---TSEREEWETPQDFFEKLNKEFG-FQLDVCAL---PTNAKCERYFT-PD-----EDGL--KQEW-------------------------T--G-----------VCWMNPPYGR-E--------------------------IGKWVKKAY---ES-AK-Q-GA-T-VVCLLPARTDVKWWH-DYC-M---K-G-------E---------IRLVR----------GRMKF-------------VGA-DNMAP--FPN-AVVIF------------------------------------------S-P--A-SA--------------G----CSYK-------AID--------K----------------------------------------------------------------------------------------------------------- BN981_00304_Halobacillus_trueperi_635344555 ---------------------------M--GKMN----VHY---SSKSNDWATPQDFFDGLDNEFN-FTLDPCAT---SENAKCDNYFT-IE-----DDGL--KQSW-------------------------E--G--E--------TVFCNPPYGR-E--------------------------IKLWVKKAF---QE-SK-KPNT-K-VVMLIPARTDTKYFH-DYI-Y---M-Q--A----R---------VRFIK----------GRLKF-------------GNG-KGNAP--FPS-MVVIF-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- BN911_RS03730_Morganella_morganii_738472851 
MA-GYA----S------------------------------NTAPEHKDSWQTPEWLFTALDLEFG-FYLDAAAS---DINALCSRYLT-EQ-----DDAL--KSEW-------------------------V--S--H-----G--AIWCNPPFSN----------------------------IRPWVEKAA---EQ-ARMQ-NQ-P-VVMLVPEDMSVGWFL-EAL-----K-T--V-D--E---------IRVIT-----G----GRINF---VN-PV-T---GEE-KKGNS--KGS-MFLIW------------------------------------------R-P--F-IT----P---------R----RLPS-------FAL-----------KQD---LESIGN-------------------------QYLAEVRA--A--------------------------------------------------------- J689_1349_Acinetobacter_baumannii_691068978 MN--------T----------------M----AQ-R--KLFGLAENRTDVWATPQDFFDKLNAVFN-FDLDVCAL---PENAKCERFFS-PE-----QNGL--KQEW-------------------------I--G-----------TCWMNPPYGR-E--------------------------IVDWIAKAA---YT-AE-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQW-----------------DKGASRE------------------------------------------------------------------------------------------------------- BN981_RS01350_Halobacillus_737533832 --------------------------------MN----VHY---SSKSNDWATPQDFFDGLDNEFN-FTLDPCAT---SENAKCDNYFT-IE-----DDGL--KQSW-------------------------E--G--E--------TVFCNPPYGR-E--------------------------IKLWVKKAF---QE-SK-KPNT-K-VVMLIPARTDTKYFH-DYI-Y---M-Q--A----R---------VRFIK----------GRLKF-------------GNG-KGNAP--FPS-MVVIF-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- RN38_RS21980_Hafnia_paralvei_746124400 
MS-EFA----S-------------------N-TP----------LEHKDRWQTPIEVFSALDAEFG-FYLDAAAE---HGNALCARYLT-ER-----DDAL--NSEW-------------------------V--S--Y-----G--AIWCNPPYSA----------------------------ITPWVEKAA---EQ-CKAQ-SQ-P-VVMLLPADTSTGWFS-LAL-----E-S--V-D--E---------VRLIT-----G----GRLSF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TVS-----------RDE---LIAIGS-------------------------GIMAGVKA--A--------------------------------------------------------- J594_4091_Acinetobacter_baumannii_259052_588219826 ---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------IVDWIAKAA---YT-AE-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------ TS65_RS13365_Aneurinibacillus_migulanus_759006369 M-------------------------------NT-A--VMF---SSATDEWATPQDFFDQLNQEFH-FTLDPCAT---HESAKCARYFT-EE-----DNGL--AQDW-------------------------T--GE----------IVFMNPPYGR-V--------------------------LGQWVKKAF---EE-SI-K-GA-T-VVCLLPARTDTRWFH-DYI-Y---HRA-------E---------IRFVK----------GRLKF-------------GDS-KNSAP--FPS-MVVIF------------------------------------------N-R--A-GV--------------KVGG----------------------------------------------------------------------------------------------------------------------------------- J595_RS19805_Acinetobacter_baumannii_691047241 
MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------IVDWIAKAA---YT-AE-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VQWI-------VT---ETDFRKAESKEG------------------------------------------------------------------------------------------------------ HMPREF0454_RS07315_Hafnia_alvei_490192932 MS-EFA----S-------------------N-TP----------LEHKDRWQTPIEVFAALDAEFG-FYLDAAAD---HGNALCARYLT-ES-----DDAL--NSEW-------------------------V--S--Y-----G--AIWCNPPYSA----------------------------ITPWVEKAA---EQ-CKAQ-SQ-P-IVMLLPADTSTGWFS-LAL-----E-S--V-D--E---------VRLIT-----G----GRLSF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TVS-----------RNE---LIAIGS-------------------------SIMAGVKA--A--------------------------------------------------------- GEAM_RS21330_Ewingella_americana_736793592 MN-EFA----S-------------------H-TP----------VEHKDRWQTPLEVFTALDLEFG-FYLDAAAD---DQNALCARYLS-EA-----DNAL--ATEW-------------------------E--S--Y-----G--AIWCNPPYSA----------------------------ITPWVEKAA---EQ-CHVQ-NQ-P-VVMLLPADTSTGWFA-QAL-----A-T--A-D--E---------IRFIT-----E----GRLSF---IN-AG-T---GKPGKNGNS--KGS-MLVIW------------------------------------------R-P--F-IK----P---------R----GQFT-------TVC-----------RDV---LLSIGA-------------------------DYLQEVAA------------------------------------------------------------ Mm0Y_RS16130_Morganella_morganii_802097985 
MA-GYA----S-------------------K-TA----------PEHKDSWQTPEWLFTALDLEFG-FYLDAAAS---DINALCSRYLT-EQ-----DDAL--KSEW-------------------------I--S--H-----G--AIWCNPPFSN----------------------------IRPWVEKAA---EQ-SRMQ-NQ-P-VVMLVPEDMSVGWFL-EAL-----K-T--V-D--E---------IRVIT-----G----GRINF---VN-PV-T---GEE-KKGNS--KGS-MFLIW------------------------------------------R-P--F-IT----P---------R----RVLN-------TTL-----------KQE---LEAIGN-------------------------QYLAEVSA------------------------------------------------------------ HMPREF0864_RS08005_Enterobacteriaceae_bacterium_9_2_54FAA_496089880 MS-EFA----S-------------------N-TP----------LEHKDRWQTPIEVFAALDAEFG-FYLDAAAD---HGNALCARYLT-ES-----DDAL--NSEW-------------------------V--S--Y-----G--AIWCNPPYSA----------------------------ITPWVGKAT---EQ-CRAQ-SQ-P-VVMLLPADTSTGWFS-LAL-----E-S--V-D--E---------VRIIT-----G----GRLAF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TVS-----------RDE---LIAIGS-------------------------DIMAGVKA--A--------------------------------------------------------- ACINWC323_A0077_Acinetobacter_sp_WC-323_425484490 ---------------------------M----AK-S--KLFGLAEDRTDVWATPQDFFDKLNAIFD-FDLDVCAL---PENAKCERYFT-PE-----IDGL--SQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------ISLWIEKAV---ET-AN-A-GY-T-VVALLPARTDVGWWQ-SHC-L---N-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-AVVVF------------------------------------------R-P--S-LN--------------D----VRWE-------QSQ-------------------------------------------------------------------------------------------------------------------- ACINWC323_RS01110_Acinetobacter_sp_WC-323_696306260 
MN--------S----------------M----AK-S--KLFGLAEDRTDVWATPQDFFDKLNAIFD-FDLDVCAL---PENAKCERYFT-PE-----IDGL--SQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------ISLWIEKAV---ET-AN-A-GY-T-VVALLPARTDVGWWQ-SHC-L---N-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-AVVVF------------------------------------------R-P--S-LN--------------D----VRWE-------QSQ-------------------------------------------------------------------------------------------------------------------- SG0729_Sodalis_glossinidius_str_'morsitans'_84779227 LI--------S-------------------N-TP----------KSFKDRWQTPIEVFRALDAEFN-FKLDAAAD---KSNALCKAFLT-EQ-----HDAL--KSDW-------------------------N--S--K-----G--AIFCNPPYSK----------------------------IMPWVKKAA---EQ-CKKQ-NQ-T-IVMLLPSDTSTAWFY-EAL-----K-T--S-D--E---------VRFIT-----E----GRLSF---IS-AE-T---GKEGKAGNS--KGS-VLFIW------------------------------------------R-P--W-RI----P---------G----CWMT-------YVQ-----------RKE---LLKKGV-------NRDNRTPLPAQR-------------------------------------------------------------------------- SGP1_RS15415_Sodalis_glossinidius_499730784 IV--------S-------------------Q-TP----------KACKDKWQTPVEIFRALDAEFG-FGLDAAAD---FANALCRRYLT-EE-----DDAL--NCEW-------------------------H--T--R-----G--AIFCNPPYSN----------------------------ITPWVSKAA---EQ-CAVQ-KQ-T-IVMLLPSDTSTGWFR-MGL-----E-S--V-D--E---------VRVIT-----G----GRLSF---IS-AA-T---GVCGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----N---------R----CQFT-------TVD-----------KSD---LIRIGT-------EAVR----EVAA-------------------------------------------------------------------------- AB64_RS00770_Escherichia_coli_486273694 
MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------L--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- J635_1953_Acinetobacter_baumannii_690997976 MN--------T----------------M----AK-L--GLFGNAEGRTDVWATPQKLFDALDQVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQEW-------------------------A--G-----------TCWMNPPYGR-E--------------------------IVDWISKAA---YT-AE-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LK--------------D----VKWG-------DQ--------------------------------------------------------------------------------------------------------------------- ARN_24250_Arsenophonus_nasoniae_284008293 LI--------S-------------------H-TP----------KPFKDRWRTPIGVFKTLDAEFN-FKLDAAAD---KNNALCKAFLT-EQ-----QDAL--TCDW-------------------------N--S--N-----G--AIFCNPPYSK----------------------------IMPWVKKAA---EQ-CRKQ-NQ-T-IVMLLPSDTSTAWFY-EGL-----N-T--A-D--E---------IRFIT-----E----GRLSF---VS-AE-T---GEQGISGNS--KGS-VLFIW------------------------------------------R-P--L-GR----E---------M----CRMT-------HIR-----------KKE---LLPLTI-------GCST---------------------------------------------------------------------------------- M655_RS0109725_Bacillus_sp_NSP21_737442515 
---------------------------------------MF---KSEREEWETPQEFFDKLNDEFG-FQLDVCAL---PTNAKCERYFT-PD-----DDGL--HQEW-------------------------T--G-----------VCWMNPPYGR-E--------------------------IGKWVKKAY---ES-AK-Q-GA-T-VVCLLPARTDVKWWH-DYC-M---K-A-------E---------IRLVR----------GRMKF-------------VGA-DNMAP--FPN-AVVIF------------------------------------------S-P--A-SA--------------G----CSYK-------AID--------K----------------------------------------------------------------------------------------------------------- BE89_RS22035_Escherichia_coli_446051431 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- AB660_RS05030_Chromobacterium_subtsugae_828144310 MT--------D----------------A----S-----IHF---RSSTDEWPTPQLLFDELHAEFQ-FTVDVCAT---PGNAKCPRYYT-RA-----DDGL--AQDW-------------------------S--AE----------TVWMNPPFGH-G--------------------------IKFWMEKAL---KS-AR-A-GA-T-VVCLVPSRTDTRWWH-RYA-MW--A-A-------E---------IRCLD----------KRLQF-------------DGG-SAKAP--FPA-VVIVF-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- CG67_RS0113335_Tatumella_sp_UCD-D_suzukii_647630238 
MT-D----K-S-------------------N-TS----------AEHKDSWQTPPEVFRALNAEFQ-FQLDAAAS---AHNALCRKYIT-AE-----QDTL--QTEW-------------------------G--D--YVE--NG--YAWLNPPYSA----------------------------PLPFVEKAG---KE-KELN-HV-G-CVMLLPADISVGWFK-EAV-----K-T--A-S--E---------VRLIT-----G----GRLAF---IS-SQ-T---GKP-VGGNN--KGS-LLIIW------------------------------------------H-P--W-PT----G---------S----CQFK-------TVD-----------RDQ---LINFGK-------------------------RLMERAA------------------------------------------------------------- ABOUO_79_Paenibacillus_phage_Abouo_525335850 ------------------------------MAIN-E--GMF---TSSTDLWETPQEFFNQLNQEFG-FQIDVCAL---PENAKCERYFS-PD-----EDGL--QQEW-------------------------T--G-----------ICWMNPPYGR-Q--------------------------IGKWIKKAY---ES-SL-N-GA-T-VVCLIPARTDARWWH-DYC-M---K-G-------E---------IRLVK----------GRLKF-------------GSS-KWSAP--FPN-ALVIF------------------------------------------K-E--A-GS--------------Q----HSYK-------AID--------KYGSLL------------------------------------------------------------------------------------------------------ SGP1_RS06170_Sodalis_glossinidius_754366340 LI--------S-------------------N-TP----------KSFKDRWQTPIEVFRALDAEFN-FKLDAAAD---KSNALCKAFLT-EQ-----HDAL--KSDW-------------------------N--S--K-----G--AIFCNPPYSK----------------------------IMPWVKKAA---EQ-CKKQ-NQ-T-IVMLLPSDTSTAWFY-EAL-----K-T--S-D--E---------VRFIT-----E----GRLSF---IS-AE-T---GKEGKAGNS--KGS-VLFIW------------------------------------------R-P--W-RI----P---------G----CWMT-------YVQ-----------RKE---LL------------------------------------------------------------------------------------------------- CLOSCI_00567_[Clostridium]_scindens_ATCC_35704_167664126 
-------------MTDRGKKGDLTMAPL--N--K----ALF---SSAKEDWATPQDFFDELNKEFH-FDLDPCAD---AENAKCKEFFT-KE-----QNGL--LQDW-------------------------G--G--R--------CVFCNPPYGRTS--------------------------TGEWIKKCY---EE-AQ-KPGT-V-VVALIPARTDTRFFH-DYI-Y---H-K--A----E---------IRFIK----------GRLHF-------------GGC-KDAAP--FPS-MVVVF-----RKGKENEEEKKTGCTAAGHTEEKAAEKDDGSENGVDGI------------------------------------------------------------------------------------------------------------------------------------------------------------- G468_RS0114315_Arsenophonus_nasoniae_652428396 LI--------S-------------------H-TA----------KPFKDRWQTPIEVFRTLDAEFT-FRLDAAAD---ENNALCTAFLS-EK-----ADAL--KCDW-------------------------N--S--D-----G--AIFCNPPYSN----------------------------IKPWVNKAA---EQ-CRKQ-KQ-T-IVMLLPSDTSTAWFY-EGL-----N-T--A-D--E---------IRFIT-----E----GRLLF---VS-AE-T---GEQGTSGNS--KGS-VLFIW------------------------------------------R-P--L-ER----E---------V----CKIT-------HIR-----------KKE---LLPLTT-------GCST---------------------------------------------------------------------------------- AT03_RS13490_Hafnia_alvei_647467325 MS-EFA----S-------------------N-TP----------LEHKDRWQTPIGVFSALDAEFG-FYLDAAAD---HGNALCARYLT-ER-----DDAL--NSEW-------------------------V--S--Y-----G--AIWCNPPYSA----------------------------ITPWVEKAA---EQ-CKAQ-SQ-P-IVMLLPADTSTGWFP-LAL-----E-S--V-D--E---------VRIIT-----G----GRLSF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TVS-----------RDE---LIAIGS-------------------------SIMAGVKA--A--------------------------------------------------------- MY84_RS08540_Escherichia_coli_446051432 
MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAV---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- HA49_RS14705_Tatumella_morbirosei_740176027 MT-D----K-S-------------------N-TP----------AEHKDTWQTPPEIFRALNAEFQ-FQLDAAAS---PHNALWRKFIT-AE-----QDTL--RTEW-------------------------G--D--YVE--NG--YAWLNPPYSA----------------------------PLPFVEKAA---KE-KELN-HV-G-CVMLLPADISVGWFR-EAV-----K-T--A-S--E---------VRLIT-----G----GRLAF---IS-SQ-T---GKP-VGGNN--KGS-LLIIW------------------------------------------H-P--W-PT----G---------S----CQFK-------TVD-----------RDQ---LMDFGK-------------------------RLIARAA------------------------------------------------------------- J644_3880_Acinetobacter_baumannii_691073319 ---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ J641_4016_Acinetobacter_baumannii_1188188_589421412 
MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ EMTOL_RS19950_Emticicia_oligotrophica_504839093 ---------------------------M----NI-K--AIF---SCKTTNWETPQDLFDELDKQYN-FTLDVCAT---SENAKCNEFFT-PE-----IDGL--KQEW-------------------------K--G-----------MCWMNPPYGR-E--------------------------IGKWVRKAH---LE-VI-T-GRCR-IIALLPARTDTKWFH-EWV-L---NKH-------E---------IKFIK----------GRLRF-------------SDS-KNSAP--FPS-MLVIF------------------------------------------E-G--R-----------------P-------------------------------------------------------------------------------------------------------------------------------------- BTS2_RS02440_Bacillus_sp_TS-2_780117918 --------------------------------MN-Q--AMF---SSSTDKWSTPQSFYDKLNQEFQ-FDIDVCAT---DSDKKCERYFS-PE-----QDGL--KQEW-------------------------T--G-----------ICWMNPPYGR-G--------------------------IGPWIQKAY---ES-SQ-Q-GA-T-VVCLLPSRTDTKWWH-EYC-M---K-G-------E---------IRFIK----------GRLKF-------------GDS-KNSAP--FPS-VVVIF------------------------------------------R-P--K-VV---------------------------------------------SM------------------------------------------------------------------------------------------------------ LILY_61_Bacteriophage_Lily_755258783 
MS--------N----------------T--MAVH------Y---SSKTDMWETPQDFFDKLHAEFG-FTLDVCAV---PENAKCERFFS-PD-----DNGL--LQNW-------------------------K--G-----------VCWMNPPYGR-Q--------------------------IGAWIAKAY---ES-SL-E-GA-T-VVCLVPSRTDTKWWH-DYC-L---K-G-------E---------VRFIK----------GRLKF-------------GGS-PHNAP--FPN-AIVIF------------------------------------------R-G--K-GQ----------------------------------------------------------------------------------------------------------------------------------------------------- GAP32_068_Cronobacter_phage_vB_CsaM_GAP32_414086984 -------------M------------------EDNNMSVHF---SSASNTWDTPDDFYQKLHAVWN-FTLDPAAM---DETAKCEKYYT-PE-----TDGL--AHSW-------------------------A--G--E--------TVWCNPPYGR-E--------------------------ISKWFKKFD---EE-FK-QNGT-T-IIALPPARTDTTYFH-KYV-R---D-S--A---TA---------ICFVK----------GRLKFDN-RSLPSWKEDGSHK-KTGAP--FPS-MIVIY------------------------------------------D----N-NI------TQEKYEVLNSLGFVVQP-FLLG------------------------------------------------------------------------------------------------------------------------- BTS2_0497_Bacillus_sp_TS-2_591276954 ------------------------------MTIN-Q--AMF---SSSTDKWSTPQSFYDKLNQEFQ-FDIDVCAT---DSDKKCERYFS-PE-----QDGL--KQEW-------------------------T--G-----------ICWMNPPYGR-G--------------------------IGPWIQKAY---ES-SQ-Q-GA-T-VVCLLPSRTDTKWWH-EYC-M---K-G-------E---------IRFIK----------GRLKF-------------GDS-KNSAP--FPS-VVVIF------------------------------------------R-P--K-VV---------------------------------------------SM------------------------------------------------------------------------------------------------------ J532_3860_Acinetobacter_baumannii_940793_630469298 
MN--------T----------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWF------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- J532_3860_Acinetobacter_baumannii_691154170 ---------------------------M----AQ-S--KLFGLAENRTDVWSTPQDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWF------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- BLAHAN_04217_Blautia_hansenii_DSM_20583_260542363 -----------------------------------------------------------------------------------MRETLH-AG-----RRRP--KARL-------------------------G--G--Y--------RVFCNPPYGR-A--------------------------IADWVRKGY---EE-SR-KPGT-T-VVMLIPSRTDTAYFH-DWI-F---G-K--A---SE---------VRFLR----------GRLKF---------TDEDGNG-EDAAP--FPS-AVIVW------------------------------------------R----S-PE------STGRE-------FATWH-I---------------------------------------------------------------------------------------------------------------------------- P262_01673_Cronobacter_sakazakii_CMCC_45402_564117231 
MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTRFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFVKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----R-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-AEGNP--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KCLEALKC--ENGKAA---------------------------------------------------- JO80_RS0108885_Cronobacter_malonaticus_696416059 MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTRFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFVKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----R-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-AEGNP--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRLEALKC--ENGKAA---------------------------------------------------- P262_RS05820_Cronobacter_sakazakii_752821882 MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTRFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFVKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----R-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-AEGNP--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KCLEALKC--ENGKAA---------------------------------------------------- ECC34793_RS0111695_Escherichia_coli_585346834 
MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YIWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- FL80_RS15355_Acinetobacter_baumannii_690990657 MS--------T----------------M----AK-L--GLFGNAEGRTDVWATPQTLFDALDQVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------IVDWISKAA---YT-AE-Q-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-SSNAP--FGC-AVVVF------------------------------------------R-P--S-LK--------------D----VQWG-------AQ--------------------------------------------------------------------------------------------------------------------- H627_RS17735_Lactobacillus_harbinensis_737460398 MS--------D--F-------------L--K-PG-G--AAL---TSNKDDWETPQAFFESLNAKYH-FAIDLAAS---KDNAKCDRYFS-VA-----DDSL--LQDWSD---------------------DFG--G-----------AMYLNPPYGR-H--------------------------IGDWVKKAY---ET-SL-RVNV-P-IVLLIPARTDTSYWH-DYI-F---G-K-------A--S------IKFIR----------GRLKF-E-QN--------GMA-GGPAP--FPS-AIIVY-------------------------------------N----G-D--G-AE--------------K-------------------------------------------------------------------------------------------------------------------------------------- ASU2_RS02700_Actinobacillus_491812488 
---------------------------M-------T--------DFDKNTWQTPQECRTYAKYRWL-VIWDGAAT---AENAICERFIT-PEI------------DF-LNFDAVT---Q-II--------P-N--H--A--------RIFINPPYGR-----------------------GY---VKKFVRQAI---RLMRE-K-QC-F-IVMLLNADKSTEWFQ-LIR-----E-N--A-T--E-------V-IDIIG----------QRVAF---IN-PV-T---GKP-VSDNP--KWQ-MFAVF------------------------------------------D-P--H-AE--------------G----FTTS-------YVT-----------YDK---ILEVAQ-----------------YD-K------------------------------------------------------------------------ ECOM_RS18005_Escherichia_coli_485729004 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNQLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- RJ36_RS12145_Enterobacteriaceae_bacterium_strain_FGI_57_506488274 MT-DYN--G-S-------------------N-TP----------ADQRDLWRTPPSLFASLDAEFC-FQLDAAAA---PLNALCRKFIT-AE-----QNTL--ETPW-------------------------A--N--YLT-VPG--YVWLNPPYSD----------------------------ITPFVKKSA---VE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CLFT-------TVE-----------RDH---LMAFGN-------------------------KLLARREA--A--------------------------------------------------------- IO46_03040_Gallibacterium_anatis_703606824 
-------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-Q-GF-L-VVMLLPADKSTAWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------ EC2850400_RS19395_Escherichia_coli_487555289 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--S-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- IO46_RS02995_Gallibacterium_anatis_757675697 ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-Q-GF-L-VVMLLPADKSTAWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------ SR72_RS20425_Enterobacter_cloacae_complex_695720049 
MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALSRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VTG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLSRREA--A--------------------------------------------------------- SEEM1923_08275_Salmonella_enterica_555233171 MT-D----K-S-------------------N-TP----------IEIKDLWRTPPEIFHALNAEFC-FVLDAAAN---AENALCRLYIT-EQ-----QNTL--FTPW-------------------------K--E--VMPDIPG--YVWLNPPYSR----------------------------PMPFVKKAV---NE-NEDN-GI-G-CVMLLPADISVSWFI-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AI-T---GKP-VNGNN--KGS-MLVIW------------------------------------------H-P--Y-PR----S---------GG---CRMN-------TVD-----------RNV---LMKYGK-------------------------RRMKVTA------------------------------------------------------------- WQ86_RS10285_Escherichia_446051430 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- PU01_RS20405_Hafnia_alvei_810918351 
MS-EFS----S-------------------N-TP----------LEHKDRWQTPIEVFAALDAEFG-FYLDAAAD---HRNALCARYLT-DR-----DDAL--NSEW-------------------------E--S--Y-----G--AIWCNPPYSA----------------------------ITPWVENAA---EQ-CKAQ-SQ-P-VVMLLPADTSTGWFS-LAL-----E-S--V-D--E---------VRLIT-----G----GRLSF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TVS-----------RDE---LIAIGS-------------------------GIMTGVKA--A--------------------------------------------------------- CLOSCI_RS06430_[Clostridium]_scindens_748651356 ------------------------MAPL--N--K----ALF---SSAKEDWATPQDFFDELNKEFH-FDLDPCAD---AENAKCKEFFT-KE-----QNGL--LQDW-------------------------G--G--R--------CVFCNPPYGRTS--------------------------TGEWIKKCY---EE-AQ-KPGT-V-VVALIPARTDTRFFH-DYI-Y---H-K--A----E---------IRFIK----------GRLHF-------------GGC-KDAAP--FPS-MVVVF-----RKGKENEEEKKTGCTAAGHTEEKAAEKDDGSENGVDGI------------------------------------------------------------------------------------------------------------------------------------------------------------- X858_RS0107890_Bacillus_subtilis_647261410 ------------------------------MDVH------F---SSKTDLWATPQYFFDELHKEFD-FELDVCAL---EDNAKCEKYFT-PE-----MDGL--KQEW-------------------------N--S-----------TCWMNPPYGR-G--------------------------IGEWVQKAY---ES-SL-K-GS-T-VVCLLPARTDTRWWH-DYC-M---K-G-------E---------IRLVK----------GRLKF-------------GES-KDNAP--FPN-AVVIF------------------------------------------G-E--K-AK--------------K----HTLI-------AM--------------------------------------------------------------------------------------------------------------------- EC2867750_RS19465_Escherichia_coli_487513381 
MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------GRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- CDL33251.1_Enterobacter_cloacae_ISC8_571251222 RS-GYG--G-S-------------------N-TP----------SDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AD-----QNTL--ETPW-------------------------A--D--CLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GLLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- EH105704_RS01255_Escherichia_hermannii_488393402 MT-D----K-S-------------------N-TP----------PEDKDRWRTPPEIFHALNAEFC-FVLDAAAS---KENALCRSYIT-EM-----QDTL--ATDW-------------------------N--A--VMPDIPG--YAWLNPPYSK----------------------------PMPFVKKAA---QE-NADN-FT-G-CVMLLPADTSVAWFR-EAI-----S-T--A-H--E---------VRFIT-----G----GRLSF---LN-AT-T---GKA-VNGNN--KGS-ILVIW------------------------------------------H-P--Y-PR----T---------H----CQFS-------TVE-----------RDV---LMEYGR-------------------------RRTKAAA------------------------------------------------------------- SEEA0292_19103_Salmonella_enterica_554632055 
MT-D----K-S-------------------N-TP----------IEIKDLWRTPPEIFHALNAEFC-FVLDAAAN---AENALCRLYIT-EQ-----QNTL--FTPW-------------------------K--E--VMPDIPG--YVWLNPPYSR----------------------------PMPFVKKAV---NE-NEDN-GI-G-CVMLFPADISVSWFI-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AI-T---GKP-VNGNN--KGS-MLVIW------------------------------------------H-P--Y-PR----S---------GG---CRMN-------TVD-----------RNV---LMKYGK-------------------------RRMKVTV------------------------------------------------------------- RDMS_RS01750_Deinococcus_sp_RL_736377798 M---------A---------------------------VHY---SSEKHDWTTPRSFFDELNAEFN-FTLDAAAS---PHNALCSRYFT-EA-----DDGL--SQPW-------------------------T--GT----------V-WCNPPYGR-Q--------------------------IGRWIAKAA---QS-AC-E-GA-T-VVMLIPARTDTAAWH-DHI-LFNPQ-A-------E---------VRFVR----------GRLRF-------------GDA-TANAP--FPS-AVIIF------------------------------------------R-P--G-GQ--------------G-------------------------------------------------------------------------------------------------------------------------------------- NV79_RS06670_Enterobacter_hormaechei_757619257 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AD-----QNTL--ETPW-------------------------A--D--CLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-ST-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RGE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- K035_3825_Acinetobacter_baumannii_691039509 
------------------------------------------------------QDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R------------------------------------------------------------------------------------------------------------------------------------------------------------- WQ88_RS24815_Escherichia_coli_823642731 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALHAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--S-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- YA69_RS12920_Cronobacter_sakazakii_765034080 MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTRFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFIKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-VNGNN--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRLEALKC--ENEKAA---------------------------------------------------- VE18_RS11090_Enterobacter_cloacae_782730169 
MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFAYLDTEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YIWLNPPYSD----------------------------ITPFVKKAA---AE-SS-N-QI-G-TVMLVPADTLVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- MSQ_RS0117370_Escherichia_coli_485798016 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PV----H---------T----ATLR-------PLI-----------VES------------------------------------------------------------------------------------------------------ BN129_RS03185_Cronobacter_sakazakii_495122741 MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTRFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PIPFVKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-AEGNP--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVD-----------RDE---LMAYGR-------------------------KRLEALKC--ENEKAA---------------------------------------------------- A323_gp73_Acinetobacter_bacteriophage_AP22_388570840 
--------------------------------MN----VHF---SSDKQTWETPQDLFDKLNDIFN-FNLDACAE---HDTAKVKKYFT-ID-----DNAL--IQDW-------------------------I--G-----------SVWCNPPYNR-E--------------------------QIKFIEKAL---NE-SL-KHKS-T-VVLLIPARPETKVWQ-NVIFK---S-A--S----Q---------ICFIK----------GRLKF-------------GNS-KYNAP--FPS-ALIVF-----------------------------------------------G-KH----------------IDLSEFG-FCVY------------------------------------------------------------------------------------------------------------------------- K035_3825_Acinetobacter_baumannii_42057_4_629017472 ------------------------------------------------------QDFFEKLDRVFN-FDLDVCAL---PENAKCERYFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGK-E--------------------------IIDWVAKAA---ET-AS-K-GH-T-VVALVPVRTDARWFQ-DYC-L---G-R-------E---------IHFIR----------GRLKF-------------GGS-KTNAP--FGC-CVVVF------------------------------------------R-P--S-LI--------------D----VSWE-------KSA-------------------------------------------------------------------------------------------------------------------- CSK29544_RS00070_Cronobacter_sakazakii_655998119 MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTRFID-EM-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFVKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-VNGNN--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRMEALKC--ENGKAA---------------------------------------------------- E05_32470_Plautia_stali_symbiont_549071051 
LI--------S-------------------N-TP----------KSFKDRWRTPIGVFKTLDAEFG-FKLDAAAD---KSNALCKAHLT-EQ-----QDAL--KCDW-------------------------N--S--K-----G--AIFCNPPYSK----------------------------IMPWVKKAA---EQ-CRKQ-KK-T-IVMLLPSDTSTAWFH-EAL-----K-T--S-D--E---------VRFIT-----E----GRLSF---IS-AE-T---GKEGKAGNS--KGS-VLFIW-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- BX81_RS08930_Escherichia_coli_693032238 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALHAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- HMPREF9548_RS22140_Escherichia_coli_446051428 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------IPPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- ECKD1_RS05095_Escherichia_coli_446051426 
MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAN---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- SG0744_Sodalis_glossinidius_str_'morsitans'_84779242 II--------S-------------------K-TP----------KVCKDRWQTPVEIFRALDAEFG-FGLDAAAD---HDNTRCRHYLT-EE-----DDAL--SCDW-------------------------H--T--R-----G--AIFCNPPYSN----------------------------IMPWVKKAA---EQ-CALQ-QQ-T-VVMLLPSDTSTAWFA-QAQ-----K-T--A-D--E---------VRFIT-----E----GRLSF---IS-AE-T---GEKGKAGNS--KGS-VLFIW------------------------------------------R-P--W-RI----T---------P----KGMT-------TVS-----------KQV---LIN------------------RMWG-------------------------------------------------------------------------- SG64_RS18700_Enterobacter_cloacae_798873157 NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFR-FQLDAAAA---PHNALCRRYIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVG-----------RDE---LMAFGA-------------------------KLIARREA--A--------------------------------------------------------- SGP1_RS06345_Sodalis_glossinidius_754365623 
II--------S-------------------K-TP----------KVCKDRWQTPVEIFRALDAEFG-FGLDAAAD---HDNTRCRHYLT-EE-----DDAL--SCDW-------------------------H--T--R-----G--AIFCNPPYSN----------------------------IMPWVKKAA---EQ-CALQ-QQ-T-VVMLLPSDTSTAWFA-QAQ-----K-T--A-D--E---------VRFIT-----E----GRLSF---IS-AE-T---GEKGKAGNS--KGS-VLFIW------------------------------------------R-P--W-RI----T---------P----KGMT-------TVS-----------KQV---LIN------------------RMWG-------------------------------------------------------------------------- MMA_RS11485_Janthinobacterium_sp_Marseille_501027971 --------------------------------MS-K--VHF---SSATPEWYTPQSTFDVLNAEFG-FTLDPCCT---HENAKCDRHFT-MA-----ENGL--SQDW-------------------------S--NE----------VTFMNPPYGR-E--------------------------IKEWMRKAY---ES-SL-S-GA-T-VVCLVPARTDTAWWH-DYS-I---K-G-------E---------IRFLR----------GRLKF-------------GGA-KTNAP--FPS-AIVIF------------------------------------------R-P-------------------------LPIK-------ELA-------------------------------------------------------------------------------------------------------------------- JO78_RS0107935_Cronobacter_malonaticus_696399167 MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTKFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFVKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-VNGNN--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRMEALKC--ENGKAA---------------------------------------------------- JP29_01125_Gallibacterium_anatis_702419560 
-------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRVA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDL------------------------------------------------------------------------------------------------------ JP29_RS01080_Gallibacterium_anatis_746017794 ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRVA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDL------------------------------------------------------------------------------------------------------ L361_01863_Enterobacter_sp_MGH_15_578296709 NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALSRKFIT-AE-----QNTL--ETPW-------------------------A--D--YMS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-LLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- PU15_RS12295_Escherichia_coli_757742433 
MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNVEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- EQS_RS0120745_Escherichia_sp_TW15838_446051429 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADISVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- MC75_02025_Klebsiella_pneumoniae_721491398 MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--AIWCNPPYSD----------------------------IGPWVEKAV---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- TJ25_RS25930_Escherichia_coli_766962597 
MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFST-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILACREA--A--------------------------------------------------------- SG79_RS07240_Enterobacter_cloacae_798841194 NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PYNALCRRFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------IMPFVKKAA---SE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- UOK_RS0120300_Cronobacter_sakazakii_742402431 MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNALCTKFID-EM-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFVKKAA---QE-NADH-SV-G-CVMLLPADTSAQWFK-EAI-----K-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-VNGNN--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRLEALKC--ENEKAA---------------------------------------------------- Q770_03340_Klebsiella_pneumoniae_subsp_pneumoniae_PittNDM01_667708234 
MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------A--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAI-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- RN00_RS02255_Klebsiella_pneumoniae_742851006 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-FLFIW------------------------------------------R-P--F-IS----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- DY84_RS0103980_Klebsiella_pneumoniae_639443683 MT-DYG--G-S-------------------N-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQNNP--KGS-ILFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- L347_RS0123085_Enterobacter_cloacae_complex_695653183 
NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALSRKFIT-AE-----QNTL--ETPW-------------------------A--D--YMS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-LLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- L371_00766_Enterobacter_sp_MGH_25_555187647 NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFTSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSE----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLASREA--A--------------------------------------------------------- H922_23640_Citrobacter_freundii_GTC_09629_486073301 SG-DYG--G-S-------------------K-TP----------PDQRDLWRTPPALFASLNAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------G--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- UOM_RS0110205_Cronobacter_malonaticus_742403302 
---D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNTLCTKFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFIKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----R-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-AEGNP--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRLEALKC--ENGKAA---------------------------------------------------- BN131_RS17085_Cronobacter_malonaticus_696395149 MT-D----K-S-------------------N-TP----------VEIKDLWQTPPEIYRALRSEFP-FFLDAAAS---QSNTLCTKFID-EK-----ENTL--EANW-------------------------L--S--KMPIGVGRAYAWLNPPYSA----------------------------PMPFIKKAA---QE-NADH-SV-G-CVMLLPADTSVQWFK-EAI-----R-T--A-H--E---------VRFIT-----G----GRLSF---LN-AS-T---GKP-AEGNP--KGS-MLIIW------------------------------------------H-P--W-PR----A---------GE---CRMT-------TVE-----------RDE---LMAYGR-------------------------KRLEALKC--ENGKAA---------------------------------------------------- ETDT_RS00685_Edwardsiella_tarda_737631614 MT--IK----S-------------------N-TP----------ASAKDCWQTPLWLFDALDIEFG-FWLDAAAS---ESNALCAKYLT-EE-----DNAL--GCEW-------------------------E--S--A-----G--AIWCNPPYSK----------------------------IGPWVAKAA---EQ-SDRQ-IQ-T-VVMLVPEDMSVGWFT-DAL-----K-S--V-D--E---------VRVIT-----G----GRVNF---VH-AV-T---GAE-QKGNS--KGS-MLLIW------------------------------------------R-P--F-IN----P---------R----RMIT-------TIS-----------KST---LEAIGR-------------------------PVRSAA-------------------------------------------------------------- SMDB11_RS12950_Serratia_marcescens_644361110 
MSLVYA----S-------------------N-TP----------AEHKDRWQTPIEIFSALDVEFG-FYLDAAAD---HGNALCARYLT-EQ-----DNAL--AVDW-------------------------E--S--Y-----G--AIWCNPPYSA----------------------------ITPWVEKAA---EQ-CRAQ-NQ-P-VVMLLPADTSTGWFS-LAL-----Q-S--V-D--E---------VRLIT-----D----GRLAF---IN-SA-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TVS-----------RDE---LIGIGT-------------------------DILREVAA------------------------------------------------------------ BR71_RS03710_Chromobacterium_haemolyticum_759948263 MK--------S----------------K----TD-EASIHF---RSTRDDWETPQDLFDALHAEFG-FTVDVCAS---DKTAKCVRYYT-KA-----DNGL--AKDW-------------------------S--NE----------VVWMNPPFGH-V--------------------------TKRWMDKAR---LS-SM-R-GA-T-VVCLVPARVSVLWWH-RNV-FL--A-S-------E---------VRCLR----------PRLQF-------------VGA-AQKAP--FDA-VLVIF------------------------------------------R-P--G-DT--------------Q----AKLS------------------------------------------------------------------------------------------------------------------------------ Q770_RS00955_Klebsiella_pneumoniae_763022815 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------A--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAI-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- RP28_RS19415_Leclercia_adecarboxylata_743514479 
MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-VE-----QNTL--ETPW-------------------------A--D--YLT-IPG--YAWLNPPYSD----------------------------ITPFVKKAA---AE-SK-N-QI-G-TVMLVPADTSVGWFR-EAI-----E-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLLIW------------------------------------------R-P--F-PR----T---------H----CHFA-------TVE-----------RDE---LMTFGA-------------------------KLLARREA--A--------------------------------------------------------- HMPREF0731_4170_Roseomonas_cervicalis_ATCC_49957_296263068 LG-RSG--T-T-----PS----F----L--N-TA-A----F---SSAYEAWATPPDLLERLYAAVGSIDLDPCSPGKLRSRVKAPRHFT-ER-----DDGL--AQEW-------------------------S--G-----------KVYMNPPYGR-T--------------------------IGAWTTKAR-V-EV-TAGR-AE-C-VVGLVPARTDTRWWH-ADV-A---G-H--A----H---------VWLLK----------GRLAF-------------GDG-STPAP--FPS-ALLLW---------------------GGN------------------A-P--------T-I--------------AEMS-A-----SFP-----------DAQ-H-IPARHR--------------------S----PDGAKREA--A--------------------------------------------------------- JP30_07420_Gallibacterium_anatis_IPDH697-78_702412378 -------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIEVLE----F-P--H--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-Q-GF-L-VVMLLPADKSTKWYQ-VIQ-----Q-N--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QVS-----------LDF---VKMCGW-----------------YG-K------------------------------------------------------------------------ JP30_RS07235_Gallibacterium_anatis_746078506 
---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIEVLE----F-P--H--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-Q-GF-L-VVMLLPADKSTKWYQ-VIQ-----Q-N--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QVS-----------LDF---VKMCGW-----------------YG-K------------------------------------------------------------------------ AC06_RS01890_Escherichia_coli_693106202 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---EKP-VSGNN--KGS-MLIIW------------------------------------------H-A--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- SG74_RS03495_Enterobacter_cloacae_798873142 MT-DFT--G-S-------------------N-TP----------AEQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--CLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-ST-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RGE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- ECNIH2_RS14490_Enterobacter_cloacae_764909418 
NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALSRKFIT-AE-----QNTL--ETPW-------------------------A--D--YMS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-LLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- J635_2258_Acinetobacter_baumannii_690998264 MN--------T----------------T----AK-L--GLFGNAEGRTDVWATPQKLFDALDQVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQDW-------------------------T--G-----------TCWMNPPYGR-E--------------------------ISLWIEKAV---QT-AN-Q-GH-T-VVGLLPTRTDVAWWQ-EHV-M---N-R-------E---------IHYIK----------GRLKF-------------GGC-KHNAP--FGC-AVVVF------------------------------------------R-P--S-LK--------------D----VQWG-------TQ--------------------------------------------------------------------------------------------------------------------- JP35_RS08425_Gallibacterium_anatis_746004460 MS-------------------------F--D----------------KDAYPTPISLFNQINDEFN-FTIDGAAL---PHNAKLDRYIT-PE-----MDFM--TYPL-----------------------E-N--E-----------RIWINPPFSD----------------------------LHSFVKRAV---DL-YENH-DC-L-VVMLLPVDISTRWFS-LIV-----E-K--A-T--E---------IRFIV--G-------GRIKF---LN-PE-T---DK--WTDVC--RGN-HLAIF------------------------------------------D-P--K-HK----A-----M---G----QVIR-------HVH-----------------IDNFAN--LE--------W------------R------------------------------------------------------------------- N561_00905_Gallibacterium_anatis_665836508 
-------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----Q-N--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------ HMPREF0485_04750_Klebsiella_sp_1_1_55_289774595 MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SIS-----------LAE---LKRIGN--WRLHDARRKRKRSPRPGSSLRRRDNQSDERK--A--------------------------------------------------------- N561_00905_Gallibacterium_anatis_12656/12_540073363 ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----Q-N--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------ F349_RS0103215_Enterobacter_cloacae_complex_516289844 
MT-DFT--G-S-------------------N-TP----------ADQRDLWRTPPALFSSLNAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------IMPFVKKAA---SE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRLIT-----A----GRLAF---IN-PV-T---DKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- X657_RS20595_Klebsiella_pneumoniae_694095222 MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SIS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- RK16_RS24410_Escherichia_coli_693100364 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- KPR_RS19370_Klebsiella_pneumoniae_529982416 
MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYIT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- MTE1_RS05745_Klebsiella_pneumoniae_490299083 MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIRT--QEAA--------------------------------------------------------------------------------------- A225_RS06730_Klebsiella_oxytoca_504650526 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RHSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- N625_RS17185_Klebsiella_pneumoniae_757706267 
MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- L420_RS03025_Enterobacter_cloacae_556358052 MT-DYT--G-S-------------------N-TP----------EDQRDLWRTPPALFAALNAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--GTPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDD---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- HMPREF0485_RS01290_Klebsiella_sp_1_1_55_695778461 MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SIS-----------LAE---LKRIGN--------------------------------------------------------------------------------------------- TB84_RS11250_Klebsiella_pneumoniae_749592663 
MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVHF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IS----P---------R----HIIT-------TVS-----------LAE---LKRIGT--LEAA--------------------------------------------------------------------------------------- N035_RS243200_Klebsiella_pneumoniae_589884974 MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- L461_RS22430_Klebsiella_pneumoniae_556221454 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- N559_RS20150_Klebsiella_pneumoniae_530706273 
MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RHSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGT--LEAA--------------------------------------------------------------------------------------- TB56_RS26830_Klebsiella_pneumoniae_556477177 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQNNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- XALC_0202_Xanthomonas_albilineans_GPE_PC73_283472241 MS-QYV----D--W-------------Y--G-------------KGRAQNWRTPQSIFDALHDEFQ-FTLDGASE---PGNGLLPLAST------A-DEQI----DW-------------------------T--G--H--------RVFCNPPWSN----------------------------IRPFLERAP---AA------DC---AVFLVPARTNAKWFH-RAI-----D-L--G-A--A---------VRFFE----------GRPKF-E-LP-HR-----SGP-GNSSP--VDC-LLLIL------------------------------------------R-K--D-VA--------------R---------------EVQ--G----------------------------------------------------------------------------------------------------------------- P244_RS19660_Klebsiella_pneumoniae_746037254 
MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LTE---LKRIGN--LEAA--------------------------------------------------------------------------------------- KPST82_RS05430_Klebsiella_pneumoniae_763385574 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYIT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- KPHS_12370_Klebsiella_pneumoniae_504108903 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYIT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LTE---LKRIGN--LEAA--------------------------------------------------------------------------------------- IO43_07585_Gallibacterium_anatis_7990_703617381 
-------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KV-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----Q-N--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------ IO43_RS07410_Gallibacterium_anatis_746082790 ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KV-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----Q-N--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------ TB82_RS16260_Klebsiella_pneumoniae_749548111 MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASIT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- HMPREF1307_RS05885_Klebsiella_pneumoniae_490281197 
MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RKSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- F403_gp088_Enterobacteria_phage_vB_KleM-RaK2_422937337 KH------A-V-------------------H-FS----------TRKNDLWTTPKPLFDKLNALWN-FTVDVACS---NETALCLKHYT-PE-----DDGL--SQDW-------------------------S--N--E--------TFWLNPPYSD----------------------------LSPWLSKSV---ED-YN-R-GA-T-GLILVPARTDTRAFQ-NFA-----S-PFCD----A---------MCFIK----------GRLKFGNPL-------------KPNDK--LTS-A------------------------------------------------P--F-PS----C---------I----IVLD-------KNL-----------TQA---KIDCLK--------------------------SLGNTMV--N----I---------------------------------------------------- XA43_RS13245_Escherichia_coli_817696779 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SREQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- nADLYRO1b_RS11635_Yersinia_ruckeri_740410430 
MS-DYG--G-S-------------------H-TP----------DNLKDLWQTPNDIFAALDLEFG-FYLDAAAS---HQSALCARYLT-ER-----DDAL--NCEW-------------------------I--S--Y-----G--AIWCNPPYSN----------------------------ITPWVQKAA---EQ-CREQ-NQ-I-VVMLIPADTSTGWFS-LAL-----E-S--V-D--E---------VRLIT-----G----GRLSF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CTFT-------IVK-----------RDE---LKAIGQ-------------------------EILTGSKT--A--------------------------------------------------------- B4086_RS03845_Bacillus_cereus_822506548 ------------------------------MTIN-K--GMF---TSKTDLWATPQYFFDELHKEFN-FELDVCAL---EDNAKCEKYFT-PE-----MDGL--KQEW-------------------------N--G-----------TCWMNPPYGR-G--------------------------IGKWVQKAY---ES-SL-T-GS-T-VVCLLPARTDTRWWH-DYC-M---N-G-------E---------IRLVK----------GRLKF-------------GDS-KNSAP--FPN-AVVIF------------------------------------------G-E--K-AK--------------K----HTLI-------AM--------------------------------------------------------------------------------------------------------------------- L383_01094_Enterobacter_sp_MGH_37_578289375 NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFTSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- L383_01874_Enterobacter_sp_MGH_37_578286731 
NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFTSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSE----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLASREA--A--------------------------------------------------------- AIA83183.1_Podovirus_Lau218_643012085 MK----------------------------N-RN----------IKHSDNWATPKELYNELDKEFN-FDFDPC-----PLNSSVDGL----------DEDL----SW-------------------------G--K-----------SNFVNPPYSL-K-------------------L------KTDFVKRAV-K-EK-HK---GN-T-CILLLPVSTSTKLFH-EDI-L---P-N--A-D--D---------IRFLK----------GRVKF---IG-TN-T-K-GVL-VSNKCGMHDT-MVVIF------------------------------------------K-G----KR--------------K-------------------------------------------------------------------------------------------------------------------------------------- AB186_07590_Klebsiella_pneumoniae_828953686 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RMSNP--KGS-ILFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGT--LEAA--------------------------------------------------------------------------------------- U074_RS0104770_Escherichia_coli_657257859 
MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RRSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- L415_RS13205_Klebsiella_pneumoniae_556400945 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGT--LEAA--------------------------------------------------------------------------------------- AB07_0778_Escherichia_coli_5-172-05_S1_C1_660087059 NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-KI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LVAFGA-------------------------KLLARREA--A--------------------------------------------------------- A965_RS0108215_Enterobacter_cloacae_648328174 
NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--EMPW-------------------------A--D--CLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---DKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- L964_RS00605_Leuconostoc_pseudomesenteroides_491052808 ---------------------------M--N-SK----ALF---SSKSMVWETPKDYFDKLNRKFK-FDLDACAS---DTNHKVDTYFT-ED-----DDAL--EQKW-------------------------G--G-----------NVFMNPPYGR-H--------------------------IGEFIKKAY---EE-HL-RDPN-RFIVMLIPSRTDTKYWH-EYI-Q---D-K--A----T---------VKFIK----------GRLKF-E-LD--------GRP-MNTAP--FPS-ALIIY-----------------------------------------------G-L------------------------------------------------------------------------------------------------------------------------------------------------------ JP28_09585_Gallibacterium_anatis_702415297 -------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------ ABR28_RS04400_Enterobacter_sp_GN02358_829891415 
MT-DYT--G-S-------------------N-TP----------EDQRDLWRTPPALFAALNAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--GTPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNS--KGS-ILIIW------------------------------------------R-P--Y-PR----T---------H----CEFT-------TVE-----------RDV---LMEFGT-------------------------KLLARREA--A--------------------------------------------------------- L365_RS11955_Klebsiella_pneumoniae_556494180 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCDHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- JL04_RS05590_Gallibacterium_anatis_746010315 ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--N--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------ ASUC_RS06415_Actinobacillus_succinogenes_501020288 
MS--------------K----------F--D----------------KDTYPTPLSLFLPLDAEFN-FTLDGAAL---PNNAKCDRYVT-PE-----MDFL--TYQL-----------------------Q-N--E-----------RIFINPPFSD----------------------------PLSFIKRSI---EL-FEYY-NC-L-VVMLLPVDISTEWFS-LIT-----R-K--A-T--E---------IRFIV--G-------GRIKF---VS-PE-T---GD--WTDVC--RGN-HLAIF------------------------------------------D-P--R-HR----N-----M---G----QVIR-------NIH-----------------IDDLGK--FE--------W------------RVNSRKRK--P--------------------------------------------------------- ABR33_RS14555_Enterobacter_sp_GN02548_829940915 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-IA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- RM98_RS09640_Chromobacterium_violaceum_759929100 MA--------D----------------Q----AE-N--IHF---RSGRDDWETPHDLFASLNAEFG-FTVDVCAS---EKTAKCPRYYT-PA-----MNGL--AQDW-------------------------G--GE----------TVWMNPPFGH-V--------------------------TKRWMDKAR---LS-SL-Q-GA-T-VVCLVPARTSVLWWH-RNV-FL--A-S-------E---------VRCIR----------PRLQF-------------VGA-AQKAP--FDA-VLVVF-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- H922_RS0124235_Citrobacter_freundii_696358583 
SG-DYG--G-S-------------------K-TP----------PDQRDLWRTPPALFASLNAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------G--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- HK32_RS26080_Klebsiella_pneumoniae_523682820 MT-DYV--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------A--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RMSNP--KGS-ILFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEA---------------------------------------------------------------------------------------- J479_2646_Acinetobacter_baumannii_691127129 MS--------T----------------M----AK-L--GLFGNAEGRTDVWATPQTLFDALDQVFN-FDLDVCAL---PENAKCERFFT-PE-----IDGL--KQEW-------------------------T--G-----------TCWMNPPYGR-E--------------------------ITLWIDKAV---QT-AN-Q-GH-T-VVGLLPARTDVTWWQ-EHV-M---N-R-------E---------IHYIK----------GRLKF-------------GGC-KHNAP--FGC-AVVVF------------------------------------------R-P--S-LK--------------D----VQWG-------AQ--------------------------------------------------------------------------------------------------------------------- ABR28_RS09465_Enterobacter_sp_GN02358_829892043 
MT-DYT--G-S-------------------N-TP----------AEQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETSW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SI-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-N--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLSRREA--A--------------------------------------------------------- AW35_RS0117640_Klebsiella_pneumoniae_657698125 MT-DYV--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------A--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RMSNP--KGS-ILFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- L466_01900_Enterobacter_sp_BIDMC_30_578249703 NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- AB07_RS0123005_Escherichia_coli_696361303 
NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-KI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LVAFGA-------------------------KLLARREA--A--------------------------------------------------------- TA98_RS18955_Klebsiella_pneumoniae_749548558 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYIT-EL-----DDSL--NSEW-------------------------S--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- PROVALCAL_RS03515_Providencia_alcalifaciens_493708059 MA-VYA----S-------------------H-TA----------PADKDCYQTPQWLFEAMTAEFG-FWLDVAAS---KQNALCVDFFT-QE-----QDAL--KQEW-------------------------F--S--K-----G--AIWCNPPYSN----------------------------IKPWVEKAA---EQ-YLEQ-NQ-P-IVMLVPEDKSTSWFS-LAL-----K-S--V-D--E---------IRVVI-----D----GRINF---VD-PT-T---GKE-KRGNN--KGS-MFLIW------------------------------------------R-P--F-TE----P---------K----RVTT-------HVS-----------KKR---LMEIGY-------------------------SILGVA-A------------------------------------------------------------ C243_RS0119615_gamma_proteobacterium_WG36_516062979 
MKSDYIGAGQS-------------------Q-TP----------AEHKDRWQTPVEIFDALDLEFG-FYLDAAAD---LSNALCSHYLT-EY-----DDSL--SCDW-------------------------E--S--Y-----G--AIWCNPPYSA----------------------------VTPWVSKAA---EQ-CKAQ-NQ-P-VVMLLPADTSTGWFS-EAL-----K-T--V-D--E---------VRFIT-----D----GRIGF---IN-AG-T---GKPGKSGNS--KGS-MLFIW------------------------------------------R-P--F-IK----P---------R----CMFT-------TIS-----------RDD---LIVIGS-------------------------EV-RGVSA--A--------------------------------------------------------- SS16_RS19255_Enterobacter_cloacae_779858102 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--QTPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---TE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-S---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVG-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- TB70_RS08620_Klebsiella_pneumoniae_694081399 MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--C-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- GGE_RS10915_Haemophilus_haemolyticus_763376104 
---------------------------MT------EQ-------QFDKDTWQTPRYVFEWLSQRFGWFDLDGCAT---ANNALTWRYIGEPNSDNDEHQSI--ADDF-LM--PIEQMLDVLLDEVAERC-L-D--P--L--------RIYVNPPYSN----------------------------VTPYLQRAK---EL-RD-A-GY-L-VVMLLNNDKSTQWYQNHIQ-----G-V--A-N--E-------V-IDITG----------GRIAF---IN-PV-T---GKE-IKGNS--KGQ-MVVVF------------------------------------------D-P--T-ME--------------D----YVTR-------SIS-----------LDF---IKKVGG-----------------YS-K------------------------------------------------------------------------ UO85_RS18470_Enterobacter_cloacae_770797848 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------S--D--YLS-IPG--YVWLNPPYSN----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- KPR_RS15135_Klebsiella_pneumoniae_529980423 MT-DYG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYIT-EL-----DDSL--NSEW-------------------------T--S--C-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- GRPL_RS04760_Raoultella_planticola_695777676 
MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSDW-------------------------T--S--Y-----G--SIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--F-IS----P---------R----HIIT-------SVS-----------LAE---LKRIGT--LEEA--------------------------------------------------------------------------------------- AAX16_RS03635_Haemophilus_haemolyticus_822519471 ---------------------------MT------EQ-------KFDKDTWQTPHYVFEWLSQRFGWFDLDGCAT---ANNALTWRYIGEPNSDNDEHQSI--ADDF-LM--PIEQMLDVLLDEVAERC-S-A--P--L--------RIYVNPPYSN----------------------------VTPYLQRAK---EL-RD-A-GY-L-VVMLLNNDKSTQWYQNHIQ-----G-V--A-N--E-------V-IDITG----------GRIAF---IN-PV-T---GKE-IKGNS--KGQ-MVVVF------------------------------------------D-P--A-ME--------------D----FVTR-------SIS-----------LDF---IKKVGG-----------------YD-G--A--------------------------------------------------------------------- L383_RS0123770_Enterobacter_sp_MGH_37_695674014 ---DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFTSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- UA70_28325_Raoultella_planticola_767053612 
-----------------------------------------------RDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---EQ-SRAQ-YQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASIT---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--F-IT----P---------R----HIIT-------SVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- AF41_RS16340_Citrobacter_sp_MGH_55_757783319 SG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------V--D--YLS-IPG--YVWLNPPYSD----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGV-------------------------KLLARREA--A--------------------------------------------------------- SS39_RS14470_Enterobacter_asburiae_779796092 NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPAIFVSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------IMPFVKKAA---SE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- JP46_RS0122155_Enterobacter_cloacae_692191212 
HS-GYG--G-S-------------------N-TP----------AEQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNTLCRKFIT-AE-----QDTL--ETPW-------------------------A--D--YLT-IPG--YAWLNPPYSD----------------------------ITPFVKKAA---AE-SK-N-QI-G-TVMLVPADTSVGWFR-EAI-----E-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--F-PR----T---------H----CEST-------FVE-----------RDV---LMTFGA-------------------------KLLARREA--A--------------------------------------------------------- T636_A2961_Enterobacter_cloacae_MRSN_11489_728967019 MT-DFT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--QTPW-------------------------A--D--YLN-VPG--SVWLNPPYSD----------------------------ISPFVKKAA---SE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- AAV10_RS05315_Enterobacter_cloacae_complex_695626939 MT-DFT--G-S-------------------N-TP----------ADQRDLWRTPPVLFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---TE-SA-N-QI-G-TVMLVPADTSVGWFN-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- L371_RS0123590_Enterobacter_sp_MGH_25_695714549 
---DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFTSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSE----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLASREA--A--------------------------------------------------------- ESMG_RS20740_Escherichia_coli_446051427 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAN---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- ABR38_RS19795_Enterobacter_sp_GN02825_829773120 NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFTSLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSE----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLASREA--A--------------------------------------------------------- G468_RS0102435_Arsenophonus_nasoniae_652426387 
LI--------S-------------------H-TP----------KPFKDRWRTPIEVFRALDAEFN-FKLDAAAD---KNNALCKAFLT-EQ-----QDAL--TCDW-------------------------N--S--N-----G--AIFCNAPYSK----------------------------IMPWVKKAA---EQ-CRKQ-NQ-T-IVMLLPSDTSTAWFY-EGL-----N-T--A-D--E---------IRFIT-----E----GRLSF---VS-AE-T---GEQGISGNS--KGS-V------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ L466_RS0122390_Enterobacter_sp_BIDMC_30_695758762 NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- AF41_03280_Citrobacter_sp_MGH_55_635724739 SG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------V--D--YLS-IPG--YVWLNPPYSD----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGV-------------------------KLLARREA--A--------------------------------------------------------- SG64_RS05740_Enterobacter_cloacae_798869855 
SG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YMS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNS--KGS-ILIIW------------------------------------------R-P--Y-PR----T---------H----CEFT-------TVE-----------RDV---LMEFGS-------------------------KLLARREA--E--------------------------------------------------------- L349_RS23330_Enterobacter_cloacae_complex_550795579 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFY-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- ABR37_RS08920_Enterobacter_sp_GN02768_829838493 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- SG82_RS22470_Enterobacter_cloacae_798866512 
MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGT-------------------------KLLARREA--A--------------------------------------------------------- KU61_RS02975_Enterobacter_cloacae_704505064 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFR-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- SGP1_RS10390_Sodalis_glossinidius_499730277 IV--------S-------------------Q-TP----------KACKDRWQTPVEIFRALDAEFR-FCLDAAAN---HDNTLCRCYLT-EE-----DDAL--SCDW-------------------------Y--T--R-----G--AIFCNPPYSN----------------------------ITPWVRKAA---EQ-CVVQ-QQ-T-IVMLLPSDTSTGWFR-LGL-----E-S--V-D--E---------VRVIT-----G----GRLSF---IS-AA-T---GVCGKNGNS--KGS-LLFIW------------------------------------------R-P--F-FK----N---------R----CQFT-------TVD-----------KSD---LIRIGT-------EVVR----KVAA-------------------------------------------------------------------------- QQ39_06370_Pragia_fontium_827401593 
MS-DFG--G-S-------------------N-TP----------AELKDRWQTPDNIFHALDAEFG-FYLDAASE---PHNALCSRFLT-SA-----DDSL--SCDW-------------------------G--S--Y-----G--SIWCNPPYSN----------------------------ITPWIVKAA---EQ-CKKQ-RQ-P-IVMLLPADTSTGWFS-LAL-----K-S--V-D--E---------IRIVT-----D----GRIQF---IN-AG-T---GKKGKNGPG--KGN-LFLIW------------------------------------------R-P--F-IK----P---------R----CQFT-------TIS-----------RDE---LIGIGE-------------------------SILEGVKT--A--------------------------------------------------------- RN16_RS04075_Chromobacterium_subtsugae_759887196 MA--------D----------------L----SE-Q--IHF---SSKTDEWPTPQALFDQLHAEFG-FTLDVCAT---QENAKCERFFT-RE-----QDGL--AQDW-------------------------S--RE----------VVWMNPPFGH-Q--------------------------IKLWMAKAY---RS-SI-D-GA-L-VVCLVPARTDTRWFH-RHA-LK--A-A-------E---------IRALD----------KRLRF-------------DGA-KAKAP--FPA-VLVVY-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- SS33_RS24310_Enterobacter_sp_35730_772624651 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PS----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLASREA--A--------------------------------------------------------- eiDWFOrf6_Edwardsiella_phage_eiDWF_318064446 
MS-GYH--D-S-------------------K-TA----------PEDKDCWRTPPEVFRYAVRTWGAFEIDAAAA---DHNHLVADYWT-LA-----DNAL--VQDW-------------------------S--G--K--------RVWCNPPYSD----------------------------IGPWVEKAA---TA-EF--------CVMLVPADTSVKWFA-TAG-----E-L--G-A--S---------VIFIT-----R----GRLRF---IH-NA-T---GKP-GPSNK--MGS-CFLVF------------------------------------------G-G--S-RP----G---------R------VD-------FVT-----------RAG---VYQIGA----------------------RR-KVTVKRRV-----------RAPHNAT------------------------------------------ KV31_RS01780_Enterobacter_cloacae_complex_692189073 RT-GYG--G-S-------------------H-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-SE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- MYSTI_RS09680_Myxococcus_stipitatus_505160458 --------------------------------MN-P--VHF---SSASAEWATPRDLFARLHAEHE-FTLDVCAT---EENTVLPRFYT-RN-----DNGL--AQDW-------------------------A--G--E--------RCWMNPPYGT-AKHACKPDCAKKACEKRGQHIPEYVPGIQDWVEKAA---TC------GS-L-VVALLPARTDTRWWH-RHI-W---D-V-------DRDAPRPGVRVKFFR----------GRLKF-------------GGR-KTGAP--FPS-ALVTF-----------------------------------------------G-VQ--------------S-------------------------------------------------------------------------------------------------------------------------------------- SS49_RS01825_Enterobacter_cloacae_779812391 
MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------G--D--YLN-VPG--YVWLNPPYSD----------------------------IMPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVG-----------RDE---LMAFGA-------------------------KLIARREA--A--------------------------------------------------------- OOC_RS12555_Providencia_rettgeri_491050038 MA-VYS----S-------------------N-TA----------PEDKDCWQTPQWLFEALTLEFG-FWLDAAAN---EQNALCPYFLT-IE-----QNAL--QSDW-------------------------V--S--R-----G--AIWCNPPYSK----------------------------IKPWIAKAA---EQ-CTKQ-NQ-P-IVMLLPADKSTSWYS-LAL-----K-S--V-D--E---------VRTII-----D----GRINF---VD-PN-T---GKE-KKGNS--KGS-ILLIW------------------------------------------R-P--F-VE----P---------K----AIGT-------HIS-----------KNR---LMEIGN-------------------------AILGVA-A------------------------------------------------------------ JP33_05990_Gallibacterium_anatis_CCM5995_702395340 -------------------M-------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--H--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----K-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QVS-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------ JP33_RS05860_Gallibacterium_anatis_746094489 
---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--VEDF-LTFDPLD-----LIAELE----F-S--H--F--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----K-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QVS-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------ UMN179_RS11845_Gallibacterium_anatis_503512608 ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--AEDF-LTFDPLD-----LIEVLE----F-P--H--V--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KA-Q-GF-L-VVMLLPADKSTAWYK-VIE-----E-K--A-T--E-------V-IDITGYYDEKGRWKNGRISF---LH-PT-E---NVE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QVS-----------LDF---VMKCGG-----------------YG-N------------------------------------------------------------------------ SR65_RS01890_Enterobacter_asburiae_779958989 NG-DYG--G-S-------------------K-TP----------IDQRDLWRTPPALFASLDSEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETLW-------------------------A--D--YLS-IPG--YVWLNPPYSE----------------------------IMPFVKKAA---SE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RNE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- dam_Yersinia_phage_PY100_164414537 
KR-DFG--G-S-------------------T-TP----------KDIRDLWATPQWLFDYFNEIYK-FDLDAAAN---DINHKCDNYLT-LE-----NDGIVEEHEW-------------------------I--C--E-----S--AVWCNPPYSD----------------------------PQPWIEKAI---NE-SS-L-GV-L-SVMLLPCDPSTEWFH-LAS-----K-S--A-S--K---------IYILT-----G----GRVQF---VR-AD-T---GEE-QRGNP--KGS-VLFVF------------------------------------------D-P--N-DG----D---------Q-------E-------TIY-----------LP----IWEAGG--KEPR--------------WF---KSWTLKEE--E--------E------------------------------------------------ EB105725_RS05190_Shimwellia_blattae_488371093 MS-DYG--G-S-------------------K-TP----------VPERDLWQTPASIFTALDIEFG-FYLDVAAA---PHNALCARFMT-EH-----EDAL--NSDW-------------------------S--S--Y-----G--AIWCNPPYSD----------------------------ITPWIRKAA---EQ-CQKQ-HQ-T-VVMLLPADISTGWFS-LAL-----Q-T--V-D--E---------IRLIT-----N----GRIQF---VP-ASVS---GK--RQSNP--KGS-LLFIW------------------------------------------R-P--F-IS----P---------R----GIFT-------TVS-----------KPA---LEDAGQQYLDEV-AA------------------------------------------------------------------------------------ BU34_RS16325_Escherichia_coli_643945869 MT-DFG--G-S-------------------K-TP----------KNERDYWQTPIEIFNALDREFG-FWLDAAAS---ESNALCAHYLT-EL-----DDSL--NSEW-------------------------T--S--Y-----G--AIWCNPPYSD----------------------------IGPWVEKAA---KQ-SRAQ-SQ-A-VVMLLPADISTGWFI-SAM-----Q-S--A-D--E---------LRLIT-----G----GRVQF---VP-ASVT---GK--RRSNP--KGS-LLFIW------------------------------------------R-P--Y-IT----P---------R----HIIT-------TVS-----------LAE---LKRIGN--LEAA--------------------------------------------------------------------------------------- ECCZ_RS00820_Escherichia_coli_559190709 
MT-DFT--G-S-------------------N-TP----------AEHRDSWCTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LIAFGS-------------------------RILARREA--A--------------------------------------------------------- SS08_RS15735_Enterobacter_cloacae_749204695 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWMNPPYSD----------------------------ITPFVNKAA---TE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVG-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- ERIG_RS16875_Escherichia_fergusonii_446051425 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFD-FQLDAAAN---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- VP22_RS12395_Escherichia_fergusonii_803565941 
MT-DFT--G-S-------------------N-TP----------AEHRDSWCTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKP-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- HMPREF0731_RS06220_Roseomonas_cervicalis_750330482 ---------------------------M--N-TA-A----F---SSAYEAWATPPDLLERLYAAVGSIDLDPCSPGKLRSRVKAPRHFT-ER-----DDGL--AQEW-------------------------S--G-----------KVYMNPPYGR-T--------------------------IGAWTTKAR-V-EV-TAGR-AE-C-VVGLVPARTDTRWWH-ADV-A---G-H--A----H---------VWLLK----------GRLAF-------------GDG-STPAP--FPS-ALLLW---------------------GGN------------------A-P----------------------------------------------------------------------------------------------------------------------------------------------------------- AB28_RS19280_Escherichia_coli_695802868 MT-DFT--G-S-------------------K-TP----------VEQRNLWQTPIPLFVALDAEFC-LTLDAAAS---TDNALCNRYIT-EE-----QNTL--TTPW-------------------------A--D--FLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-ST-N-QI-G-TVMLVPADTSVGWFR-EAI-----E-T--A-S--E---------VRFIV-----G----GRLAF---IN-PV-S---GKP-VSDNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CQFT-------TVE-----------RDA---LLSFGA-------------------------RLIAKREA--A--------------------------------------------------------- G869_RS17520_Escherichia_coli_486132694 
MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--D---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNS--KGS-ILIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- SEEB0179_06810_Salmonella_enterica_555260527 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPAIFAALDAEFC-FQLDTAAA---PHNALCRRFIT-EE-----QNTL--VTPW-------------------------A--D--YMS-IPG--HVWMNPPYSD----------------------------IMPFVKKAA---AE-SK-N-QI-G-TVMLVPSDTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLLIW------------------------------------------R-P--Y-PR----T---------Q----CDLT-------TVE-----------RDV---LIEFGS-------------------------ARLARREA--A--------------------------------------------------------- L743_RS07620_Serratia_marcescens_742394131 MSAVFA----S-------------------N-TP----------PEHKDRWQTPIEVFNALDVEFG-FFLDAAAD---DGNALCAHYQT-EQ-----DNAL--SIDW-------------------------V--S--Y-----G--AIWCNPPYSD----------------------------ITPWVIKAA---EQ-CHVQ-NQ-P-IVMLLPADTSTGWFS-LAL-----Q-S--V-D--E---------VRFIT-----D----GRLAF---IN-SA-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CQFS-------TIS-----------RDE---LLRIGK-------------------------GIAMEVSV--A--------------------------------------------------------- US97_C0007G0018_Microgenomates_bacterium_GW2011_GWF1_38_5_818397581 
MD--------A---------------------------VLF---SRKSDEWTTPEATYVGLDAEFH-FTDDPCPL---GA-----------------TDGL--EREW-------------------------K--G-----------SVYVNPPYSK----------------------------IAAFVEKAIQELDE-GH---AH-T-VVFLVPSRTDTRWFH-RYV-L---G-R--G-G--E---------IRFIK----------GRLKF---------S---GS--KNSAP--FPS-MIVIW------------RDRKMGI-------------------------PD-Y-EE----R-L-------D----EVFT-G-AF--TTN-----------HPA---MIDWVQ--VKME-----------LK------RLRNNRRE------------------------------------------------------------ SPM24T3_RS16925_Serratia_sp_M24T3_497323801 MKSDHLGL--S-------------------S-TP----------AEHKDRWQTPVEIFDALDLEFG-FYLDAAAD---QSNALCSHYLT-EQ-----DDSL--SCEW-------------------------T--S--H-----G--AIWCNPPYSA----------------------------PPPWVAKAA---EQ-CRIQ-KQ-P-VVMLMPADTSTGWFS-EAL-----K-T--V-D--E---------VRFIT-----D----GRIGF---IN-AG-T---GKPGKNGNS--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CMFT-------TVS-----------RDE---LLVIGS-------------------------EV-RGVSA--A--------------------------------------------------------- AAX16_RS08715_Haemophilus_haemolyticus_822520202 ---------------------------MT------EQ-------QFDKDTWQTPKYVFNWLEIKCGSFDVDGCAS---SENALCKEYID---------------SDF--------DFLTCSMRGFQNCCEK-E--N--L--------KIYVNPPYSD----------------------------VTPFLIRAK---EL-RD-A-GH-L-VVMLLNNDKSTQWYQNHIH-----N-V--A-N--E-------V-IDITG----------GRIAF---IN-PV-T---GKE-IKGNS--KGQ-MVVVF------------------------------------------D-P--T-ME--------------D----FVTR-------SIS-----------LDF---IKKVGG-----------------YI-HVEK--------------------------------------------------------------------- P833_RS20130_Enterobacter_cloacae_695744722 
MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFR-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLS-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRLIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- CH09_gp15_Edwardsiella_phage_eiAU-183_589889277 MS-GYH--D-S-------------------K-TA----------PEDKDCWRTPPEVFRYAVRTWGSFEIDAAAA---DHNHLVADYWT-LA-----DNAL--VQDW-------------------------S--G--K--------RVWCNPPYSD----------------------------IGPWVEKAA---TA-EF--------CVMLVPADTSVKWFA-TAG-----E-L--G-A--S---------VIFIT-----R----GRLRF---IH-NA-T---GKP-GPSNK--MGS-CFLVF------------------------------------------G-G--S-RP----G---------R------VD-------FVT-----------RAG---VYQIGA----------------------RR-KVTVKRRV-----------RAPHNAT------------------------------------------ RM98_RS18265_Chromobacterium_violaceum_759932528 MA--------D----------------L----SE-Q--VHF---SSKTDEWPTPQALFDQLHEEFG-FTLDVCAT---AENAKCERFFT-RE-----QDGL--AQDW-------------------------S--RD----------VVWMNPPFGH-Q--------------------------IKLWMAKAY---RS-SI-D-GA-L-VVCLVPARTDTRWFH-RHA-LK--A-A-------E---------IRALD----------KRLRF-------------DGA-KAKAP--FPA-VLVVY-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- ABF72_01710_Enterobacter_cloacae_829343115 
NG-DYG--G-S-------------------K-TP----------LDQRDLWRTPPALFAALDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SA-N-HI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- ETAC_RS11795_Edwardsiella_piscicida_505274905 MS--VK----S-------------------N-TP----------AEAKDCWQTPLWLFDALDLEFG-FWLDAAAS---ESNALCVKYLT-EV-----DNAL--GCEW-------------------------E--S--A-----G--AIWCNPPYSK----------------------------IGPWVAKAA---EQ-SARQ-IQ-T-VVMLVPEDMSVGWFS-EAL-----K-T--V-D--E---------VRVIT-----G----GRVNF---VH-AV-T---GAE-QKGNS--KGS-MLLIW------------------------------------------R-P--F-TT----P---------L----HRIT-------TVS-----------KSM---LEAIGR-------------------------PVRSAA-------------------------------------------------------------- IO48_RS08405_Gallibacterium_anatis_746097630 ---------------------------M----------------EFDKDCYRTPKYVFNWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--AEDF-LTFDPLD-----LIEVLE----F-P--H--V--------TIFVNPPYSN----------------------------PLPFVKRAA---EL-KK-W-GF-L-VVMLLPADKSTKWYQ-VIQ-----E-S--A-T--E-------V-IDIVG----------GRISF---LH-PL-T---GEE-VKGNN--KGS-MIAVF------------------------------------------D-P--T-MQ--------------D----FVIR-------QAN-----------LDF---VKMCGG-----------------YG-S------------------------------------------------------------------------ ETAF_RS16405_Edwardsiella_tarda_504339421 
MS--IK----S-------------------N-TP----------AEAKDCWQTPLWLFDALDLEFG-FWLDAAAS---ESNALCVKYLT-EV-----DNAL--GCEW-------------------------E--S--A-----G--AIWCNPPYSK----------------------------IGPWVAKAA---EQ-SARQ-IQ-T-VVMLVPEDMSVGWFS-EAL-----K-T--V-D--E---------VRVIT-----G----GRVNF---VH-AV-T---GAE-QKGNS--KGS-MLLIW------------------------------------------R-P--F-TT----P---------L----HRIT-------TVS-----------KSM---LEAIGR-------------------------PVRSAA-------------------------------------------------------------- ABF80_10955_Enterobacter_cloacae_829278978 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLR-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---TE-SD-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARRES--A--------------------------------------------------------- BGIM_RS49305_Zavarzinella_formosa_750606593 MN----------------------------D-QH----------KEIRGCWRTSPAVFNKLEGIFG-FTIDACAD---RDNHLLPRYWT-EE-----DDAL--TQDW-----------------------S-E--E-----------RVFCNPPF---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- L467_RS08095_Klebsiella_pneumoniae_694065919 
RH------G-A-------------------K-IT----------ETGSDDWQTPRVIYEALNKRFK-FTRDAAAT---KQNSHCARYWT-KE-----DDAL--LMDW-------------------------S--Q--E-------KSIFCNPPYSK----------------------------VAEFLAKAH---EP-------E-T-AVFLIPFRPQTGFFL-QFV-----W-A--SPYLHE---------MMIIH----------RGIRF---I-------------HPDRV--ESVRS------------------------------------------------P--M-PV----V---------V----LVYR-------NKP-----------RKR-D-LLITVN-------------------------CADSLHTL--H----VVAGQRPGHPLEHGHSIRNKIIQEYQRGATVAELVRKYEGKVSRRSIYRWVKG HICON_RS06920_Haemophilus_influenzae_503292971 ---------------------------MT------GQ-------QFDKDTWQTPHYVFEWLSQRFGLFDLDGCAT---ANNALTCHYIGEPNSDNDEHQSI--ADDF-LM--PIEQMLDVLLDEVAERC-S-A--P--L--------RIYVNPPYSN----------------------------VTPYLQRAK---EL-CD-A-GY-L-VVMLLNNDKSTQWYQNHIQ-----G-V--A-N--E-------V-IDITG----------GRIAF---IN-PV-T---GKE-IKGNS--KGQ-MVVVF------------------------------------------D-P--T-ME--------------D----FVTR-------SIS-----------LDF---IKKIGG-----------------YS-K------------------------------------------------------------------------ SS28_RS07355_Enterobacter_cloacae_779872478 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWMNPPYSD----------------------------ITPFVNKAA---TE-SA-N-QI-G-AVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------AVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- L463_02765_Enterobacter_sp_BIDMC_27_578260023 
RT-GYG--G-S-------------------H-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-SE-----QNTL--ETPW-------------------------A--D--YLN-VPG--YVWLNPPYSD----------------------------ITPFVKKAA---AE-SN-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- JP31_RS04730_Gallibacterium_anatis_746070689 ME-------------------------F--D----------------KDAYPTPISLFNQINDEFN-FTIDGAAL---PHNTKCERYIT-PE-----MDFL--KYPL-----------------------V-N--E-----------RIWINPPFSE----------------------------PLSFVKRAV---EL-YENH-DC-L-VVMLLPVDISTKWFS-LVA-----E-K--A-T--E---------IRFIV--G-------GRIKF---LN-PE-T---DK--WTDVC--RGN-HLAIF------------------------------------------N-P--A-HK----S-----M---G----QAIR-------HIH-----------------ISRFKN--LE--------W------------R------------------------------------------------------------------- HMPREF1315_RS07015_Bifidobacterium_longum_494116860 AS--------NFYK------------------AG-A--AAM---TSNKDDWETPQSLFDQLDEEFH-FILDAASS---DQNAKCEHHYT-AE-----NSGL--EHSW-------------------------E--G--E--------TVFCNPPYGR-N--------------------------IGDWIRKAS---QE-AS-KPDT-L-VVLLVPARTDTRWFQ-NHI-L---H-R--A----E---------VRFLP----------GRLKY-E-VN--------GQA-GEAAP--------SFW------------------------------------------R-E--G-TP--S-F------------------------------------------------------------------------------------------------------------------------------------------------ DJ57_RS06970_Yersinia_kristensenii_740850846 
MS-DFG--G-S-------------------N-TP----------DNLKDLWMTPADIFTALDIEFG-FYLDAAAS---NKSALCARYLT-EQ-----DDAL--NSAW-------------------------E--S--Y-----G--AIWCNPPYSD----------------------------ISPWVTKAT---EQ-CKQQ-LQ-T-VVMLVPADSSVGWFS-QAL-----Q-S--V-D--E---------VRFIT-----D----GRISF---LR-SD-T---GKP-INGNN--KGS-LLFIW------------------------------------------R-P--F-IK----P---------R----CMFT-------RVK-----------RDE---LKAIGQ-------------------------EILTGSKA--A--------------------------------------------------------- KS43_RS19035_Pectobacterium_carotovorum_746454692 MT--LK----S-------------------N-TS----------ADDKDRWQTPLWLFDALDIEFG-FYLDVAAS---GKNALCANYLT-ES-----DDAL--NTDW-------------------------V--S--H-----G--SVWCNPPYSK----------------------------ITPWVEKAA---EQ-YRKQ-NR-N-VVMLIPEDMSVGWFS-LAL-----N-S--V-D--E---------VRVIT-----D----GRVNF---VE-PS-T---GME-KKGNS--KGS-MLLIW------------------------------------------R-P--F-TT----P---------R----RIIT-------TVS-----------KPL---LMNIGQ-------------------------GIRRAA-------------------------------------------------------------- L422_RS03620_Enterobacter_cloacae_556329507 MT-DYT--G-S-------------------N-TP----------ADQRDLWRTPPALFASLDAEFC-FQLDAAAA---PHNALCRKFIT-AE-----QNTL--ETPW-------------------------A--D--YLR-IPG--YVWLNPPYSD----------------------------ITPFVKKAA---TE-SD-N-QI-G-TVMLVPADTSVGWFK-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PV-T---GKP-VSGNN--KGS-MLIIW------------------------------------------R-P--Y-PR----T---------H----CHFA-------TVD-----------RDE---LMAFGA-------------------------KLLARREA--A--------------------------------------------------------- SU59_RS00780_Haemophilus_influenzae_756151459 
---------------------------MT------EQ-------QFDKDTWQTPCYVFEWLSQRFGLFDLDGCAT---ANNALTCHYIGEPNSDNDEHQSI--ADDF-LM--PIEQMLDVLLDEVAERC-S-D--P--L--------RIYVNPPYSN----------------------------VTPYLQRAK---EL-CD-A-GY-L-VVMLLNNDKSTQWYQNHIQ-----G-V--A-N--E-------V-IDITG----------GRIAF---IN-PV-T---GKE-IKGYS--KGQ-MVVVF------------------------------------------D-P--T-ME--------------D----FVTR-------SIS-----------LDF---IKKVGG-----------------YI-RMEK--------------------------------------------------------------------- PSNIH1_RS00725_Pantoea_sp_PSNIH1_746340387 MA-GYH--D-S-------------------H-TP----------IDIRDLWQTPPEIFAALNREFR-FVADVAAS---KLNHLLPAYLT-EQ-----DDAL--NQDW-------------------------A--A--QFP--IG--ITWCNPPYSD----------------------------ITPWVVKAT---EE-AR-K-GM-G-TVMLVPADTSVGWFS-AAR-----S-S--C-T--E---------VRFIT-----N----GRLSF---IR-AD-T---GKA-VNGNN--KGS-MLLIW------------------------------------------N-P--F-LS----Y---------F----GLTG-------YVS-----------RDA---LMSIGT-------------------------RLLLSAEK--V--SAA---------------------------------------------------- IE01_RS08420_Gallibacterium_anatis_517157190 ---------------------------M----------------SFDRDAYRTPKYVFKWLNSRFK-FDIDGCAT---EENNLSYHYIG--------KDGI--AEDF-LTFDPLD-----LIAVLE----F-C--N--F--------TIFVNPPYSN----------------------------PLPFVERAA---EL-KK-Q-GF-L-VAMLLPADKSTKWYQ-VIQ-----D-N--A-T--E-------V-IDIVG----------GRINF---LH-PE-T---GEE-VKGNN--KGS-LIAVF------------------------------------------D-P--T-MQ--------------G----FITR-------QVT-----------LDF---IKDVGG-----------------YG-I------------------------------------------------------------------------ NTHI477_RS07245_Haemophilus_influenzae_764356005 
---------------------------MT------EQ-------QFDKDTWQTPRYVFEWLSQRFGLFDLDGCAT---ANNALTCHYIGESNSDNDEHQSI--ADDF-LM--PIEQMLDVLLDEVAERC-S-D--P--L--------RIYVNPPYSN----------------------------VTPYLQRAK---EL-CD-A-GY-L-VVMLLNNDKSTQWYQNHIQ-----G-V--A-N--E-------V-IDITG----------GRIAF---IN-PV-T---GKE-IKGNS--KGQ-MVVVF------------------------------------------D-P--T-ME--------------D----FVTR-------SVS-----------LDF---VKKCGG-----------------YG-I------------------------------------------------------------------------ HMPREF0555_0745_Leuconostoc_mesenteroides_subsp_cremoris_ATCC_19254_227352467 ----------------------M----M--V-DK----VLF---SSNSMVWETPKDYFDKLNRKFK-FDLDACAS---DTNHKVDTYFT-ED-----DNAL--EQKW-------------------------G--G-----------NVFMNPPYGR-H--------------------------IGKFIKKAY---EE-HL-RDPN-RFIVMLIPSRTDTKYWH-EYI-Q---D-K--A----T---------VKFIK----------GRLKF-E-ID--------GES-MDAAP--FPS-ALVVY-----------------------------------------------G-F------------------------------------------------------------------------------------------------------------------------------------------------------ UA45_RS08225_Morganella_morganii_770844418 MKADYG--G-S-------------------T-TP----------KELRDLWQTPLPLFSALDAEFG-FYLDAAAD---KNNTLCSHYLT-EK-----DNAL--NSDW-------------------------Q--S--Y-----G--SIWCNPPYSD----------------------------IQPWVRKAA---EQ-CREQ-LQ-P-VVMLVPADTSVGWFK-SAL-----D-T--V-D--E---------VRFIT-----G----GRISF---IN-AG-T---DKS-KNGNT--KGS-MLLIW------------------------------------------R-P--F-TQ----P---------R----RIIT-------TVN-----------RDD---LMDIGN-------------------------RLLESQI------------------------------------------------------------- HMPREF0555_RS01180_Leuconostoc_mesenteroides_738135700 
---------------------------M--V-DK----VLF---SSNSMVWETPKDYFDKLNRKFK-FDLDACAS---DTNHKVDTYFT-ED-----DNAL--EQKW-------------------------G--G-----------NVFMNPPYGR-H--------------------------IGKFIKKAY---EE-HL-RDPN-RFIVMLIPSRTDTKYWH-EYI-Q---D-K--A----T---------VKFIK----------GRLKF-E-ID--------GES-MDAAP--FPS-ALVVY-----------------------------------------------G-F------------------------------------------------------------------------------------------------------------------------------------------------------ A15Y_RS20360_Escherichia_coli_486190610 MT-DFT--G-S-------------------N-TP----------AEHRDSWRTPPEIFAALNAEFV-FQLDAAAS---EKNRLCRLFIS-QE-----QNTL--TTSW-------------------------P--E--AMGYASG--YVWLNPPYSN----------------------------ISPFVKKAA---TE-NKFS-SV-G-CVMLLPADTSVGWFH-EAI-----Q-T--A-S--E---------VRFIT-----A----GRLAF---IN-PL-T---GKH-VSGNN--KGS-MLIIW------------------------------------------H-P--Y-PR----T---------H----CHFT-------TVD-----------RGE---LMAFGS-------------------------RILARREA--A--------------------------------------------------------- consensus/100% ..........................................................................................................h..........................................ahNsPa................................................................................................................................................................................................................................................................................................................................................................ 
consensus/95% ...............................................pp.a.TP..ha..Lp..a..F.lDssu......s.bs..ahs.........ssl.....a.........................................hahNPPYu................................ah.+u..................VhLlPsc.ss.aa....................p.........lphh...........GRl.F...................s.s..bs..h...h........................................................................................................................................................................................................ consensus/90% ...............................................ps.W.TP..hF..Ls..F..F.lDssA.....pN.bs..ahs.........ssL...p.a.........................................hahNPPYup...............................al.+Ah.........p......sVhLlPsc.ss.aa...h................-.........lchl...........GRl.F...................sss..bss.hlhla........................................................................................................................................................................................................ consensus/85% ...............................................p-.W.TP..hF..Ls.bF..F.LDssA.....pNsbC.paho.........ssL...p.W.........................................hahNPPYup............................l..alpKAh....p.s..p....s.sVhLlPsc.ss.aa...hh.......p.......E.........l+hlp..........GRl.F.............s....psss..bss.hlhla........................................................................................................................................................................................................ 
consensus/80% .............................................pppD.WpTP..hF..Ls.cFs.F.LDssA.....pNsbC.+aho........pssL...ppW.........................s..s............hahNPPYup............................l.salpKAh...pp.s..p....s.sVhLlPucpss.Wap.phh.......p.......E.........l+hlp..........GRl.F.............G....psss..bss.hlhla..........................................c.s........................................................................................................................................................... consensus/75% ................................s............pppD.WpTP..lF..Ls.cFs.FpLDssAs....pNAbC.+ahT..b.....pssL..pppW.........................s..s............lahNPPYup............................I.saVcKAh...pp.s..p....s.sVhLlPucsss.WFp.phh.....p.p..s....E.........l+hlp..........GRl.F.............G....psss..bsS.hlhla..........................................c.P.....p..................................................................................................................................................... consensus/70% ................................s............pppD.WpTP..lFs.Ls.EFs.FpLDssAs....pNAbC.+ahT..b.....pssL..pppW.........................s..s............lWhNPPYuc............................I.saVcKAh...pp.s..p.s..s.sVhLlPAcTss.WFp.phh.....p.p..u....E.........lRhIp..........GRL.F.............Gp...puss..bGS.hlhla..........................................+.P.....p.....................h...............................................................................................................................
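The gene-neighborhood entries in the table that follows use a compact arrow notation. As a reading aid, the minimal Python sketch below splits such a string into individual genes. The interpretation is an assumption inferred from the entries themselves and is not stated in the source: `<-label` is taken as a gene on the reverse strand, `label->` as a gene on the forward strand, `||` as the boundary between divergently oriented genes, a trailing `*` as the marker for the query gene, `+` as a join between domains of one protein, `?` as a protein with no annotated domain, and a leading run of digits followed by `_` as the GI number. The names `Gene` and `parse_neighborhood` are hypothetical helpers introduced only for illustration.

```python
import re
from dataclasses import dataclass
from typing import List, Optional

# Assumed notation: a gene is written "<-label" (reverse strand)
# or "label->" (forward strand); "||" and whitespace never occur inside a label.
_GENE = re.compile(r"<-(?P<rev>[^<>|\s]+)|(?P<fwd>[^<>|\s]+)->")
# Assumed notation: an optional numeric GI followed by "_" precedes the domain label.
_GI_PREFIX = re.compile(r"^(?P<gi>\d+)_(?P<rest>.+)$")


@dataclass
class Gene:
    strand: str          # "+" for "label->", "-" for "<-label"
    domains: List[str]   # domain names; "?" means no annotated domain
    gi: Optional[str]    # GI number if the label carries one, else None
    is_query: bool       # True if the label ends with "*"


def parse_neighborhood(text: str) -> List[Gene]:
    """Split one neighborhood string into genes, in the order written."""
    genes = []
    for match in _GENE.finditer(text.strip()):
        reverse = match.group("rev")
        label = reverse if reverse is not None else match.group("fwd")
        gi = None
        gi_match = _GI_PREFIX.match(label)
        if gi_match:
            gi, label = gi_match.group("gi"), gi_match.group("rest")
        is_query = label.endswith("*")
        label = label.rstrip("*")
        strand = "-" if reverse is not None else "+"
        genes.append(Gene(strand, label.split("+"), gi, is_query))
    return genes


if __name__ == "__main__":
    # Excerpt of a GI-labelled neighborhood string from the table.
    example = "<-84779229_KilA-N||84779230_?-><-84779231_SSB"
    for gene in parse_neighborhood(example):
        print(gene.gi, gene.strand, "+".join(gene.domains), gene.is_query)
```

A regular expression is enough here because each gene carries its own arrow, so the `||` separators can simply be skipped. Domain names that themselves contain hyphens (for example `KilA-N` or `RecT-Redbeta`) are handled by letting the label extend up to the next `<`, `>` or `|`.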
GI Gene neighborhoods Dom archs Pfam archs Gene name Len Taxonomy Species name Genbank annotation
# Eukaryotic versions
551608163 N6-MTase N6-MTase EMIHUDRAFT_111979 250 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_111979 [Emiliania huxleyi CCMP1516].
551578908 N6-MTase N6-MTase EMIHUDRAFT_240085 261 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_240085 [Emiliania huxleyi CCMP1516].
# Prokaryotic homologs
446051431 <-Phage_integrase<-?<-?<-small-pep<-?<-N6-MTase*<-?<-?<-RecT-Redbeta N6-MTase N6-MTase - 185 bacteria>proteobacteria>gammaproteobacteria Escherichia coli DNA N-6-adenine-methyltransferase [Escherichia coli].
    447177970_?-><-446728115_?<-485655176_Phage_integrase<-446410552_?<-446686039_?<-446111015_small-pep<-446135997_?<-446051431_N6-MTase*<-692947403_?<-446108978_?<-485810799_RecT-Redbeta<-446918185_?<-446155721_?<-447210913_?<-446336822_?
486273694 RecT-Redbeta->?->?->N6-MTase*->?->small-pep->?->?->Phage_integrase-> N6-MTase N6-MTase - 185 bacteria>proteobacteria>gammaproteobacteria Escherichia coli phage N-6-adenine-methyltransferase [Escherichia coli].
    693027482_RecT-Redbeta->693027485_?->485691706_?->486273694_N6-MTase*->446135997_?->446111015_small-pep->446686022_?->446410552_?->445974032_Phage_integrase->
84779227 <-Phage_capsid<-Phage_portal<-Terminase_LS<-?<-HNH<-DCM<-Phage_lysozyme<-N6-MTase*<-DnaC<-KilA-N||?-><-SSB N6-MTase N6-MTase SG0729 181 bacteria>proteobacteria>gammaproteobacteria Sodalis glossinidius str. 'morsitans' hypothetical phage protein [Sodalis glossinidius str. 'morsitans'].
    <-84779220_Phage_capsid<-84779221_Phage_portal<-84779222_Terminase_LS<-84779223_?<-84779224_HNH<-84779225_DCM<-84779226_Phage_lysozyme<-84779227_N6-MTase*<-84779228_DnaC<-84779229_KilA-N||84779230_?-><-84779231_SSB<-84779232_?<-84779233_?||84779234_?->
490192932 <-N6-MTase* N6-MTase N6-MTase - 180 bacteria>proteobacteria>gammaproteobacteria Hafnia alvei DNA N-6-adenine-methyltransferase [Hafnia alvei].
    <-490192932_N6-MTase*<-490192933_?<-737528911_?<-490192935_?
496089880 <-Phage_integrase<-small-pep<-N6-MTase*<-?<-METHYLASE<-?<-?<-RecT-Redbeta N6-MTase N6-MTase - 180 bacteria>proteobacteria>gammaproteobacteria Enterobacteriaceae bacterium 9_2_54FAA DNA N-6-adenine-methyltransferase [Enterobacteriaceae bacterium 9_2_54FAA].
    <-490985983_?||496089884_?-><-490985991_?<-496089883_?||496089882_?-><-496089881_Phage_integrase<-748754240_small-pep<-496089880_N6-MTase*<-496089879_?<-496089878_METHYLASE<-748754325_?<-748754241_?<-496089875_RecT-Redbeta<-748754326_?<-496089873_?
505727589 <-Phage_AlpA<-HTH_3+Peptidase_S24||?->?->HTH->N6-MTase*->RusA->Phage_pRha+ANT->?->Phage_antitermQ-> N6-MTase N6-MTase - 180 bacteria>proteobacteria>gammaproteobacteria Rahnella aquatilis DNA N-6-adenine-methyltransferase [Rahnella aquatilis].
    <-505727580_?||505727582_?-><-753991441_Phage_AlpA<-505727584_HTH_3+Peptidase_S24||505727585_?->505727586_?->505727588_HTH->505727589_N6-MTase*->505727591_RusA->753991050_Phage_pRha+ANT->505727593_?->505727594_Phage_antitermQ-><-505727595_?||505727596_?->505727597_?->
746124400 <-Phage-tail-tape<-?<-Portal<-Terminase_LS||?->?->?->N6-MTase*->small-pep->Phage_integrase-> N6-MTase N6-MTase - 180 bacteria>proteobacteria>gammaproteobacteria Hafnia paralvei DNA methylase [Hafnia paralvei].
    <-746124389_Phage-tail-tape<-746124673_?<-746124392_Portal<-746124676_Terminase_LS||746124679_?->746124395_?->746124398_?->746124400_N6-MTase*->746124403_small-pep->746124406_Phage_integrase->746124409_?-><-496089958_?<-746124412_?<-746124413_?<-496089961_?
736793592 <-Phage_antitermQ<-?<-Phage_pRha+ANT<-RusA<-N6-MTase*<-HTH N6-MTase N6-MTase - 179 bacteria>proteobacteria>gammaproteobacteria Ewingella americana DNA methylase [Ewingella americana].
    <-736793578_?||736793581_?->736793582_?-><-736793583_Phage_antitermQ<-736793586_?<-736793588_Phage_pRha+ANT<-736793589_RusA<-736793592_N6-MTase*<-736793595_HTH<-736793711_?<-736793597_?||736793598_?->736793600_?->736793602_?->736793604_?->
738472851 <-Phage_antitermQ<-?<-KilA-N<-RusA<-?<-N6-MTase* N6-MTase N6-MTase - 179 bacteria>proteobacteria>gammaproteobacteria Morganella morganii DNA methylase [Morganella morganii].
    485706482_?-><-738472838_Phage_antitermQ<-738472840_?<-738472843_KilA-N<-738473381_RusA<-738472848_?<-738472851_N6-MTase*<-738472855_?<-738472858_?<-640732309_?<-738472860_?<-738472862_?<-738472864_?||738473383_?->
738462811 N6-MTase*->?->KilA-N->?->Phage_antitermQ-> N6-MTase N6-MTase - 178 bacteria>proteobacteria>gammaproteobacteria Morganella morganii DNA methylase [Morganella morganii].
    738462796_?-><-738462799_?||738464960_?->738462802_?->738462805_?->738462808_?->738464963_?->738462811_N6-MTase*->738462814_?->639128326_KilA-N->738462817_?->738462820_Phage_antitermQ->738462824_?-><-738462827_?||639126534_?->
802097985 N6-MTase*->?->KilA-N->?->Phage_antitermQ-> N6-MTase N6-MTase - 178 bacteria>proteobacteria>gammaproteobacteria Morganella morganii DNA methylase [Morganella morganii].
    802097982_?->802097030_?->802097031_?->802097983_?->802097984_?->802097985_N6-MTase*->802097986_?->639128326_KilA-N->802097987_?->738462820_Phage_antitermQ->738462824_?-><-738462827_?||639126534_?->
499730784 <-RusA<-?<-?<-N6-MTase*<-?<-?<-?||?->?->RecT-Redbeta-> N6-MTase SP - 177 bacteria>proteobacteria>gammaproteobacteria Sodalis glossinidius DNA N-6-adenine-methyltransferase [Sodalis glossinidius].
<-754366011_?<-643659505_?||499730781_?-><-754366012_?<-499730782_RusA<-754366538_?<-754366013_?<-499730784_N6-MTase*<-754366014_?<-754366015_?<-754366539_?||754366016_?->754366017_?->499730787_RecT-Redbeta->754366018_?-> 169152788 <-N6-MTase* N6-MTase N6-MTase ABSDF2497 173 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii SDF putative bacteriophage protein [Acinetobacter baumannii SDF]. <-169152781_?<-169152782_?<-169152783_?<-169152784_?<-169152785_?<-169152786_?<-169152787_?<-169152788_N6-MTase*<-169152789_?<-169152790_?<-169152791_?<-169152792_?<-169152793_?||169152794_?->169152795_?-> 284008293 KilA-N->?->?->Phage_lambda_P->?->N6-MTase*-> N6-MTase N6-MTase ARN_24250 173 bacteria>proteobacteria>gammaproteobacteria Arsenophonus nasoniae phage DNA methyltransferase [Arsenophonus nasoniae]. 284008286_?-><-284008287_?||284008288_KilA-N->284008289_?->284008290_?->284008291_Phage_lambda_P->284008292_?->284008293_N6-MTase*-> 447017697 <-N6-MTase<-?||?->?->?->N6-MTase*->?->?-><-Phage_integrase N6-MTase N6-MTase - 166 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii N-6-adenine-methyltransferase [Acinetobacter baumannii]. <-446466656_?<-447033140_?<-446956397_N6-MTase<-447018352_?||740523977_?->446054157_?->446054521_?->447017697_N6-MTase*->446850517_?->446434453_?-><-446697986_Phage_integrase<-447100339_?<-446730003_?<-446054325_?||446643136_?-> 490838153 <-N6-MTase*<-?<-?<-?<-?<-?<-AAA N6-MTase N6-MTase - 166 bacteria>proteobacteria>gammaproteobacteria Acinetobacter sp. NIPH 973 phage N-6-adenine-methyltransferase [Acinetobacter sp. NIPH 973]. <-490838141_?<-490838144_?<-490838145_?||490838148_?->490838149_?-><-446848420_?<-490838151_?<-490838153_N6-MTase*<-445951092_?<-446776857_?<-645915102_?<-446577501_?<-490838159_?<-490838161_AAA<-488063409_? 
493629840 HTH->?->?-><-Phage_AlpA<-N6-MTase*<-?<-?<-?<-?<-?<-?<-HTH_3+Peptidase_S24 N6-MTase N6-MTase - 166 bacteria>proteobacteria>gammaproteobacteria Acinetobacter nosocomialis N-6-adenine-methyltransferase [Acinetobacter nosocomialis]. <-491023730_?<-491023731_?||493629838_?->491023734_HTH->493629839_?->491023738_?-><-490850089_Phage_AlpA<-493629840_N6-MTase*<-445951092_?<-493629841_?<-493629842_?<-445995254_?<-446051578_?<-447183491_?<-493629844_HTH_3+Peptidase_S24 493629922 <-N6-MTase*<-?<-?<-?<-AAA N6-MTase N6-MTase - 166 bacteria>proteobacteria>gammaproteobacteria Acinetobacter calcoaceticus/baumannii complex MULTISPECIES: N-6-adenine-methyltransferase [Acinetobacter calcoaceticus/baumannii complex]. <-491024412_?<-446530955_?<-487980009_?||491024414_?->691157783_?-><-491024421_?<-490838151_?<-493629922_N6-MTase*<-445951092_?<-493629923_?<-493629924_?<-493629925_AAA<-491280326_?<-493629926_?<-493629927_? 515155813 N6-MTase*->Phage_AlpA-> N6-MTase SP+N6-MTase - 166 bacteria>proteobacteria>gammaproteobacteria Vibrio cyclitrophicus DNA N-6-adenine-methyltransferase [Vibrio cyclitrophicus]. 656242824_?->515155813_N6-MTase*->515155814_Phage_AlpA-><-515155815_?<-515158394_?<-695348704_?||515155818_?->515155819_?->515155820_?-> 645913983 N6-MTase*->?->Phage_AlpA-> N6-MTase N6-MTase - 166 bacteria>proteobacteria>gammaproteobacteria Acinetobacter calcoaceticus/baumannii complex MULTISPECIES: adenine methyltransferase [Acinetobacter calcoaceticus/baumannii complex]. 446913963_?->691068996_?->691068999_?->691069002_?->691069005_?->671585605_?->691069009_?->645913983_N6-MTase*->691069012_?->446324015_Phage_AlpA->446956738_?-><-490848217_?<-691069015_?<-691069016_?||493629626_?-> 646896396 METHYLASE-><-?<-?<-DCM<-Phage_AlpA<-N6-MTase*<-?<-Phage_AlpA||RadC->?->?->?-><-Phage_AlpA N6-MTase SP+N6-MTase - 166 bacteria>proteobacteria>gammaproteobacteria Vibrio parahaemolyticus adenine methyltransferase [Vibrio parahaemolyticus]. 
<-545079836_?<-545079837_?||646896368_METHYLASE-><-646896374_?<-686283225_?<-646896385_DCM<-646896390_Phage_AlpA<-646896396_N6-MTase*<-686283226_?<-646896408_Phage_AlpA||658925964_RadC->658925965_?->646896434_?->646896439_?-><-646896445_Phage_AlpA 691047241 RecT-Redbeta->?->?->?->?->N6-MTase*->?->Phage_integrase-><-?<-Phage_lysozyme N6-MTase N6-MTase - 166 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. 691048071_?->691048073_?->497182141_RecT-Redbeta->497182088_?->446054153_?->691048076_?->691047238_?->691047241_N6-MTase*->490838151_?->493629963_Phage_integrase-><-446006682_?<-691048213_Phage_lysozyme<-447006407_?<-446060349_? 758882462 RecT-Redbeta->?->?->?->N6-MTase*->?-><-Phage_integrase N6-MTase N6-MTase - 166 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. 758882458_?->446375422_?->758882459_?->445988016_RecT-Redbeta->487936574_?->758882460_?->758882461_?->758882462_N6-MTase*->446046136_?-><-758882463_Phage_integrase||490978840_?->446933280_?->446032311_?->447038484_?->445997369_?-> 444754682 Phage_integrase-><-?<-N6-MTase* N6-MTase N6-MTase ACIN5021_2863 163 bacteria>proteobacteria>gammaproteobacteria Acinetobacter sp. OIFC021 DNA N-6-adenine-methyltransferase (N6-MTase) [Acinetobacter sp. OIFC021]. <-444754612_?<-444754588_?<-444754680_?<-444754736_?<-444754818_?||444754653_Phage_integrase-><-444754626_?<-444754682_N6-MTase*<-444754756_?<-444754570_?<-444754810_?<-444754666_?<-444754593_?<-444754819_?<-444754734_? 588219826 <-Phage_integrase<-?<-N6-MTase* N6-MTase N6-MTase J594_4091 163 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii 259052 DNA N-6-adenine-methyltransferase family protein [Acinetobacter baumannii 259052]. 588219822_?-><-588219825_Phage_integrase<-588219827_?<-588219826_N6-MTase*<-588219823_?<-588219821_?<-588219828_?<-588219824_? 
593668543 <-N6-MTase*<-?<-?<-?<-AAA N6-MTase N6-MTase J660_0735 163 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii 88816 DNA N-6-adenine-methyltransferase family protein [Acinetobacter baumannii 88816]. <-593668536_?<-593668537_?<-593668538_?||593668539_?->593668540_?-><-593668541_?<-593668542_?<-593668543_N6-MTase*<-593668544_?<-593668545_?<-593668546_?<-593668547_AAA<-593668548_?<-593668549_?<-593668550_? 748690860 <-RadC||N6-MTase*->Phage_AlpA->HTH->REase+SFII->METHYLASE-> N6-MTase N6-MTase - 161 bacteria>proteobacteria>gammaproteobacteria Vibrio ichthyoenteri adenine methyltransferase [Vibrio ichthyoenteri]. 493763790_?-><-493763791_?<-748690858_?<-748690859_?<-493763794_?<-493763795_?<-493763796_RadC||748690860_N6-MTase*->493763798_Phage_AlpA->493763799_HTH->748690861_REase+SFII->493763801_METHYLASE->748690862_?->493763803_?-> 736663998 <-RusA<-?<-?<-?<-?<-?<-N6-MTase*<-DnaB<-Phage_rep_O<-?<-?||HTH_3+Peptidase_S24-> N6-MTase N6-MTase - 155 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. <-736663988_?<-736663989_RusA<-736582961_?<-651319723_?<-736663993_?<-736663995_?<-736663997_?<-736663998_N6-MTase*<-736664032_DnaB<-736663999_Phage_rep_O<-736664000_?<-736664001_?||736664033_HTH_3+Peptidase_S24->736664002_?-><-736664003_? 740612810 N6-MTase*->McrB->McrC->?->HNH-> N6-MTase N6-MTase - 141 bacteria>proteobacteria>gammaproteobacteria Vibrio parahaemolyticus adenine methyltransferase, partial [Vibrio parahaemolyticus]. 516017700_?-><-545079325_?||645070513_?->491602254_?->645070514_?-><-645070287_?<-645070288_?||740612810_N6-MTase*->740612812_McrB->645070517_McrC->645070518_?->645070519_HNH->645070520_?-><-645070521_?||645070522_?-> 696306260 <-N6-MTase<-?<-?<-N6-MTase* N6-MTase N6-MTase - 155 bacteria>proteobacteria>gammaproteobacteria Acinetobacter sp. WC-323 adenine methyltransferase [Acinetobacter sp. WC-323]. 
<-497201532_?<-696306258_?<-497201536_?<-497201530_?<-497201538_N6-MTase<-497201541_?<-696306264_?<-696306260_N6-MTase*<-696306262_?<-497201545_? 690981431 METHYLASE->?->?-><-?<-?<-?<-N6-MTase*<-?<-?<-?<-N6-MTase N6-MTase N6-MTase - 158 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. 445987296_METHYLASE->446375043_?->447052417_?-><-690981438_?<-690981435_?<-727739395_?<-690981431_N6-MTase*<-690981312_?<-446667397_?<-446902378_?<-446605937_N6-MTase<-690981373_?<-691001572_?<-691014600_? 691039522 <-HTH_3+Peptidase_S24||?->?->?->?->Phage_rep_O->?->N6-MTase*-> N6-MTase N6-MTase - 158 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. <-445974397_HTH_3+Peptidase_S24||446625677_?->445971061_?->447103404_?->446525189_?->445986772_Phage_rep_O->691016564_?->691039522_N6-MTase*->691015623_?->691015626_?->691026561_?-> 691068978 <-Head-tail_con<-?<-?<-?<-?<-?<-?<-N6-MTase*<-?<-?<-?<-?<-?<-DnaC<-Phage_rep_O N6-MTase N6-MTase - 158 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. <-691068966_Head-tail_con<-691068967_?<-691068970_?<-691068972_?<-691068974_?<-691068976_?<-691068977_?<-691068978_N6-MTase*<-691068979_?<-691068980_?<-691068982_?<-691068985_?<-645913892_?<-691068987_DnaC<-691068990_Phage_rep_O 695353200 <-METHYLASE<-Phage_AlpA<-N6-MTase*<-?<-Phage_AlpA||RadC-> N6-MTase N6-MTase - 144 bacteria>proteobacteria>gammaproteobacteria Vibrio splendidus adenine methyltransferase, partial [Vibrio splendidus]. 
<-515656659_?<-515656660_?<-515645430_?||515656661_?-><-515656662_?<-515656663_METHYLASE<-657349588_Phage_AlpA<-695353200_N6-MTase*<-695353203_?<-515656666_Phage_AlpA||515656668_RadC->515656669_?->515656670_?->515656671_?->515656672_?-> 691154760 <-N6-MTase* N6-MTase N6-MTase - 157 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase, partial [Acinetobacter baumannii]. <-691154760_N6-MTase* 691157882 DnaC->N6-MTase*-> N6-MTase N6-MTase - 155 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. 446990918_?->446088269_?->515182860_?->691157878_?->691157879_?->691157880_?->691157881_DnaC->691157882_N6-MTase*->691157883_?->691157884_?->691157885_?->691157886_?->690988505_?->690988506_?->446940447_?-> 425484490 <-N6-MTase<-?<-?<-N6-MTase* N6-MTase N6-MTase ACINWC323_A0077 152 bacteria>proteobacteria>gammaproteobacteria Acinetobacter sp. WC-323 DNA N-6-adenine-methyltransferase (N6-MTase) [Acinetobacter sp. WC-323]. <-425484495_?<-425484488_?<-425484497_?<-425484494_?<-425484498_N6-MTase<-425484499_?<-425484487_?<-425484490_N6-MTase*<-425484493_?<-425484501_? 447010248 <-N6-MTase* N6-MTase N6-MTase - 155 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii hypothetical protein [Acinetobacter baumannii]. <-746025322_?<-446082079_?<-447010248_N6-MTase*<-638872311_?<-446202223_?<-638872318_?<-447006889_?<-446995652_? 507070967 <-N6-MTase*<-N6-MTase<-?<-?<-?<-DnaC<-?<-DCM N6-MTase N6-MTase - 155 bacteria>proteobacteria>gammaproteobacteria Acinetobacter pittii phage N-6-adenine-methyltransferase [Acinetobacter pittii]. 
<-507070960_?<-507070961_?<-507070962_?<-507070963_?<-507070964_?<-507070965_?<-507070966_?<-507070967_N6-MTase*<-507070968_N6-MTase<-507070969_?<-507070970_?<-507070971_?<-507070972_DnaC<-507070973_?<-690970629_DCM 690997976 <-Head-tail_con<-?<-?<-small-protein<-small-protein<-N6-MTase*<-?<-?<-RusA<-?<-DnaC<-Phage_rep_O N6-MTase N6-MTase - 154 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. <-690997963_?<-690997966_?<-690997968_Head-tail_con<-446255980_?<-446652432_?<-690997971_small-protein<-690997973_small-protein<-690997976_N6-MTase*<-690997978_?<-690997980_?<-690997982_RusA<-690997983_?<-690997985_DnaC<-690997990_Phage_rep_O<-446741286_? 630464595 <-N6-MTase* N6-MTase N6-MTase J532_4398 155 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii 940793 DNA N-6-adenine-methyltransferase family protein [Acinetobacter baumannii 940793]. <-630464593_?<-630464594_?<-630464595_N6-MTase* 690988986 <-HTH_3+Peptidase_S24||?->?->?->?->Phage_rep_O->?->N6-MTase*-> N6-MTase N6-MTase - 155 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. <-446715420_HTH_3+Peptidase_S24||446088064_?->690988978_?->447018345_?->690988980_?->690988982_Phage_rep_O->690988984_?->690988986_N6-MTase*->690988987_?->690988990_?->690988991_?->690988994_?->690988996_?->727739050_?->446019471_?-> 690996743 <-N6-MTase*<-?<-Phage_rep_O<-?<-?<-?<-?||HTH_3+Peptidase_S24-> N6-MTase N6-MTase - 155 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. 
<-638852081_?<-446019471_?<-690988996_?<-690988994_?<-690988991_?<-690988990_?<-690990083_?<-690996743_N6-MTase*<-690988984_?<-690988982_Phage_rep_O<-690988980_?<-447018345_?<-690988978_?<-446088064_?||446715420_HTH_3+Peptidase_S24-> 691027491 <-N6-MTase*<-?<-?<-DnaB<-Phage_rep_O N6-MTase N6-MTase - 155 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. <-446978217_?<-446022309_?<-691027489_?<-691016566_?<-445966370_?<-446571071_?<-446082079_?<-691027491_N6-MTase*<-446749121_?<-446991165_?<-446028310_DnaB<-446122449_Phage_rep_O<-489397397_?<-446990989_?||446789918_?-> 691065210 N6-MTase*-> N6-MTase N6-MTase - 155 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. 446990918_?->446088269_?->691065201_?->691065203_?->691065205_?->691065207_?->691065209_?->691065210_N6-MTase*->691065211_?->691065213_?->691065214_?->691065215_?->446074810_?->691065217_?->446300643_?-> 691093639 N6-MTase*->?->?->?->?->?->Phage_antitermQ-> N6-MTase N6-MTase - 155 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. 690990088_?->447190729_?->691093632_?->691093635_?->691093639_N6-MTase*->690990083_?->690990082_?->690990080_?->690990079_?->690990078_?->690990076_Phage_antitermQ->690990074_?-> 691117543 small-protein->?->ASCH->?->?->DCM->N6-MTase->N6-MTase*-> N6-MTase N6-MTase - 155 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase [Acinetobacter baumannii]. 691117523_small-protein->691117525_?->691117527_ASCH->691117530_?->691117533_?->691117536_DCM->691117539_N6-MTase->691117543_N6-MTase*->691117546_?-><-691117549_? 663438128 <-N6-MTase* N6-MTase N6-MTase - 119 bacteria>proteobacteria>gammaproteobacteria Acinetobacter baumannii adenine methyltransferase, partial [Acinetobacter baumannii]. <-663438128_N6-MTase*<-446054521_?<-446054157_? 
#
316921487 Phage_endo_I->?->?->Bro-N->?->?->N6-MTase*->?->?->?->MazG-Phage+MazG-Phage->Phage_pRha+ANT->AAA-> N6-MTase N6-MTase HMPREF0179_03455 158 bacteria>proteobacteria>deltaproteobacteria Bilophila wadsworthia 3_1_6 phage N-6-adenine-methyltransferase [Bilophila wadsworthia 3_1_6]. 316921494_?->316921493_Phage_endo_I->316921492_?->316921491_?->316921490_Bro-N->316921489_?->316921488_?->316921487_N6-MTase*->316921486_?->316921485_?->316921484_?->316921483_MazG-Phage+MazG-Phage->316921482_Phage_pRha+ANT->316921481_AAA->316921480_?-> 749811142 Phage_endo_I->?->?->Bro-N->?->?->N6-MTase*->?->?->?->MazG-Phage+MazG-Phage->Phage_pRha->ANT->AAA-> N6-MTase N6-MTase - 147 bacteria>proteobacteria>deltaproteobacteria Bilophila wadsworthia adenine methyltransferase, partial [Bilophila wadsworthia]. 491171955_?->749811133_Phage_endo_I->749811135_?->491171951_?->749811137_Bro-N->749811140_?->491171946_?->749811142_N6-MTase*->749811143_?->491171940_?->749811145_?->749811147_MazG-Phage+MazG-Phage->749811148_Phage_pRha->749811149_ANT->491171932_AAA->
#
736470177 <-Phage_integrase||?->N6-MTase*-> N6-MTase N6-MTase - 143 bacteria>proteobacteria>alphaproteobacteria Afifella pfennigii hypothetical protein, partial [Afifella pfennigii]. <-651245532_?||736470174_?->651245534_?->736470147_?->651245535_?-><-651245536_Phage_integrase||736470150_?->736470177_N6-MTase*->736470179_?-><-736470182_?<-651245538_?||651245539_?-><-736470184_?||651245540_?->651245541_?->
#
654109520 <-MOM<-N6-MTase* N6-MTase N6-MTase - 154 bacteria>firmicutes Desulfovirgula thermocuniculi adenine methyltransferase [Desulfovirgula thermocuniculi]. <-737120374_?<-654109515_?<-737120377_?<-737120378_MOM<-654109520_N6-MTase*<-654109526_?<-654109530_?<-737120375_? 567770034 <-DUF3310<-Prim-Pol+PriCT_1+D5<-?<-?<-?<-N6-MTase* N6-MTase N6-MTase ERIC1_1c08270 155 bacteria>firmicutes Paenibacillus larvae subsp. larvae DSM 25719 phage N-6-adenine methyltransferase [Paenibacillus larvae subsp. larvae DSM 25719]. 
<-567770027_?<-567770028_?<-567770029_DUF3310<-567770030_Prim-Pol+PriCT_1+D5<-567770031_?<-567770032_?<-567770033_?<-567770034_N6-MTase*<-567770035_?<-567770036_?<-567770037_?<-567770038_?<-567770039_?<-567770040_?<-567770041_? 517503045 <-N6-MTase*<-?<-?<-?<-?<-Phage_pRha<-?<-DnaC N6-MTase N6-MTase - 156 bacteria>firmicutes Brevibacillus laterosporus DNA N-6-adenine-methyltransferase [Brevibacillus laterosporus]. <-517503038_?<-517503039_?<-737329766_?<-517503041_?<-517503042_?<-517503043_?<-517503044_?<-517503045_N6-MTase*<-737329767_?<-517503050_?<-517503051_?<-517503052_?<-737329768_Phage_pRha<-517503054_?<-737329760_DnaC 493931641 DnaC->?->?->?->RecU->N6-MTase*->N6-MTase->HARE-HTH->ASCH->?->?->MPTase-> N6-MTase N6-MTase - 152 bacteria>firmicutes Anaerotruncus colihominis DNA N-6-adenine-methyltransferase [Anaerotruncus colihominis]. 749997332_?->749997333_?->493931636_DnaC->749997334_?->749997338_?->493931639_?->749997339_RecU->493931641_N6-MTase*->749997341_N6-MTase->493931643_HARE-HTH->749997342_ASCH->749997346_?->493931646_?->749997347_MPTase->493931648_?-> 737823765 small-protein->?->?->?->N6-MTase*->?->?->?->?->?->?->SSB-> N6-MTase N6-MTase - 135 bacteria>firmicutes Clostridium botulinum adenine methyltransferase [Clostridium botulinum]. 737823833_?->737814057_?->737823755_?->737823757_small-protein->737823759_?->737823761_?->737823763_?->737823765_N6-MTase*->737823767_?->737823769_?->737823771_?->737823773_?->737823775_?->737819447_?->737819450_SSB-> 739064083 METHYLASE->?-><-MPTase<-Phage-tail-tape||?->?->?->N6-MTase*->small-protein->Recombinase-> N6-MTase N6-MTase - 152 bacteria>firmicutes Pseudobacteroides cellulosolvens adenine methyltransferase [Pseudobacteroides cellulosolvens]. 
739064069_METHYLASE->739064071_?-><-739064073_MPTase<-739064075_Phage-tail-tape||739064077_?->739064079_?->739064081_?->739064083_N6-MTase*->739064085_small-protein->739064553_Recombinase->739064087_?->739064089_?->739064556_?->739064091_?->739064093_?-> 291074040 <-HARE-HTH<-N6-MTase*<-?<-RecU<-?<-?<-N6-MTase N6-MTase N6-MTase CLOM621_08346 148 bacteria>firmicutes Clostridium sp. M62/1 DNA N-6-adenine-methyltransferase (N6-MTase) [Clostridium sp. M62/1]. <-291074033_?<-291074034_?<-291074035_?<-291074036_?<-291074037_?<-291074038_?<-291074039_HARE-HTH<-291074040_N6-MTase*<-291074041_?<-291074042_RecU<-291074043_?<-291074044_?<-291074045_N6-MTase<-291074046_?<-291074047_? 503587829 Phage_integrase->?->N6-MTase*->?->N6-MTase-> N6-MTase N6-MTase - 156 bacteria>firmicutes Desulfotomaculum kuznetsovii DNA N-6-adenine-methyltransferase [Desulfotomaculum kuznetsovii]. 503587822_?->752613398_?->503587824_?->752613399_?->752613400_?->503587827_Phage_integrase->503587828_?->503587829_N6-MTase*->503587830_?->503587831_N6-MTase->503587832_?->503587833_?->503587834_?->752613790_?->503587836_?-> 759006369 N6-MTase*->?->ASCH->?->?->?->Terminase_SS->Terminase_LS-> N6-MTase N6-MTase - 147 bacteria>firmicutes Aneurinibacillus migulanus adenine methyltransferase [Aneurinibacillus migulanus]. 759006362_?->759006363_?->759006364_?->759006365_?->759006366_?->759006367_?->759006368_?->759006369_N6-MTase*->759006370_?->759006371_ASCH->759006372_?->759006373_?->759006374_?->759006468_Terminase_SS->759006375_Terminase_LS-> 488372936 <-HNH<-PBECR1<-?<-?<-dUTPase<-?<-?<-N6-MTase*<-?<-?<-DUF3310<-?<-PVL_ORF50 N6-MTase N6-MTase - 145 bacteria>firmicutes Staphylococcus caprae DNA N-6-adenine-methyltransferase [Staphylococcus caprae]. <-488372955_HNH<-488372953_PBECR1<-488372950_?<-488372948_?<-488372946_dUTPase<-488372940_?<-739686961_?<-488372936_N6-MTase*<-488372934_?<-488372931_?<-488372929_DUF3310<-488372927_?<-488372925_PVL_ORF50<-488372923_?<-488372922_? 
737442515 <-Phage_terminase<-?<-DCM<-ParB<-?<-?<-?<-N6-MTase*<-?<-?<-RusA<-DnaB<-DnaC N6-MTase N6-MTase - 145 bacteria>firmicutes Bacillus sp. NSP2.1 adenine methyltransferase [Bacillus sp. NSP2.1]. <-737442428_Phage_terminase<-651510613_?<-651510616_DCM<-651510619_ParB<-651510622_?<-651510625_?<-492413769_?<-737442515_N6-MTase*<-651510632_?<-737442516_?<-492413793_RusA<-651510639_DnaB<-737442431_DnaC<-737442517_?<-737442518_? 748713908 <-Phage_portal<-Terminase_LS<-Phage_terminase<-?<-?<-?<-?<-N6-MTase*<-?<-?<-?<-?<-?<-?<-RusA N6-MTase N6-MTase - 145 bacteria>firmicutes Brevibacillus agri adenine methyltransferase [Brevibacillus agri]. <-492413752_Phage_portal<-748713899_Terminase_LS<-748713907_Phage_terminase<-492413760_?<-492413763_?<-492413766_?<-492413769_?<-748713908_N6-MTase*<-492413776_?<-492413778_?<-492413781_?<-492413784_?<-492413787_?<-748713909_?<-492413793_RusA 554763517 N6-MTase*->?->?->?->Terminase_LS->Phage_portal->MuF->Phage_GP20-> N6-MTase N6-MTase - 144 bacteria>firmicutes Lactococcus lactis hypothetical protein [Lactococcus lactis]. 554763517_N6-MTase*->696369314_?->554763519_?->696369328_?->554763521_Terminase_LS->554763522_Phage_portal->696369317_MuF->696369330_Phage_GP20-> 432181416 <-Phage_portal<-Terminase_LS<-Phage_terminase<-?<-?<-?<-?<-N6-MTase*<-?<-?<-?<-?<-?<-?<-RusA N6-MTase N6-MTase D478_26539 157 bacteria>firmicutes Brevibacillus agri BAB-2500 DNA N-6-adenine-methyltransferase [Brevibacillus agri BAB-2500]. <-432181409_Phage_portal<-432181410_Terminase_LS<-432181411_Phage_terminase<-432181412_?<-432181413_?<-432181414_?<-432181415_?<-432181416_N6-MTase*<-432181417_?<-432181418_?<-432181419_?<-432181420_?<-432181421_?<-432181422_?<-432181423_RusA 739716594 <-MazG<-?<-?<-HNH<-?<-?<-?<-N6-MTase*<-DUF3310<-?<-?<-RusA<-?<-?<-Phage_rep_org_N N6-MTase N6-MTase - 144 bacteria>firmicutes Staphylococcus aureus adenine methyltransferase [Staphylococcus aureus]. 
<-739716582_MazG<-739716585_?<-739716586_?<-739716589_HNH<-739716637_?<-739716591_?<-739716592_?<-739716594_N6-MTase*<-739716640_DUF3310<-739716596_?<-499595896_?<-739716598_RusA<-739716600_?<-739716602_?<-739716604_Phage_rep_org_N 746045508 <-Terminase_LS<-?<-HNH<-?<-?<-small-protein<-N6-MTase* N6-MTase N6-MTase - 144 bacteria>firmicutes Lactococcus lactis adenine methyltransferase [Lactococcus lactis]. <-746045496_?<-746045498_Terminase_LS<-746046287_?<-746045500_HNH<-746045502_?<-746045504_?<-746045506_small-protein<-746045508_N6-MTase*<-746045509_?<-746046289_?<-746045511_?<-746045512_?<-746045513_?<-746046291_?<-746045514_? 700273311 <-N6-MTase<-N6-MTase* N6-MTase N6-MTase NZ45_03810 143 bacteria>firmicutes Clostridium botulinum adenine methyltransferase [Clostridium botulinum]. <-700273305_?<-700273306_?||700273307_?-><-700273308_?<-700273309_?<-700273310_?<-700273313_N6-MTase<-700273311_N6-MTase*<-700273312_? 496656604 <-ParB+N6-MTase<-?<-?<-?<-?<-HARE-HTH<-N6-MTase*<-?<-RecU<-?<-?<-N6-MTase N6-MTase N6-MTase - 158 bacteria>firmicutes Clostridium sp. 7_3_54FAA DNA N-6-adenine-methyltransferase [Clostridium sp. 7_3_54FAA]. <-496656597_?<-496656598_ParB+N6-MTase<-496656599_?<-496656600_?<-496656601_?<-769135258_?<-496656603_HARE-HTH<-496656604_N6-MTase*<-496656605_?<-496656606_RecU<-496656607_?<-496656608_?<-496656609_N6-MTase<-496656610_?<-496656611_? 488427723 <-small-protein<-?<-?<-DUF3310<-PVL_ORF50<-?<-?<-N6-MTase*<-?<-?<-DnaC<-Phage_rep_org_N<-AP2<-?<-DUF968 N6-MTase N6-MTase - 142 bacteria>firmicutes Staphylococcus epidermidis DNA N-6-adenine-methyltransferase [Staphylococcus epidermidis]. 
<-488427716_small-protein<-488427717_?<-488427718_?<-488427719_DUF3310<-488427720_PVL_ORF50<-488427721_?<-488427722_?<-488427723_N6-MTase*<-488427724_?<-488427725_?<-488427726_DnaC<-488427727_Phage_rep_org_N<-488427728_AP2<-488427729_?<-488427730_DUF968 489480013 Phage_portal->MuF->?->N6-MTase*->N6-MTase-> N6-MTase N6-MTase - 142 bacteria>firmicutes Clostridium botulinum phage N-6-adenine-methyltransferase [Clostridium botulinum]. 696516263_Phage_portal->696516265_MuF->666650663_?->489480013_N6-MTase*->696516267_N6-MTase-> 515743089 <-DUF3310<-?<-?<-?<-N6-MTase*<-?<-?<-DnaB<-?<-Phage_rep_org_N<-DUF968 N6-MTase N6-MTase - 142 bacteria>firmicutes Staphylococcus hominis DNA N-6-adenine-methyltransferase [Staphylococcus hominis]. <-739692513_DUF3310<-515743086_?<-515743087_?<-515743088_?<-515743089_N6-MTase*<-515743090_?<-515743091_?<-515743092_DnaB<-515743093_?<-515743094_Phage_rep_org_N<-515743095_DUF968<-515743096_? 446374006 SSB->DUF968->Phage_rep_org_N->DnaC->?->?->N6-MTase*->?->?->PVL_ORF50->Phage_Orf51->?->dUTPase-> N6-MTase N6-MTase - 141 bacteria>firmicutes Staphylococcus aureus DNA N-6-adenine-methyltransferase [Staphylococcus aureus]. 447210992_?->446627362_SSB->447122187_DUF968->446427137_Phage_rep_org_N->446725726_DnaC->446159298_?->447046432_?->446374006_N6-MTase*->445971925_?->447109983_?->446458196_PVL_ORF50->447049569_Phage_Orf51->446987809_?->446107781_dUTPase->447204818_?-> 446374007 <-Phage_Orf51<-PVL_ORF50<-?<-?<-?<-N6-MTase*<-?<-?<-DnaB<-?<-Phage_rep_org_N<-DUF968<-SSB N6-MTase N6-MTase - 141 bacteria>firmicutes Staphylococcus aureus DNA N-6-adenine-methyltransferase [Staphylococcus aureus]. 
<-446377139_?<-827456484_?<-827456486_Phage_Orf51<-827456488_PVL_ORF50<-446695570_?<-827456491_?<-445971951_?<-446374007_N6-MTase*<-447046434_?<-446947149_?<-447028912_DnaB<-447054991_?<-446427118_Phage_rep_org_N<-447122162_DUF968<-446627367_SSB 506511035 <-Phage_Orf51<-DUF3310<-PVL_ORF50<-?<-?<-N6-MTase*<-?<-?<-DnaC<-Phage_rep_org_N||?-><-DUF968<-SSB N6-MTase N6-MTase - 141 bacteria>firmicutes Staphylococcus aureus hypothetical protein [Staphylococcus aureus]. <-686306375_?<-446901953_?<-686148387_Phage_Orf51<-445944872_DUF3310<-506506510_PVL_ORF50<-447109983_?<-445971925_?<-506511035_N6-MTase*<-447046432_?<-446159298_?<-521258099_DnaC<-446112371_Phage_rep_org_N||446633145_?-><-521258098_DUF968<-752533923_SSB 554679133 SSB->DUF968->Phage_rep_org_N->?->DnaB->?->?->N6-MTase*->?->?->DUF3310->Phage_Orf51-> N6-MTase N6-MTase - 141 bacteria>firmicutes Staphylococcus aureus prophage LambdaSo DNA modification methyltransferase [Staphylococcus aureus]. 554679128_SSB->554679130_DUF968->446427119_Phage_rep_org_N->447054987_?->554679132_DnaB->446947142_?->447046432_?->554679133_N6-MTase*->445971955_?->447110008_?->445944880_DUF3310->554679135_Phage_Orf51->554679137_?->447028362_?->446987770_?-> 678260344 <-small-protein<-?<-Phage_Orf51<-DUF3310<-PVL_ORF50<-?<-?<-N6-MTase*<-?<-?<-DnaC<-Phage_rep_org_N||?-><-small-protein<-DUF968 N6-MTase N6-MTase ERS140248_02184 141 bacteria>firmicutes Staphylococcus aureus prophage L54a, N-6-adenine-methyltransferase [Staphylococcus aureus]. <-678260337_small-protein<-678260338_?<-678260339_Phage_Orf51<-678260340_DUF3310<-678260341_PVL_ORF50<-678260342_?<-678260343_?<-678260344_N6-MTase*<-678260345_?<-678260346_?<-678260347_DnaC<-678260348_Phage_rep_org_N||678260349_?-><-678260350_small-protein<-678260351_DUF968 686297326 SSB->DUF968->Phage_rep_org_N->?->DnaB->?->?->N6-MTase*->?->?->?->PVL_ORF50->Phage_Orf51-> N6-MTase N6-MTase - 141 bacteria>firmicutes Staphylococcus aureus adenine methyltransferase [Staphylococcus aureus]. 
446857537_SSB->686174277_DUF968->686297325_Phage_rep_org_N->447054991_?->686169507_DnaB->686169506_?->447046432_?->686297326_N6-MTase*->686297327_?->447110007_?->446695570_?->486217540_PVL_ORF50->686175250_Phage_Orf51-> 686300364 SSB->DUF968-><-?||Phage_rep_org_N->DnaC->?->?->N6-MTase*->?->?->?->PVL_ORF50->Phage_Orf51-> N6-MTase N6-MTase - 141 bacteria>firmicutes Staphylococcus aureus adenine methyltransferase [Staphylococcus aureus]. 446627367_SSB->686300367_DUF968-><-446336900_?||523688958_Phage_rep_org_N->686300366_DnaC->446159298_?->686300365_?->686300364_N6-MTase*->445971943_?->447110007_?->446695570_?->686300363_PVL_ORF50->686348626_Phage_Orf51-> 686391504 <-Phage_Orf51<-PVL_ORF50<-?<-?<-N6-MTase*<-?<-?<-DnaC<-Phage_rep_org_N<-AP2<-DUF968<-SSB N6-MTase N6-MTase - 141 bacteria>firmicutes Staphylococcus aureus adenine methyltransferase [Staphylococcus aureus]. <-686391501_?<-686391502_Phage_Orf51<-686391503_PVL_ORF50<-447109983_?<-445971925_?<-686391504_N6-MTase*<-447046432_?<-446159298_?<-686332987_DnaC<-446427132_Phage_rep_org_N<-686391505_AP2<-686298587_DUF968<-686391506_SSB 686419170 SSB->DUF968->AP2->Phage_rep_org_N->DnaC->?->?->N6-MTase*->?->?->?->PVL_ORF50->Phage_Orf51-> N6-MTase N6-MTase - 141 bacteria>firmicutes Staphylococcus aureus adenine methyltransferase [Staphylococcus aureus]. 686419164_SSB->686419165_DUF968->686419166_AP2->686419167_Phage_rep_org_N->686419168_DnaC->446551491_?->686419169_?->686419170_N6-MTase*->445971955_?->686382904_?->446023397_?->445951516_PVL_ORF50->686382906_Phage_Orf51-> 686449191 <-Phage_Orf51<-DUF3310<-?<-?<-N6-MTase*<-?<-?<-DnaC<-Phage_rep_org_N<-AP2<-DUF968<-SSB N6-MTase N6-MTase - 141 bacteria>firmicutes Staphylococcus aureus adenine methyltransferase [Staphylococcus aureus]. 
<-446066855_?<-446901953_?<-686449190_?<-446053519_Phage_Orf51<-445944880_DUF3310<-447109974_?<-686369055_?<-686449191_N6-MTase*<-447046428_?<-446178742_?<-686449192_DnaC<-686449193_Phage_rep_org_N<-686449194_AP2<-686449195_DUF968<-447021753_SSB 737532221 <-HNH<-SSB<-?<-?<-HNH<-?<-N6-MTase*<-?<-?<-?<-?<-?<-N6-MTase<-DnaC N6-MTase N6-MTase - 141 bacteria>firmicutes Halobacillus MULTISPECIES: adenine methyltransferase [Halobacillus]. <-737532213_?<-737532215_HNH<-737532216_SSB<-737532218_?<-737532219_?<-737533831_HNH<-737532220_?<-737532221_N6-MTase*<-737532222_?<-737532223_?<-737532225_?<-737532227_?<-737532228_?<-737533832_N6-MTase<-737532230_DnaC 492715347 <-HARE-HTH<-N6-MTase*<-?<-RecU<-?<-?<-N6-MTase N6-MTase N6-MTase - 158 bacteria>firmicutes Clostridiales MULTISPECIES: DNA N-6-adenine-methyltransferase [Clostridiales]. <-495674054_?<-495674055_?<-495674056_?<-490331179_?<-490331178_?<-490331176_?<-490331175_HARE-HTH<-492715347_N6-MTase*<-490331171_?<-490331168_RecU<-490331165_?<-495674061_?<-490331152_N6-MTase<-495674063_?<-495674065_? 500994137 <-N6-MTase* N6-MTase N6-MTase - 140 bacteria>firmicutes Clostridium botulinum DNA N-6-adenine-methyltransferase [Clostridium botulinum]. <-489454419_?<-500994137_N6-MTase* 635344555 <-N6-MTase<-?<-?<-?<-?<-?<-N6-MTase*<-DnaC<-Phage_rep_org_N<-Phage_pRha+ORF6C<-MetJArc||MetJArc-><-small-protein<-small-protein N6-MTase N6-MTase BN981_00304 140 bacteria>firmicutes Halobacillus trueperi phage N-6-adenine-methyltransferase [Halobacillus trueperi]. <-635344548_?<-635344549_N6-MTase<-635344550_?<-635344551_?<-635344552_?<-635344553_?<-635344554_?<-635344555_N6-MTase*<-635344556_DnaC<-635344557_Phage_rep_org_N<-635344558_Phage_pRha+ORF6C<-635344559_MetJArc||635344560_MetJArc-><-635344561_small-protein<-635344562_small-protein 738763505 <-DUF3310<-Prim-Pol+PriCT_1+D5<-?<-?<-?<-?<-N6-MTase* N6-MTase N6-MTase - 136 bacteria>firmicutes Paenibacillus larvae adenine methyltransferase, partial [Paenibacillus larvae]. 
<-738761287_?<-738761289_DUF3310<-738761291_Prim-Pol+PriCT_1+D5<-738761294_?<-738761296_?<-738761298_?<-738761300_?<-738763505_N6-MTase*<-738761302_?<-738761305_?<-738763507_?<-738761307_?<-738761309_?<-738761310_?<-738761313_? 737140426 <-N6-MTase* N6-MTase N6-MTase - 140 bacteria>firmicutes Clostridium tetani adenine methyltransferase [Clostridium tetani]. <-746206773_?<-737140426_N6-MTase*<-737140429_?<-737140432_?<-737140435_? 748203410 small-protein->?->?->?->?->N6-MTase*-> N6-MTase N6-MTase - 140 bacteria>firmicutes Clostridium botulinum adenine methyltransferase [Clostridium botulinum]. 489456872_?->489456871_?->489456870_small-protein->489456868_?->489456866_?->489456865_?->748203409_?->748203410_N6-MTase*->500994144_?->500994146_?->500994147_?->489454419_?->748203411_?->647362810_?->748203413_?-> 752703286 <-N6-MTase* N6-MTase N6-MTase - 140 bacteria>firmicutes Clostridium botulinum adenine methyltransferase [Clostridium botulinum]. <-752703286_N6-MTase* 768719850 <-Phage_portal<-HNH<-Terminase_LS<-N6-MTase* N6-MTase N6-MTase - 140 bacteria>firmicutes Oenococcus oeni adenine methyltransferase [Oenococcus oeni]. <-488908904_?<-488908905_Phage_portal<-768719849_HNH<-488908906_Terminase_LS<-768719850_N6-MTase*<-488908910_? 446374005 <-N6-MTase* N6-MTase N6-MTase - 139 bacteria>firmicutes Staphylococcus aureus hypothetical protein, partial [Staphylococcus aureus]. <-446374005_N6-MTase*<-510795933_? 737533832 <-N6-MTase<-?<-?<-?<-?<-?<-N6-MTase*<-DnaC<-Phage_rep_org_N<-Phage_pRha+ORF6C N6-MTase N6-MTase - 137 bacteria>firmicutes Halobacillus MULTISPECIES: adenine methyltransferase [Halobacillus]. <-737532220_?<-737532221_N6-MTase<-737532222_?<-737532223_?<-737532225_?<-737532227_?<-737532228_?<-737533832_N6-MTase*<-737532230_DnaC<-737533835_Phage_rep_org_N<-737532232_Phage_pRha+ORF6C<-737533836_?<-737532233_?<-737532235_?<-737532237_? 
654100680 <-MOM<-N6-MTase* N6-MTase N6-MTase - 154 bacteria>firmicutes Desulfovirgula thermocuniculi adenine methyltransferase [Desulfovirgula thermocuniculi]. <-654100660_?<-654100664_?<-737119001_?<-737119003_?<-654100676_?<-737119021_?<-737119023_MOM<-654100680_N6-MTase*<-654100684_?<-654100688_?<-737119025_?<-654100700_?<-654100704_?<-654100708_?<-654100715_? 518557238 RecT-Redbeta->SSB->?->?->?->?->?->N6-MTase*->?->?->small-protein->?->small-protein->?->HNH-> N6-MTase N6-MTase - 152 bacteria>actinobacteria Bifidobacterium breve DNA N-6-adenine-methyltransferase [Bifidobacterium breve]. 489927158_RecT-Redbeta->489927162_SSB->489927167_?->489927170_?->489927172_?->489927174_?->489927176_?->518557238_N6-MTase*->489927184_?->489927185_?->489927186_small-protein->489927190_?->489927191_small-protein->489927192_?->489927194_HNH-> 673939868 N6-MTase->?->N6-MTase*->Terminase_LS->HNH->Phage_portal-> N6-MTase N6-MTase Phi93_04 140 viruses>dsdna viruses, no rna stage>caudovirales Lactococcus phage phi93 DNA methylase [Lactococcus phage phi93]. 673939865_?->673939866_N6-MTase->673939867_?->673939868_N6-MTase*->673939869_Terminase_LS->673939870_HNH->673939871_Phage_portal->673939872_?->673939873_?->673939874_?->673939875_?-> # 2; 491783102 <-N6-MTase<-?<-HTH_3+Peptidase_S24||?->Phage_pRha+ANT->?->HTH->N6-MTase*->?->RusA->Phage_antitermQ->?->?->?->KilA-N-> N6-MTase N6-MTase - 174 bacteria>proteobacteria>gammaproteobacteria Actinobacillus pleuropneumoniae DNA methylase [Actinobacillus pleuropneumoniae]. 
<-491783091_N6-MTase<-491783093_?<-491783095_HTH_3+Peptidase_S24||491783098_?->763111857_Phage_pRha+ANT->491783100_?->491783101_HTH->491783102_N6-MTase*->491783105_?->491783106_RusA->491783107_Phage_antitermQ->491805399_?->491805403_?->491783110_?->491783111_KilA-N-> 500173972 <-N6-MTase<-?<-HTH_3+Peptidase_S24||?->Phage_pRha+ANT->Phage_rep_O->HTH->N6-MTase*->?->RusA->Phage_antitermQ->?->Phage_lysozyme-> N6-MTase N6-MTase - 174 bacteria>proteobacteria>gammaproteobacteria Actinobacillus pleuropneumoniae DNA methylase [Actinobacillus pleuropneumoniae]. <-500173966_N6-MTase<-500173967_?<-762512306_HTH_3+Peptidase_S24||500173969_?->500173970_Phage_pRha+ANT->762512559_Phage_rep_O->762512561_HTH->500173972_N6-MTase*->500173973_?->500173974_RusA->500173975_Phage_antitermQ->762512308_?->500173976_Phage_lysozyme->500173977_?->762512310_?-> # 1; 342803448 <-RadC||N6-MTase*->Phage_AlpA->HTH->REase+SFII->METHYLASE-> N6-MTase N6-MTase VII00023_15021 397 bacteria>proteobacteria>gammaproteobacteria Vibrio ichthyoenteri ATCC 700023 putative phage N-6-adenine-methyltransferase [Vibrio ichthyoenteri ATCC 700023]. 342803441_?-><-342803442_?<-342803443_?<-342803444_?<-342803445_?<-342803446_?<-342803447_RadC||342803448_N6-MTase*->342803449_Phage_AlpA->342803450_HTH->342803451_REase+SFII->342803452_METHYLASE->342803453_?->342803454_?-> 584469889 N6-MTase*->McrB->McrC->?->HNH-> N6-MTase N6-MTase VPUCM_1151 247 bacteria>proteobacteria>gammaproteobacteria Vibrio parahaemolyticus UCM-V493 prophage LambdaSo, DNA modification methyltransferase, putative [Vibrio parahaemolyticus UCM-V493]. 584469882_?->584469883_?-><-584469884_?||584469885_?->584469886_?->584469887_?-><-584469888_?||584469889_N6-MTase*->584469890_McrB->584469891_McrC->584469892_?->584469893_HNH->584469894_?-><-584469895_?||584469896_?->
<-Restriction endonuclease------------* *---------------------------------------------------------------------------------------------> Str-1 Str-2 Str-3 Str-4 Str-5 Str-6 Str-7 <---N-terminal RAGNYA-------------------------------------------------------------------------------------------------> < Helical coiled coil-----------><----C-terminal RAGNYA- --------------------------->< c-TERMINAL HELIX OF COILED cOIL--------------------------------------------------------------------------------------------------------------------------------------------------------------> ALIGN -------HHHHHHHHHHHHHHHH-------------------------EEEE------HHHHHH-----H-HHHHHHHHHHHHHH-----EEE---------HHHHH-------HHHHHHH------H-H------HHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHH-----------------HHHHHHHH-----HHHHHHH----EEEE-----------HHHHHHHHHHHHHH------------------HHHEHHHH----------EEE----EEEEEEEEE------EE---------EEEEEEHHH-----------HH-----------E---HHH-HHHHHHHHHEE---------------------------------EEEEEE------------H---HHHHHH--HHHHHHHHHHHH----------EEEEEE------------EEHHH-------HHHHHHHH-----EEEEE-----EEEE---E---EEEHHHH-------HHHHHH-H---------EEE-------HHHHHHHHHHH--------------------------------------------E---HHHHHHHHHHHHHHHHHHHH---------------E----------------HHHHHHHHH---HHHHH----EEEE-------------------EEEEE-----HHHHHHHHHH----HH----------HH-HHHHHHHHHHHHHHH------EE------EEEEEEH--------HHHHHHHHHH------EEEEE---HHHHH-HHHHHHHHHHHHHH--HHEEH-------HHHHHHHH-----HHHHHHH------HH----EEEE----------HHHHHH-------------HHHHHHHH---E-EEEEE---E------HHHH-HHH--HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH------HHHHE---- HMM 
---HHHHHHHHHHHHHHH--EEEE----------HHH--EEE-----EEEEEE-----EEEEEE-----HHHHHHHHH--HHHEEH---EEEE----EEE-EEEHHHHHHE-HHHHHHHHHH----HEEHH---HHHHHHHHHHH---EEE---HHHHHHHHHHHHHHHHHHHHHHHH---EEEE---HHHHHHHHHHH--HHHHHHHHHH-HHEEEEEE----------HHHHHHHHHHHHHHH--HEEE-------HHHHHHHHHHHHH-----EEEEEEEE-HHHHHHHHHHHHH----EEEE------EEEHHHHHHHHH-------HHHHH-------EEEEEEEHHH-HHHHHHHHHHHHH--H---E---E-----EEEE---E-------EEEEEEE---------HHE---EHHHHHHHHHHHHHHHE--HH-HHHHH---EEEEEEE--------EEEEEEEEE-----HHHHHHH--HHHHHHHHHHHHHHHHHHHHHHEEHHHHHHHHHHHHHHHHHHHHHHHHH---E---EEEEE---HHHHHHHHHHHHHH---------HHHHHEE-EHHHHHH------EEEEEE-----HHHHHHHHHHHHHH--HHHHHHHHH---HH-HH------EEEEEE------EEEEEEE----HHHHHHHHHHHHHHHHHHHHHEE---EEE-----EEEEEEEHHHHH-------EEHEEEE----EEEE-H-HHH--HHEHHHEE-HHHHHHHHHH---HHHEE--HHHHHHHHHHHHHHHHH---EEEEEEE-------EEEEEE-HHHHHHHHHHHHHHHHHHHH----EEEEEE----E--HHHHH-EE----H-HHHHH------HH---EEEEE----------HHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHH--EEEE---HHHEHHHHHHHHHHHHHHH--HHHHHH---- FREQ 
----------HHHHHHHHHHH---EEE-------EEEEEE--------EEEE-------EEEEEE-----HHHHHHHHHHHHHHHH--HHHHHH------EEE----------HHHHHHHHHHHHHHHHHHH----EEEEEEEEEEEEE----------HHHHHHHHHHHHHHHHH----------------HHHHHH----HHHHHHHHHHHH------HEEEEHHHH-----HHHHHHHHHHH----EE---E-------HHHHHHHHH---------------HEEEHHEEE--------E---------HEEHHHHHHHH-H-HHHHHHHHH-----H--------E-H-HHHHHHHHHHH------------H-----H--------------EEEEE----------EEEE---E-----------------EEEE--HHH----EEEEEHHHH----EEEE-----EE----HHHHHHHHHH----EEEEE----EEEEEHHHEEEEEEEE------------EEEEEE--------EEE-----------------------------------HH-HHHHHHHH-----EEEEH----HHHHH------------HHHHHHHHHHHHHHHHHHHHHHHH-------EEEEE----------------HHHHHHHH---HHHHHH-HHHHHHHHHHHHH-----HEEE-----------------EEEEE-----------HEEEHHHHHHHHHH------EEEEEE-----EEEEE------HHHHHHHHHH--------EEEEEEHHHHHHHHHHHHHHHHHH---------------HEEHHHHHHHH-----HHHHHHH------H-----HHH--------------EEEEEEE----------HHHHHHHHHH---EEEHHHHHHHHHHHHHHH-HHHHHHHH-----HH-----EEEEEEEEE------E---HHHHHHHHHHH------EEEEE--- PSSM 
-----HHHHHHHHHHHHH---------------HHHHHHH------EEEEEE-------EEEEEE------HHHHHHHHHHHHHH----EEEEEE---EEEEEE--------------------------------HHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHH----------------HHHHHHHH---HHHHHHHHHHHHHH-------------------HHHHHHHHHHH-------------HHHHHHHHHHHHHH--------EEE---HHHHHHHHH-------EEE------HHHHHHHHHHHHHH----HHHHHHH-----HH--EEEEE-HH-HHHHHH---E---------EE-----------------------EEEEEE-----------------------HHHHHHHHHHHHHHHHHHH----EEEEEEE---------------------HHHHHHHHHH----EEEEEE------------EEEEEEEEE----------HHHHHHHHHH----------------HHHHHHHHHHH-------------EEEE------------HHHHHH--------------HHHHHH-----HHHHHHHHHHHHH-------------------EEEEE--------------HHHHH--HHHHHHH-------------------EEEEE-------HHHHHHHH--HHEHHHHHHHHH-H-HHHHEEEHHHHHHH---E-----EEEEE---EEEEEEE--------EEEEEEEE-------EEEEE-E--HHHHHHHHHHHHHHHHHHH-----------EEEEEE--EEEEEE----HHHHHH------HH----EEE----------HHHHHHHHHH-----------HHHHHHHHHHH--EEEEEHHHH--HHHHHHH-HHHHHHHHHHHHHH-----EEHHHEEHHHH---EEEEEE-----HHHHHHH-----E------- FINAL 
------HHHHHHHHHHHHHH----EEE-------HHHHH--------EEEEEE------EEEEEE-----HHHHHHHHHHHHHHHH---EEEEE-----EEEEE---------HHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHH--------------HHHHHHHHH----HHHHHHHHHHHHH-------EEHHHHH-----HHHHHHHHHHH----EE--------HHHHHHHHHHHH---------EEE--HHHHHHHHHH-------EEE-------HHHHHHHHHHHH-----HHHHHHH-----H---EEEEE-HH-HHHHHHHHHHH--------E---------E--------------EEEEEE-----------EE---E-------HHHHHHHHHHHHHHH-------EEEEEEE-------EEEEE---EEE---HHHHHHHHHHH----EEEEE-----EEEE--EEEEEEEEEE-----------HHHHHHH-------EEEE-------HHHHHHHHHH----------------EE--HHHHHHH-----HHHEE-----HHHH---HHHHH-----HHHHHHHHHHHHHHHHHHHHHHHH--------EEEEEEEE-----HHHH---HHHHHHHHHHHHHHHH---HHHHHHHHHHH----EEEEEE------HHHH-------EEEEEE-----------EEEHHHHHHHHHH-------EEEEE----EEEEEE--------HHHEEEEE-------EEEEEEEHHHHHHHHHHHHHHHHHHHH----EEE-----EEEEEE--EEEEE----HHHHHHH------HH---EEEEE----------HHHHHHHHH----------HHHHHHHHHHHH--EEEEHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH----HHHHHHEEEEE---EEEEE--HHHHHHHHHHH-----EEEEE--- RFI_15181_Reticulomyxa_filosa_569404634 
MKYSDLVERDIEAAIDNELRYLKWNDDSKDINSCNRNKKKLLGGKRPDYILYKDNSDEPIAIIEAKKPYEDINKAQQQGIEYAKILNAPVVFATDGIYTKTYHIKKQANLTLNNEEVDDLLNQSTLLNFLQDNIYDSIDKIRLNRQKGFTIKRGINIYIQNYLFSNILFLKVISELAEMNDCTIALPPKDYLWDNFKIKKGLDLVDFLNKQAFDYFKKSYGGKVLSKIEILSGKERILNDIITNLDDLWLS---DTNTDIKGDAFEYFLRNYGGAETDFGEYFTPRHIVKTMVKLLNPKFGEKIYDGFCGTGGMLTESFKYIKRRMPLNPNTIRYL-----KNETVFGGEFST-IFRIAKMNMILAGDGHSNIARQDS-----YEKKQTNK-------LDVVITNIPF-GNKMKTDY---LSQYGYNGKSAEICGVLHCLDALNNQNENARAGIIVPEEEAKQKQNHIWYFDLQNDGYALNKARTKIKGQNDIDVLLSEASLNIDEIERLKRINFDVLYKNKVRNNKYVLLANQYKEQVID---NFAFQEFSLQELEEIKHIEFKKGNALSKTEVENDGIYECILYGELYTKYNNPFIDKVYSTTNVKGKILSNYGDVQNKINPQYLSLVFNYTLKNELAKYARGANILHLSNDNIKKIKIPLPPLEEQQKIVEEIDSYQKVIDGAKQIIDNWNPSFEVREGCEIRNLGDITKLVRGPFGGSLKKEIFVESGFKVYEQSNCIKNDVKIGNYYITESKYKEMIRFSVQENDILMSCSGTIGKVLLLNNNFEKGIINQALLKVTPAQSSKKKIVEQIENERKIVESSKQLIKLQEEKIKNKINLFLLNSNGIDARDYSSNLSKNDTELKAKKK------NLGQDDFIIDNKSAENIISQLEEVNKLCINNCGLASQNHEFATQILNNLFTNVNLVMGDIWNAYANYSNNL-ISSKNLVDMNIANEQLADNFLNIAKQVQLINAQVMLDCCNELVAPSLKQAASCSEKIAKKINSN--------------------------------------------------------------------------------- RFI_34923_Reticulomyxa_filosa_569334903 
---------------------------------------------------------------------------------------------------------KKTTGFLDKELIREILDKNNPC-IKEHVVPDEVYYQKAIKINEILHNGSINKNNRARVVATMLLALLKDNYINRENNCFS------MINELNSRAE----EILHEKDKRGFIHCIKISVPPTPDNHIKFRKALLEAVQELDSINIRSAMNSGTDILGRFYEQFL-KYGNGSKEIGIVLTPRHIARFAVDVLNITNKDKVLDPACGTGGFLVSAFDKVKSEVDK-----EEL--EKFKTEGIYGIEQDPEVVALALVNMIFRGDGRANLEEGNC-----FTSKK-----FIDLKVSKVLMNPPF-ALKKSDEK-----EYKFIDF------------ALNKMEKGGYLFAVIPSS-------------VMFRSKNFKDWRVKMLENNTLKAVIKLPDALFYPVS---VCTSAIIIQKGVKHDKNAN--------------GVMINSQKNNNIEEIRNALASHLNSIKLGQSKQQFIQKPIDFDKYLECSSEAYLED---KDYSKDEI-----EAQAKIPIDFY-FKIQKCQSGNLENYPTG-DIPFVSNTSLNNGVVKYVVLSEEKELIKNVPCI--AINGFGFATIQTHPFIGSGNGG-----GYVSALIPKKEMTMLELAYYAG---QLNLKSWCFSYGRRAVKHRLSAIKLSEFKKESINPNILSNIKNDLVSEIDSFVKKINSKSDTTWLIMNELKRRGYELFYYIPTNLIQVNGKILAIGNFIKIKKYQPMVYEIGKRQTLNLEDASAILIRQNPPLNMEYLTSTYLLEVIKHKVLILNNPSQIRNCPEKLFVTNFPQFCPPTIIASNYNHEVKKFITSHKEVVIKPIYDFGGSYIKKISLRSKNIKEIIKKYQLKFGNFIIQKFLPFVIEGDKRIILLDGEILGAIKRIPKAGDFRANLVVGGKAAKVEITKNDLVICRTLRPELKRRGLMIAGIDIIGNNLIEINVTSPTGLVAINKLYNQKSEVYVVNAIERKLKDAIKTYG RFI_21063_Reticulomyxa_filosa_569384171 
------------------------------------------------------------------------------------------------------------------------------------------------MKHNVLLQSCLCGRFSE--FANILFLKLLSEGNEKS-----------WWSDIK------------SQSNDY-------------------------IINEIDPLVLS---SIDSDIKGDAFEYFLEKTTSTENDLGEYFTPRNVVKTIINLVDPKFKETVYDPFCGTGGFLTESFNYIKENNIIEGE--EDL--KRLKQETIYGREVTA-TARIAKMNMILHGDGHAGIQQINTLSNPDYIEKKGGKWIFVKLQINQVIKRMPFIIQRIQTDRGQEFFAYNVQEKLKEYKIKFRPIKPASPHLNGKYSHLMDKLHTWEK---------YYNKGRPHSALQGKTPWEKYKELEPQIP--TIEEVH----LNYEV---------------------SQE---NFVPQSYNVH--KNIQDI--KRGKSYN--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------<----ATP grasp in sequence above ------------------------------------------------------------------------------------>------------------------------------------------------------------------------- RFI_02175_Reticulomyxa_filosa_569435738 
-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------DKVLDPACGTGGFLVSAFDKVKSEVDK-----EEL--EKFKTEGIYGIEQDPEVVALALVNMIFRGDGRANLEEGNC-----FTSKK-----FIDLKVSKVLMNPPF-ALKKSDEK-----EYKFIDF------------ALNKMEKGGYLFAVIPSS-------------VMFRSKNFKDWRVKMLENNTLKAVIKLPDALFYPVS---VCTSAIIIQKGVKHDKNANVLWGWLKDGFVKKKGVMINSQKNNNIEEIRNALASHLNSIKLGQSKQQFIQKPIDFDKYLECSPEAYLED---KDYSKDEI-----EAQAKIVLQNL-ISFKLCSQ-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- RFI_38478_Reticulomyxa_filosa_569312235 
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MYLNVSNPKNETVFGGEVST-IFRIAKMNMILAGDGHSNIARQDS-----YEKKQTNK-------LDVVITNIPF-GNKMKTDY---LSQYGYNGKSAEICGVLHCLDALNNQNENARAGIIVPEGI------------LFNGNKAYTQLRRDLVEKYSLENVVSLPKRTFVDVG---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- consensus/100% 
.............................................................................................................................................................................................................................................................................................................................................b.L.....KpEslaG.E.ss..h.lAbhNMIh.GDG+usl.b.ss.....a.pKb..........ls.Vl.p.PF...+bps-b......Y.h..b............shs...ps.b..hh.................hb..sbs.p.hb.c...p..bc...pbs...h..l...........................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................Back to Contents
# 1; Eukaryotic versions
569312235 N6-MTase N6_Mtase RFI_38478 146 eukaryota>rhizaria Reticulomyxa filosa restriction-modification protein, partial [Reticulomyxa filosa].
569334903 N6-MTase+ATP-grasp! N6_Mtase+Methyltransf_26+Eco57I+GSH-S_N+GSH-S_ATP RFI_34923 892 eukaryota>rhizaria Reticulomyxa filosa N-6 DNA methylase, partial [Reticulomyxa filosa].
569384171 N6-MTase+Reverse-transcriptase N6_Mtase+rve+rve_3 RFI_21063 326 eukaryota>rhizaria Reticulomyxa filosa Type I restriction-modification system methyltransferase subunit [Reticulomyxa filosa].
569404634 REase+N6-MTase+RAGNYA+helix+RAGNYA+helix HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S RFI_15181 978 eukaryota>rhizaria Reticulomyxa filosa type I restriction-modification system methyltransferase subunit [Reticulomyxa filosa].
569435738 N6-MTase N6_Mtase RFI_02175 275 eukaryota>rhizaria Reticulomyxa filosa N-6 DNA methylase, partial [Reticulomyxa filosa].
# 1; Prokaryotic homologs
345528927 REase+N6-MTase+RAGNYA+helix HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S FBFL15_0854 748 bacteria>bacteroidetes Flavobacterium branchiophilum FL-15 Probable type I modification methyltransferase [Flavobacterium branchiophilum FL-15].
495892824 REase+N6-MTase+RAGNYA+helix HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S - 977 bacteria>bacteroidetes Paraprevotella clara restriction endonuclease subunit M [Paraprevotella clara].
91069778 HSDR_N_2+N6_Mtase+Methyltransf_26+Eco57I RBE_1419 517 bacteria>proteobacteria>alphaproteobacteria Rickettsia bellii RML369-C Type I restriction-modification system methyltransferase subunit [Rickettsia bellii RML369-C].
746565283 REase+N6-MTase+RAGNYA+helix HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S - 790 bacteria>proteobacteria>alphaproteobacteria Rickettsia felis hypothetical protein [Rickettsia felis].
67005333 N6-MTase N6_Mtase+Methyltransf_26 RF_p07 332 bacteria>proteobacteria>alphaproteobacteria Rickettsia felis URRWXCal2 Type I restriction-modification system methyltransferase subunit (plasmid) [Rickettsia felis URRWXCal2].
827052825 REase+N6-MTase HSDR_N_2+N6_Mtase+Methyltransf_26 MRECE_1c072 584 bacteria>tenericutes Mycoplasmataceae bacterium CE_OT135 type I restriction endonuclease subunit M [Mycoplasmataceae bacterium CE_OT135].
823691316 N6-MTase N6_Mtase+Methyltransf_26 - 576 bacteria>spirochaetes Brachyspira hyodysenteriae hypothetical protein, partial [Brachyspira hyodysenteriae].
763152770 REase+N6-MTase HSDR_N_2+N6_Mtase+Methyltransf_26 - 531 bacteria>bacteroidetes Flavobacterium branchiophilum hypothetical protein, partial [Flavobacterium branchiophilum].
490962086 REase+N6-MTase+RAGNYA+helix+RAGNYA+helix HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S+Methylase_S - 983 bacteria>firmicutes Peptoniphilus lacrimalis restriction endonuclease subunit M [Peptoniphilus lacrimalis].
480765781 REase+N6-MTase+RAGNYA+helix+RAGNYA+helix HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S+Methylase_S HMPREF1083_02457 968 bacteria>firmicutes [Clostridium] clostridioforme 90A6 hypothetical protein HMPREF1083_02457 [[Clostridium] clostridioforme 90A6].
740438970 REase+N6-MTase+RAGNYA+helix+RAGNYA+helix HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S+Methylase_S - 966 bacteria>firmicutes [Clostridium] clostridioforme restriction endonuclease subunit M [[Clostridium] clostridioforme].
546935999 REase+N6-MTase+RAGNYA+helix+RAGNYA+helix HSDR_N_2+N6_Mtase+Methyltransf_26+Methylase_S+Methylase_S - 981 bacteria>firmicutes Clostridium sp. CAG:81 type I restriction modification DNA specificity domain protein [Clostridium sp. CAG:81].
<-----Methylase core---- Str-1 Str-2 Str-3 Str-4 Str-5 Str-6 Str-7 <------------------------------------------------------------------------------------------------------TaqIC-----------------------------------------------------------------------------------------------------------------------------------> RES MFEQSDYSRYQQIKSQTQRWHHDKTLYHSDCVVASIQHEAIRQAKLQHAANDHAYEAFFLRWLFVYICFMQGRFGGLGSGKRKQQVHSGGVA-------------NRMTKTHDLESPTDNHKRRRPNDD-EDA---------ISEEEINAIVAILGNQLLIVT-----------------------KS-IMVEKE---SYTPSAAVTFLLRRMDVVDAQD---------------------------------------------------------------------------------------------------------------------ARSTEIYCLMTAFMTVYLCFV-------------------RVLIQQ-------PE----FDHFMQLFNHHDSLLNQ----DAQP--------------------------FLWFYALHHM------PIVDDLHKQLSTSTK------QCAMANIDSIFSKFYTQYFLEMSAAKHQKDHGQYY|TPRSVLRFMWDRCATV-----SHLIQLLQQQQTG------------------------MCRVFD-PCLGIGSFLCEFLTRFTKAC--RF----TVWSDPQRLTQLLLQDIPDHIFGIEIDPFAYQLCKMNMMVHLYPLYQRLCELGVQLPP------------Q--SIHRFKLFCNDTLKLNVESNPFRNAETVD---PFEKHWLDQLRDTCKLKFDFIVTNPPYMIRKTGFITQPDPAIYDESRLGGAKL------------------------------------------------------------------------------------------SQAYLYFMWIALQRCDDTQGQVCLITPSQWIVLEFAQQLRT-------WIWEHCKLLDIYEF----EPFKVWPKVQTDSLIFRI------CKRTSSLPN-------------SN-HTLYLRHVGKNMTLMHLLDIYRHFRP-DQQLLCTDNAL---------KYKHTPLTEHNRQLKTKH-------SSFSFLLPSVSFLDQLESMTQHLGRICDTDPAAQ----KANTA-APLIW-NRGPNTNPVYSLV-VRTAWARVTFGKETCDRWLKPCFYWNGKTI--SSATGG--GKEGEFWRH-RDPLRLGKKETSAAEAYLPY--------------CGVDVP-FYSMILVNREDADRLKEDFNNK-GPWSALYLYLH--DARVALQADKKEED----------------IANCQYNKCGL-VPVKIIHPINCGYFTRSQPRPRFFIDRQEMAVTNQ-CIYFTIKPDYPW----Q--DPDYYCGLLNSTLIMFFIKLHCSYDQQGRMRFFGRLMAYVPFAPPPSLEFMQQ--VATLVQGVTLARSCLYPFLHYCKGGQR----LLERVRN----FEWHLTSIES-----------------------------------------------GIVRQFE-----PPADWRQGISTNTAELHWIIDFIHT-----LNK-DNAHDIFIALLKLNSLFQLAIDQMIYHLYRIPQALQLEIEHDLKLDNLRQEW-PHVS-LQIPNEEEHKSSTNISVWYQSTLSMAKSFIDLSNE FINAL 
------------------------------------HHHHHHHHH--------------------------------------EE--------------------------------------------------------------HHHHHHHHHHHHHHHH-----------------------HH-HHHHHH---H-----HHHHHHHHHHHH----------------------------------------------------------------------------------------------------------------------------EEEEEEEE-HHHHHHHHH-------------------HHH----------------HHHHHHHH--------------------------------------------EEHHHHHHH------HHHHHHHHHHHH--H------HHHHHHHHHHHHHHHHHHHHHHHHHHH--------|--HHHHHHHHHHHH--------HHHH---------------------------------EEEE--------HHHHHHHHHHHHHH--HH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------H--HHHHHHHHHHH-EEE------HHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH---EEEE----E--HHHHHHHHHHHHH---------------------------------------------------------------------------------------------HHHHHHHHHHHHHH------EEEEEEHHHHHHHHHHHHHHH-------HHHHHHHHHHHEE-----------EEEHHHHHHHHH------HHH-----H-------------HH-HHEEEEE------HHHHHHHHH-----HHHHH----HH---------HH---------------H-------HHHHHHHHHHHHHHHHHHHHHH-----------------HHHH-HHHHH-HHHHH---HHHHH-HHHHHHHHH---HHHHHHHHHHHEE-------EEEE------EEEEEE----EEEEEEEE---HHHHHHH--------------HHHHHH-HHHHH------EEEEEEE-------HHHHHHHHH--HHHHHHHH-----E----------------E-HHHHHHHHH-HHHHHHH-----EEEE-----EEEEE---EEEEE-----EEE-----------------EEEEEE-HHHHH-------------EEEHHHHHEE-----------HHH--HHHHHHHHHHH--------------------HHHHHHH----H--------------------------------------------------------HHHHHHH-------HHHHH----HHHHHHH-------------------HHHHHHHHH------HHHHHHHHHH----HHHHHHHHHHHHHHHHHHHH----H-HHHHHHHH------HHHHHHHHHH----------- ALIGN 
---------------------------------HHHHHHHHHHH----------HHHHHHHHH--EEE------------------------------------------------------------------------------HHHHHHHHH--HHHHHH-----------------------HH-HHHHH---------HHHHHHHHH--------------------------------------------------------------------------------------------------------------------------------HHHHHHHHHHHHHHHHHH-------------------HHHHH--------------HHHHEHH--------------------------------------------HHHHHHHHHH------HHHHHHHHHHHH----------EEEE--HHHHHHHHHHHHHHHHHHHHH-------|-----HEEHHHHHH---------HHHHHHHH-----------------------------------------HHHHHHHHHHHHH--HH----HH---HHHHHHHHHH---------H--HHHHHHHHHHHHHH--HHHHHHHHHHH-------------------HHHHHHHH------EE----------------HHHHHHHHHH--------EEEE----EEEE----------HHHHHHHH----H------------------------------------------------------------------------------------------HHHHHHHHHHHHHH-------EEEE---HHHHHHHHHHHHH-------HHHHHHHHHHHHHH--------E-------HHHHHH------HH--------------------HH-HHHHHHHH-----HHHHHHHH----------------H---------HH-------------------------EEEE-----HHHHHHHHH--------------------------EEE----------EEEE-EEHHHHHH--------------EEE-------------------EEE-----HHHHHH-----H---------------------------EEEEEEE----HHHHHHH------HHHHHHHHHH--HHHHHHH------E----------------EEEEE-------EEEEEEE---------------EEE----EEEE---EEEEEE-----------------HHHHHHHHHHHHHHHHHHH-----HHHHHHHH-------------HHHH--HHHHHHHHHHHHHHHHHHHHH-----H----HEHHHHH----H-HHH---HH-----------------------------------------------HHHHH-----------------HHHHHHHHHHHHHHH-----------HHHHHHHHHHHH-HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH-------------------------HHHHHH----------- HMM 
--HHHHHHHHHHHH-HH-HHHHHHHHH---EEEE--H-HHHHHHHHHHHHHHHHH-HEE---HHHHHHHHHHH--HHEEEEEHHHHHHHHHH-------------HHHHHHHHHHHHHHHHH--------HEE---------EEEEEEH--EEEE------E-----------------------------HHHH---HHHHHH------EEEEEEEE------------------------------------------------------------------------------------------------------------------------EEEE--EEEE---HHHHHHH-------------------HHH----------HH----HHHHHHHHHHHHH----------EE--------------------------EEEE-HHHHH------HHHHHHHHHHHHHHH------HHHHHHHH---EEEEEE--HHHHHHHHH-------|--HHHHHHHHHHHHHH-----HHHHHHHHHH----------------------------HHEHH-HH----HHHHHHHHHHHHH----E----EEE--HHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------------H--HHHHHHHHHHH-EEEE-----------HH---HHHHHHHHH------EEEEEEEE---EEEEE-EEEE----HHHHHHHHH---H------------------------------------------------------------------------------------------HHHHHHHHHHHHHH------EEEEE--HHHHHHHHHHHHHH-------HHHHHHHHHHHEE--------EEEEEE--HHHHHHH------HH-----------------------EEEEEEE------HHHHHHHHH-------------------------EEEE----------------------EEEEEE--HHHHHHHHHHHH--HHH------------------EEEEE-E-------EEEEE-HHHHHHHHH--HHHHHHEEEEEEEE---EE--EEE-------EEEEEE----EEEEEE-----HHHHHHH--------------HHH----EEEEEEEHHHHHHHHHH--------HHHHHHHHH--HHHHHHH-----EE----------------EEEE----EEE-EEEEEEEEE---EEEE----EEEEEE---EEEEEE-EEEEEE--------------HHHHHHEHHHHHHHHHHHH--------EEEEHHHHHHH--------H--HHH--HHHHHHHHHHHHHHHHHHHHHHH---H----EEHHHHH----HHEE-----H-----------------------------------------------HHHH----------------H-HHHHHHHHHHHHHH--------H-HHHHHHHHHHHHHH-HHHHHHHHHHHHHH----HHHHHHHH---HHHHHHH--------E----------HHHHHHHHHHHHHHHHHH----- FREQ 
----------------------------------------H-H---------------------------------------------------------------------------------------------------------EEEEEEE--HEEHHH-----------------------HH-HHHHHH---------HHHHHHHHHHH-----------------------------------------------------------------------------------------------------------------------------EEEEEEEHEEEEHHHHHH-------------------HEEE---------------HH-HHEE---------------------------------------------EEEEEE---------HHHHHHHHHH-----------HHEHHHHHHHHHHHHHHHHHHHHHHHH--------|---EEEEEEEEEE---------EEE----------------------------------EEEE---------HHHHHHHHHHHHH--HH----HHHHHH------HHHHHHHHHHHHHHHH---HHHHHHHHHHHHH-HHHHHHHHHHHHH------------H--HH-HHHHHHHH-HHE------HHHHHHHH---HHHHHHHH---HHHHHHHHHHH------------HHHHHHHHHHHHHHHH---------------------------------------------------------------------------------------------HHHHHHHHHHHHH------HH-HHHHHHHHHHHHHH---HH-------HHHHHHHHHHHHH------------HHHHHHHHHHH------HHHH---HH-------------HH-HHHHHHHH-----HHHHHHHHH-----HHHHH---HHH---------HHHHHHEE-------HHH-------HHHHHHHHHHHHH---HHHHHHH-HHEE----------HHHHH-HHHHH-HHHHH----HHHH-HHHHHHHH-----HHHHHHHHHHHHH-------HHHHH--HHHHHHH-----HHHHHHHHH---HHHHHH--------------HHHHHH-HHHHH-----EEEEEE---------HHHHHHHHH--HHHHHHHH-----------------------HHHHHHHHHH-HHHHHHH-----EEEE------EEEE--HHHHH-------EEE--------------E-EEEEEEE------------------EEEEEEEEEEE------------H--HHHHHHHHH--------E--------------EEEEEE----E----------------------------------------------------------HEEEE-----HHHHHH----------------------------------EEEE----------------------HHH-HHHHHHHHH-----HH-HHHH-HHHHHEEE------HHHHHHHHH------EEE--- PSSM 
---------E-----------------------HHHHHHHHHHHHH----HHH-HHHH--------------------------EE----------------------------------------------------------HHHHHHHHHHHHHHHHHHH-----------------------HH-HHH------------HHHHHHHHHHHHHHH--------------------------------------------------------------------------------------------------------------------------EEEEEHHHHHHHHHHHH-------------------HHH----------------HHHHHHH---------------------------------------------HHHHHHHHH------HHHHHHHHHHHH--H------HHHHHHHHHHHHHHHHHHHHHH-HHHH--------|---HHHHHHHHHHH-----------------------------------------------EEE-------HHHHHHHHHHHHHH--HH----HH------HHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH-HHHHH--H-------------------------HEEH----------------------HHHHHHHHHHHHHHHHHHHHHHH--HHHHH--------HHHHHHHHHHH----------------------------------------------------------------------------------------------HHHHHHHHHHHHHH------EEEEEE--HHH--HHHHHHHH-------HHHH----EEEEE-----------E-----HHHEEE------E------------------------EEEEEEE------HHHHHHH---------------------------EEE-----------------------EEE------HHHHHHHHHHHH-------------------------EE-E-------HHHHH-HHHHHHHH---EEEE-------------EE--EEEEE----EEEEEEE-----EEEEEEE---HHHHHHH--------------HHHHHH-HHHHHH-----EEEEEEE-------EEEEEHHHH--HHHHHHH-------------------------HHHHHHHHH-HHHHHH-------EE------EEEEE---EEE----------------------------EEEE---HHHHHH----------HHHHHHHHHH------------HHH--HHHHHHHHHHH--------------------HHHHHHH----H--------------------------------------------------------HHHHHH-----------------HHHHHHH-------------------HHHHHHHHHH------------H--------HHHHHHHHHHHHHHHHHH--------------------HHHHEH-HH------------ MAM1_0525c10839_Mucor_ambiguus_758346042 
MFEQSDYSRYQQIKSQTQRWHHDKTLYHSDCVVASIQHEAIRQAKLQHAANDHAYEAFFLRWLFVYICFMQGRFGGLGSGKRKQQVHSGGVA-------------NRMTKTHDLESPTDNHKRRRPNDD-EDA---------ISEEEINAIVAILGNQLLIVT-----------------------KS-IMVEKE---SYTPSAAVTFLLRRMDVVDAQD---------------------------------------------------------------------------------------------------------------------ARSTEIYCLMTAFMTVYLCFV-------------------RVLIQQ-------PE----FDHFMQLFNHHDSLLNQ----DAQP--------------------------FLWFYALHHM------PIVDDLHKQLSTSTK------QCAMANIDSIFSKFYTQYFLEMSAAKHQKDHGQYY|TPRSVLRFMWDRCATV-----SHLIQLLQQQQTG------------------------MCRVFD-PCLGIGSFLCEFLTRFTKAC--RF----TVWSDPQRLTQLLLQDIPDHIFGIEIDPFAYQLCKMNMMVHLYPLYQRLCELGVQLPP------------Q--SIHRFKLFCNDTLKLNVESNPFRNAETVD---PFEKHWLDQLRDTCKLKFDFIVTNPPYMIRKTGFITQPDPAIYDESRLGGAKL------------------------------------------------------------------------------------------SQAYLYFMWIALQRCDDTQGQVCLITPSQWIVLEFAQQLRT-------WIWEHCKLLDIYEF----EPFKVWPKVQTDSLIFRI------CKRTSSLPN-------------SN-HTLYLRHVGKNMTLMHLLDIYRHF---RP-DQQLLCTDNAL------KYKHTPLTEHNRQLKTKH-------SSFSFLLPSVSFLDQLESMTQHLGRICDTDPAAQ----KANTA-APLIW-NRGPNTNPVYSLV-VRTAWARVTFGKETCDRWLKPCFYWNGKTI--SSATGG--GKEGEFWRH-RDPLRLGKKETSAAEAYLPY--------------CGVDVP-FYSMILVNREDADRLKEDFNNK-GPWSALYLYLH--DARVALQADKKEED----------------IANCQYNKCGL-VPVKIIHPINCGYFTRSQPRPRFFIDRQEMAVTNQ-CIYFTIKPDYPW----Q--DPDYYCGLLNSTLIMFFIKLHCSYDQQGRMRFFGRLMAYVPFAPPPSLEFMQQ--VATLVQGVTLARSCLYPFLHYCKGGQR----LLERVRN----FEWHLTSIES-----------------------------------------------GIVRQFE-----PPADWRQGISTNTAELHWIIDFIHT-----LNK-DNAHDIFIALLKLNSLFQLAIDQMIYHLYRIPQALQLEIEHDLKLDNLRQEW-PHVS-LQIPNEEEHKSSTNISVWYQSTLSMAKSFIDLSNE PARPA_06902.1_scaffold_25125_Parasitella_parasitica_758364669 
------------------------------------------------------------------------------------------MT-------------HKLVSAPPIPATVKN-KRQRPN---EKP---------LGEEDIHVIVTMLGNQLLNVK-----------------------KL-IMVEKD---TYTPSPAVTFLLRRMDV-DTQD---------------------------------------------------------------------------------------------------------------------ARSNEIYCLMTAFMTVYLCFI-------------------RLLIRQ-------QD----FDDFMKLFNDHDSLLNQ----ENQP--------------------------FLWFYAVHHM------SVEDDLHKQLAAAGRNVHSYSHLILADIDSIFSKFYTHYFLEISMAKHQKDHGQFY|TPQAVLRFMWDKCADL-----QHLVRQLQQK--S------------------------LCSVFD-PCLGIGSFICEFLTRLIKAC--QF----TVWDDPQQLAHLLLHDIPDHIYGIEIDPFAYQLCKLNMMVHLFPLYRRINELQVRLPP------------K--SIHRLRLFCNDTLKLTVESNPFWNTGSVD---PFEKHWLDQLRDASKLKFDFVVTNPPYMIRKTGFVTQPDPAIYDESKLGGAKI------------------------------------------------------------------------------------------SQAYLYFMWVALQRCDDANGQVCLITPSQWMVLEFAEQLRN-------WIWSNCKLLDIYEF----EPYKVWPKVQTDSLIFRL------CKRSSTMLN-------------SN-HTLYLRHVGRNTNLIQLLDIYQNF---RP-GQPS--VDLSL------KYKLTAFTQENRVLQTNH-------SSFSFLLPSVSFLDQLNSITQHLGRICDSDLSSS----S---K-APLVW-NRGPNTNPVYSLV-VRTRWAIETFGQETCDRWLKPCFYWNGKTI--YSATGG--GKEGAFWKD-RDPCRLDKKETSAAEAYLPY--------------YNANVP-FYSMIMVNKEDAEKLKENHANN-GTWSALYSYLR--DARIALQADKNEED----------------IANCQYSKSGL-VPVKIIHPINCGYFTRSQPRPRFFVDKREMTVTNQ-CIYFTIKPDYPW----Q--DPDYFCGLLNSTLLMFFIKLHCSYDQQGRMRFFGRLMAYIPFAPPPSVDFMRQ--VAGFVQAVTMARSCLYIIIRYSEGGQK----LIERVRN----FEWHLTQEEL-----------------------------------------------AILGHFQ-----PPLDYKDSLS-NVTQLGWVIELFKK-----ANALENTQDAFIVMLKLNSLFQLAIDQMIYHLYKIPESLQLEIEHDLKLDNVRQEW---LA-YRIPTLTN---SIILEEWYSSILSIAKSLV----- HMPREF1544_04357_Mucor_circinelloides_f_circinelloides_1006PhL_511007562 
-----------------------------------------------------------------------------------------------------------MTTTPYPDSPIANCKRSRPNDDDEEA---------INQEEINAIVNTLGNQLLIVK-----------------------KS-IMIEKE---SYTPSAAVTFLLRRMDVVDSQD---------------------------------------------------------------------------------------------------------------------ARSTEIYCLMTAFMTVYLCFI-------------------QVLIPQ-------TD----FDRFMQLFNDHDSLLNQ----ETQP--------------------------FLWYYAVHHM------SIVDDLHKQLSTSTK------HLVMANLDSIFSKFYTHYFLEISAAKHQKDHGQYY|TPRSVLRFMWDRCATV-----PHLIQLLQQRQTG------------------------ICRVFD-PCLGIGSFLCEFLTRFIKAC--RF----TIWNDPQRLTELLLQDIPDHIFGIEIDPFAYQLCKMNMMVHLYPLYQRVCELGIQLPP------------R--SIHRLRLFCNDTLKLKVESNPFWNSDNVD---QFEKHWLDQLRDACNLKFDFIVTNPPYMIRKTGFITQPDPAIYDESRLGGAKL------------------------------------------------------------------------------------------SQAYLYFMWIALQRCDDTNGQVCLITPSQWMVLEFAEQLRQNLDLGSLQIARHLRVRTIQSMAQSANRFTYFPPMQTHKSTTKI--------RSHIIPS-------------PY--------------------IYQNF---RP-DQPP--TDSSL------KYKHTPLTDHTTSLKTKH-------SSFSFLLPSVSFLDRLESITQHLGRICDIDPAKL----N---T-APLIW-NRGPNTNPVYSLV-VRTDWAIKTLGQETCDRWLKPCFYWNGKTI--SSTTGG--GKEGEFWKS-RDPVRLSKKETSAAEAYVPY--------------YGVGVP-RYSMILVNKEDADKLKENFNNN-GAWSALYLYLR--DARVALQADKKEED----------------IANCQYNKCGV-VPVKIVHPINCGYFTRSQPRPRFFIDKHQMAVTNQ-CIYFTIKPDCPW----Q--DPDYYCGLLNSTLAMFFIKLHCSYDQQGRMRFFGRLMAHVPFAPPPSTEFMQQ--VATFVQGVTLARSCLYPFLRYCKGGQR----LLERVRN----FEWHLTAMES-----------------------------------------------DIVRQFE-----PPANWTEAISCNTAELEWIIDLIHT-----VNQ-DNALNVFIALLKLNSLFQLAVDQMIYHLYRIPQALQSEIEHDLKLDNLRQEWGPNFS-LHIPSDVD--APTNMAAWIQSTLSMAKSFIDS--- RMCBS344292_07627_Rhizopus_microsporus_729710234 
-------------------------------------------------------------------------------------------------------------------MTITNEQTTTTSGK-VND---------LGERETTTVVNILGLSLVNIL-----------------------QD-IRNSQEKETCASSSETAAFLAQR----SLDD---------------------------------------------------------------------------------------------------------------------KDGLYIHCLMTGFMAVYLTFI-------------------ELVLSD-------S-----FHDFIRLFSPHDSLL-------STP--------------------------FVWYYFKHHV------SLLYHLRIQLDHL--------DISITSIDTIISKFYTHYFLETSAQKHQKDHGQYY|TPKPVIQFMWEKVIAS-----RPLLVHC-----G------------------------IPRIFD-PCLGIGSFLCEYIHRLIEQC--RQ----YVWNDAERLAKLLTQDIPESIWGVEIDPFTYHLCKLNMMVHLFPIYQRLNELQLSLPP------------H--SINRLRLFCNDTLTLRCDN------GNQD---AFEKDCLNLLRDPSKLKFHYIVTNPPYMIRKTGFITQPDPTLYDLQTLGG-RG------------------------------------------------------------------------------------------TQAYVYFMWAALQRIDDQLGQVCLITPSQWTILEFAQHFRE-------WILMNCKLLDMYEF----EPYKIWPKVQTDSLIFRI------CKRTSILDH-------------FE-YTLYLRNKARNTPLADLLQQYRDF---NP-LTN---CNPQL------QFRYSFCLKSCKGM-----------ASFAFILPTTSCLDELNRLTGHLPRLCDGEGKRD----G---Q-APLVW-HRGPNTNPVYALV-VRTAWARHRFGDKVCQQWLKPCFYWNGKSG--YASTGG--GKEGEFWKT-RDPLRLIKKEASAAEAYWPY------------RSYDEKES-FYSLIMINREDADYLRVQSTED-ESYAVLYNYLH--EARLSLQANKNDKD----------------IVYCQYSKCGTEYPVKIVHPINCGYYSRTQPRQRFFIDTTQIAVTNQ-CIYFTIQPSTPW----K--DYDYFCGILNCTLLQYFTKVYCSYDQQGRMRFFGRSMATIPFAPPPSARFMHE--LALFVQSITFTRTWLYTFIRHTHSGQR----LMERVRS----YEWHLDEADK-----------------------------------------------AALSHYDTFDIRPDADLA-SFS-YWQSIQWIDDFVQR-----KR--GDAFHCFVVLLKIASLFQFAIDQMAYYAYGIPLHLQLEVEKELQLISQRREW----T-MQIRELEY-----NEENWSSLIINTAKSLVD---- RMATCC62417_07972_Rhizopus_microsporus_727145438 
-------------------------------------------------------------------------------------------------------------------MTITNEQTTTTSGR-VND---------LGERETTTVVNILGLSLVNIL-----------------------QD-IRNSQEKETCASSSETAAFLAQR----SLDD---------------------------------------------------------------------------------------------------------------------KDGLYIHCLMTGFMAVYLTFI-------------------ELVLSD-------S-----FHDFIRLFSPHDSLL-------STP--------------------------FVWYYFKHHV------SLLYHLRIQLDHL--------DISITSIDTIISKFYTHYFLETSAQKHQKDHGQYY|TPKPVIQFMWEKVIAS-----RPLLVHC-----G------------------------IPRIFD-PCLGIGSFLCEYIHRLIEQC--RQ----NVWNDAERLAKLLTQDIPESIWGVEIDPFTYHLCKLNMMVHLFPIYQRLNELQLSLPS------------H--SINRLRLFCNDTLTLRCDN------GNQD---AFEKDCLNLLRDPSKLKFHYIVTNPPYMIRKTGFITQPDPTLYDLQTLGG-RG------------------------------------------------------------------------------------------TQAYVYFMWAALQRIDDQLGQVCLITPSQWTILEFAQHFRE-------WILMNCKLLDMYEF----EPYKIWPKVQTDSLIFRI------CKRTSILDH-------------FE-YTLYLRNKARNTPLADLLQQYRDF---NP-LTN---CNPQL------QFRYSFCLKSCKGM-----------ASFAFILPTTSCLDELNRLTGHLPRLCDGEGKRD----G---Q-APLVW-HRGPNTNPVYALV-VRTAWARHRFGDKVCQQWLKPCFYWNGKSG--YASTGG--GKEGEFWKT-RDPLRLIKKEASAAEAYWPY------------RSYDEKES-FYSLIMINREDADYLRVQSTED-ESYAVLYNYLH--EARLSLQANKNDKD----------------IVYCQYSKCGTEYPVKIVHPINCGYYSRTQPRQRFFIDTTQIAVTNQ-CIYFTIQPSTPW----Q--DYDYFCGILNCTLLQYFTKVYCSYDQQGRMRFFGRSMATIPFAPPPSARFMHE--LALFVQSITFTRTWLYTFIRHAHSGQR----LMERVRS----YEWHLDEADK-----------------------------------------------AALSHYDTFDIRPDADLA-SFS-YWQSIQWIDDFVQR-----KR--GDAFHCFVVLLKIASLFQFAIDQMAYYAYGIPLHLQLEVEKELQLISQRREW----T-MQIRELEY-----NEENWSSLIINTAKSLVD---- RMCBS344292_12151_Rhizopus_microsporus_729705342 
-------------------------------------------------------------------------------------------------------------------MTITNEQTTTTNGK-VND---------LGERETTTVVNILGLSLVNIL-----------------------QD-IRNSQE--TCASSSETATFLAQR----SLDD---------------------------------------------------------------------------------------------------------------------KDRIYIHSLMTGFMAVYLTFI-------------------ELVLSD-------S-----FHDFMRLFSLHDSLL-------STP--------------------------FVWYYYKHHV------SLLYHLRIQLGHL--------DISITSIDTIISKFYTHYFLETSAQKHQKDHGQYY|TPKPVIQFMWEKVIAS-----RPLLVHC-----D------------------------IPRIFD-PCLGIGSFLCEYIHRLIEQC--RH----YVWNDAERLAKLLTQDIPESIWGVEIDPFTYHLCKLNMMVHLFPIYQRLNELQLSLPP------------H--SINRLRLFCNDTLTLRSDD------GNQD---AFEQDCLNLLRDPSKLKFHYIVTNPPYMIRKTGFITQPDPTLYDLQTLGG-KG------------------------------------------------------------------------------------------TQAYVYFMWAALQRIDDQLGQVCLITPSQWTILEFAQHFRE-------WILMNCKLLDMYEF----EPYKIWPKIQTDSLIFRI------CKRKSILDH-------------FE-YTLYLRNKARNTPLADLLQQYRDF---NP-LTN---CNPQL------QFRYSFCLKSCKDM-----------ASFAFILPTTSCLDELNRLTGHLPRLCDGEGKRD----G---Q-APLVW-HRGPNTNPVYALV-VRTAWARHKFGDKVCQQWLKPCFYWNGKSG--YASTGG--GKEGEFWKT-RDPLRLIKKEASAAEAYWPY------------RSYDEKES-FYSLIMINREDADYLRAQSTED-ESYAVLYNYLH--EARLSLQANKNDKD----------------IVYCQYSKCGTEYPVKIVHPINCGYYSRTQPRQRFFIDTTQIAVTNQ-CIYFTIQPSTPW----Q--DYDYFCGILNCTLLQYFTKVYCSYDQQGRMRFFGRSMATIPFAPPPSTRFMHE--LALFVQSVTFTRTWLYTFIRHTHSGQR----LMERVRS----YEWRLDEADK-----------------------------------------------AALSHYDTLDIRPDADLA-SFS-CWQSIQWIDDFVQR-----KR--GNAFHCFVILLKIASLFQFAIDQMAYYAYGIPLHLQLEVEKELQLISQRREW----T-MQIRELEY-----NEENWSSLIMNTAKSLVD---- LRAMOSA03076_Absidia_idahoensis_var_thermophila_671695680 
----------------------------------------------------------------------------MDQQQCKQHEQPLSLL-------------NRAASPASP-IPSSN-ERKRSY---IHM---------NEVDVVKDVVAMLGDVLERIN-----------------------DAWV--------------------------QQED---------------------------------------------------------------------------------------------------------------------EETRRMRGLVTTYVAFVEAVA-------------------DDLLHN--DNNDRP-----FAAYMAIFDPHDQMA------DDDP--------------------------LLHHDTLFRR------SLAKDIRHRLNQLGV--QS--ESLLKCVDTIFARFYT----NKAPPQQQKDHGQFY|TPQTVVRFMWEQCLAN-----NN------KK--H------------------------VPRVLD-PCMGMGAFLCEFLTRWVMQL--DS----ATWDNAVALEQVLCTDIPAHIWGVELDPVALRLAKLNILVHLLPLYRRLRQLTQQNTL------------T-LRVDRLHLFCSDTLRLTP-V-------GDD---PWEHVELQRLH-SGHLVFDYIVTNPPYMIRKTGRITDPDPALYDSRILGG-RG------------------------------------------------------------------------------------------VQAYVYFMWICLQRCDPHDGELCLITPSQWLVLEFARHLRA-------WIWEHCELLQLFQL----EPYKVWPRVQTDSLIFRL------RMRGTRPPN-------------LNTHTLFLRHTARRATLQDILAAYTTF---NPHQQP---PSSDI------AYKYTPTHDRSRIQNSPN-------ASFAFLSPSTSLTGELAQLTHSLSRLCDGP------------G-APLVF-HRGPNTHPVYALV-VRTQWARDYFGPQCCSRWLRPAFYWSGK-----AAGTN--DPESIFWHL-RDPQRLARKETSPAEAYAPF--------------YAPDA--NYSLLLVDKEGADKLESSAATL-DQDARLYEYLQ--AARVALQPTREERK----------------VTWCHYNQSGADVAVKIVHPINCGYFTKSQPRQRFFVDRHQLCVTNQ-CMYFTLSPETDL-------SAEFFCGILNSSTVQFFLREHCAYDQQRRTRFFGRHLANIPCCSLPVASSFEATLMTDLVHAVTISRLWIYAIVWYT-DAQH----VIEHLRA----GTWDIHPGDM-----------------------------------------------PRVSAVSVQD-LNHSHHTSAWS-NDIRSHWISRVLDS-----RQH-TSLDIILVQLLQLASLFQYGIDQLTFVLYHVPIPLQRALEHELELVQATARW-S--H-VSLND----------------IFDTAQAILH--ND LCOR_02075.1_Lichtheimia_corymbifera_JMRC:FSU:9682_661187564 
-----------------------------------MNSEIIEDQQYEPAAKRRKL---------------------PNQQKQPQIQQPLSSS-------------SKQLVPPNP-LGSSYIERKRAY---VKV---------SGGNLVQDTLAILGGVFTRVN-----------------------TA-L--------------------------RQED---------------------------------------------------------------------------------------------------------------------EGTRHSRTLMTTFAVFTEAVT-------------------GYFLHDVSKPDERP-----FATYMAVFSPQDQLT------DGDP--------------------------LLHPDVAFRQ------SLAMDIQYQLERIGV--QR--DKLYDCIDTLFSRFYA----DKTTSVHQKNKGQFF|TPQNVVHFMWGRCLDH-----KN------KN--Q------------------------MPRVLD-PCMGLGAFLCDFLRRWVTQLQRDG----ATWDNAGVLQQALCTDIPANVWGVEVDPVALKLSKLNAMLHVLPLYRRLRQLTGNNTD------------GFLRVNRLHLFCNNTLQLDPST-------GID---AWEQQELQVLR-SGY--FDYVITNPPYVSQKGKCFAVPDPALYDELVLGD-CI------------------------------------------------------------------------------------------KQAYAFFIWFCLQRCDPQEGEVCFITPSHWMTSEHDYNLRI-------WIWENCEFLQLYYF----KSAKVWPRLNTDSMIFRL------RMRGTRPPD-------------LNARSLYVRNMEIGLPLQDILDSYVAF---DP-QQP---QPKHI------KFKLTPTHDPQRIHQSSG-------AKLTFLCLS-PLTDELQEFTKSMTRLCNSR------------G-APLEF-TSGCSPMPRYGLV-VRTKWALENFGTRCYARWFRPAFYWSSK-----GARSK--GCEVDFWRV-RDPERIVKRELPPSEAFVPF--------------YTAEDAKKYSLILVDKDGIAELEVSTDP---EEERLYEYLQ--EARAEMQP-GNKRK----------------AIFSPFFRSGVDEAVKIVHPALNGYQSKYTPRQRFFVDRDQLGVTDQ-CGFFTLARGVDL-------SPEFFCGLLNSSTLLYLLRHYCTYDQEQRMYFYERHLKNVPCCDLPSASSQEAALMNDLVHAVTTARVWINAIVQAS-GAQY----ITTSLRN----CTWDIHPDDF-----------------------------------------------SAIDGLLIQDFLTSPEHTEGWS-DEVKNHWISQSLFS-----RPH-ARLVVVLVQLLMLSSLFQFGIDQLAYVLYRIPGNMQHAMEEEMEHVEFREQW-S--H-VTLDD----------------IFDTAQSILQ---- RirG_033390_Rhizophagus_irregularis_DAOM_197198w_595493243 
MSNANDNTQTVAPPPKRLKSSDEVSSQSDD-----WLSVVLNEVEVDPRIDTWN-------------CFFQDSHVSRHKPQNSEITFNTPFI-------------FKSVNPAQN-LVNGAESNSQRRQE-FSSNLSSFSTNFLSEDLVGKVVEVLGILLEEVK-----------------------KD-IWITII---NKDHLIPASFLREFNKL-VHDSHPNNVKTPFIKCVEEFRSFISRV---------------------------------QSSYLS-------------------------------------------------PDIELFKNALNSYCQITSFITVVHMLF------DTVVMDFLCENNEEISIRQ--LS---------VETFMNIFHDDDNFLKRNIPFDDQP-----------------PNASENLEAFTWYYRLVVRSQP---QYQCDLSVRFHSLNI--H---FTTLEPIESILSILYTKHFLNVLAKEQQKDHGQFY|TPREVMQFMWDRVLIGKGN--RTWIEKL----LG--GSFQNSSLYPNQSSSNWGVLPQAPSVLD-PCMGIGSFLCEYISRLILAA--QQ--CPIVWNNSVAISNLL-HSLSVNLWGIEIDAFAYHLCKINITLHILPLYKRYLHLTSLIND------------L--KLSRLHLFCNDTLNLYLPK-------REH---TWEYENLWLLRSPQRLKFDFIVTNPPYMIRKTGFISEPDSELFDERVLGK-GG------------------------------------------------------------------------------------------MQAYAYFFWFCVERCREEIGEVCLISASQWMGLEFADKLRA-------WMWQKCHLVEFFQF----EPFKVWRKIQTDSLIFRL------RRRSEPILSTSVPPIQPVLT--ES-SILFLRYMNRKATLHETLQAYSNF---DP-NNAQ--YEKDM------QYKLSLPYPMTQLPSSTN------SYSFTFLMPSSAVSAYLHSITAHLPSLCDHASMKHTWV-E---N-NPLIW-HRGPNTNPVYALV-VRTTWAYSKFGPEVCRRWLRPVFYWNGKNG----------GKEAEFWQKMGDELRLEKKESSPAEAYVPFIVNNSGGTTIAKDGLEQDRS-MYSLIMVDRDAVDKVRREFGEN----SEFWKYLK--EARKYLQTGFTSRE----------------VVYCGTSKCGIDVQTKIIHPINYGYFSKNQPRQRFFIDEDNVCVTNQ-CIYFTVKTTTRL-------PPYFFLGILNSTTIQHFLSHHCKYDQQGRMRLFRENMAKIPYAAPAHSDGVEW--FIRCVQRMILARQMLYEGIRICGDEDK----VASTIVEKLRRGAWQLTGREWQEKESNAYNDETSVILREEQEGNWVMVKQCRGRGDGVEYSLCPGSGGEQVARHETEETTEQTTTFHNFS-SAADAAVLKDHRAPSGNNISKT-HIMKPFFESLLYASACLQYAIDQYTYTLYGINAKFQMALEEELKLELFEAIL---NKYPRLNGTAG---NEDGEEKKGKVPEWGERLFE---- RirG_033390_Rhizophagus_irregularis_DAOM_197198w_595493244 
MSNANDNTQTVAPPPKRLKSSDEVSSQSDD-----WLSVVLNEVEVDPRIDTWN-------------CFFQDSHVSRHKPQNSEITFNTPFI-------------FKSVNPAQN-LVNGAESNSQRRQE-FSSNLSSFSTNFLSEDLVGKVVEVLGILLEEVK-----------------------KD-IWITII---NKDHLIPASFLREFNKL-VHDSHPNNVKTPFIKCVEEFRSFISRV---------------------------------QSSYLS-------------------------------------------------PDIELFKNALNSYCQITSFITVVHMLF------DTVVMDFLCENNEEISIRQ--LS---------VETFMNIFHDDDNFLKRNIPFDDQP-----------------PNASENLEAFTWYYRLVVRSQP---QYQCDLSVRFHSLNI--H---FTTLEPIESILSILYTKHFLNVLAKEQQKDHGQFY|TPREVMQFMWDRVLIGKGN--RTWIEKL----LG--GSFQNSSLYPNQSSSNWGVLPQAPSVLD-PCMGIGSFLCEYISRLILAA--QQ--CPIVWNNSVAISNLL-HSLSVNLWGIEIDAFAYHLCKINITLHILPLYKRYLHLTSLIND------------L--KLSRLHLFCNDTLNLYLPK-------REH---TWEYENLWLLRSPQRLKFDFIVTNPPYMIRKTGFISEPDSELFDERVLGK-GG------------------------------------------------------------------------------------------MQAYAYFFWFCVERCREEIGEVCLISASQWMGLEFADKLRA-------WMWQKCHLVEFFQF----EPFKVWRKIQTDSLIFRL------RRRSEPILSTSVPPIQPVLT--ES-SILFLRYMNRKATLHETLQAYSNF---DP-NNAQ--YEKDM------QYKLSLPYPMTQLPSSTN------SYSFTFLMPSSAVSAYLHSITAHLPSLCDHASMKHTWV-E---N-NPLIW-HRGPNTNPVYALV-VRTTWAYSKFGPEVCRRWLRPVFYWNGKNG----------GKEAEFWQKMGDELRLEKKESSPAEAYVPFIVNNSGGTTIAKDGLEQDRS-MYSLIMVDRDAVDKVRREFGEN----SEFWKYLK--EARKYLQTGFTSRE----------------VVYCGTSKCGIDVQTKIIHPINYGYFSKNQPRQRFFIDEDNVCVTNQVC----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- RirG_033390_Rhizophagus_irregularis_DAOM_197198w_595493245 
MSNANDNTQTVAPPPKRLKSSDEVSSQSDD-----WLSVVLNEVEVDPRIDTWN-------------CFFQDSHVSRHKPQNSEITFNTPFI-------------FKSVNPAQN-LVNGAESNSQRRQE-FSSNLSSFSTNFLSEDLVGKVVEVLGILLEEVK-----------------------KD-IWITII---NKDHLIPASFLREFNKL-VHDSHPNNVKTPFIKCVEEFRSFISRV---------------------------------QSSYLS-------------------------------------------------PDIELFKNALNSYCQITSFITVVHMLF------DTVVMDFLCENNEEISIRQ--LS---------VETFMNIFHDDDNFLKRNIPFDDQP-----------------PNASENLEAFTWYYRLVVRSQP---QYQCDLSVRFHSLNI--H---FTTLEPIESILSILYTKHFLNVLAKEQQKDHGQFY|TPREVMQFMWDRVLIGKGN--RTWIEKL----LG--GSFQNSSLYPNQSSSNWGVLPQAPSVLD-PCMGIGSFLCEYISRLILAA--QQ--CPIVWNNSVAISNLL-HSLSVNLWGIEIDAFAYHLCKINITLHILPLYKRYLHLTSLIND------------L--KLSRLHLFCNDTLNLYLPK-------REH---TWEYENLWLLRSPQRLKFDFIVTNPPYMIRKTGFISEPDSELFDERVLGK-GG------------------------------------------------------------------------------------------MQAYAYFFWFCVERCREEIGEVCLISASQWMGLEFADKLRA-------WMWQKCHLVEFFQF----EPFKVWRKIQTDSLIFRL------RRRSEPILSTSVPPIQPVLT--ES-SILFLRYMNRKATLHETLQAYSNF---DP-NNAQ--YEKDM------QYKLSLPYPMTQLPSSTN------SYSFTFLMPSSAVSAYLHSITAHLPSLCDHASMKHTWV-E---N-NPLIW-HRGPNTNPVYALV-VRTTWAYSKFGPEVCRRWLRPVFYWNGKNG----------GKEAEFWQKMGDELRLEKKESSPAEAYVPFIVNNSGGTTIAKDGLEQDRS-MYSLIMVDRDAVDKVRREFGEN----SEFWKYLK--EARKYLQTGFTSRE----------------VVYCGTSKCG-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- MVEG_04886_Mortierella_verticillata_NRRL_6337_672825191 
TNSPSDPMLRPSSSPSSFPTSSSASIPMPH-----FRGQLLHRA---PDSAYQR-------------RYFGASTSGSGSSPNPLSSFSPSTVPVPIHSDPLSKMPRKTSTSKRP-QPNDTIGNTSTSGP-FSSGRGSIANQ--DATLTSYLHTQLGRLLLSTKAHVRRLLESFLHQKAASTVHPRLES-VLVELF---N-GMLMSHNALATREHI-SSQEQPMTLRPVLIQSLDHLAALMAHAPSVSSSSTPATLATATPGATRKKAPPHPPSRLSQSIYFSDTDRDTDSDLDLEGVAQGTDATRREATEGTTGRSSGPPRPESELLAIEIETVEMYTSTIEFLSLMTAFVSVVCWLMQARLKRDTSQPTTLGVDIEDVDMEN--RQPTDKDCFDRLHSFLLLFDDHDNVLK---PIQDALHSEEQGGHSTTHPRGAMDDPTRQVGLFAWHFSLFSDDEDGPLQQEVNLIDRQALDSI--H---F------DVILNDLYSTHVLAMTAKEHQKDHGQFY|TPSNVVDFMWRRAIVGRENLLERFVANL----GGAKGQGVQASMAPVESEASL-----VPTALD-PCLGVSTFLSCYVRLLIQKA--RQDHTETIWNSPIASRLLL-AQICENIWGIELDGFAFWMARCGILASLIPLVERVQKLQHQQQQGLQAYQAGRGETT--KLTRLHLFRNDTLQLTVPD-------GVHPDKSWERACILQLRDPQLLRFDFIVTNPPYMIRKTGTFSAPDPEVYDWSILET-GGSPTIITSNVSPSETGSRSKPRRGSISPNPPINEDVVSAAEEEDELEGSDSEATTPDSRSGSPRSSRVKASSSSWPTSSASASMRLGAKGMMQAYGYFIWFAAQRIKPYAGVSCMITASQWLTLEFATKLRA-------WLFENCLMDEFFQF----EPFKVFAKVQTDSLIFKIRSMEPGRTRQDSSIEPSIPLYDRLLEIGAH-RTVFLRHTDHHRPLDGILQDYMDFFAISP-QEQS--SSVNIMVSNKTREELSAVIAAAPQPSSSSTTVTAPTYSFAPMMPSSLLSTFLLSLTQDLPGICSAGTKRVNRL-S---AVEPLLW-HRGPNTNPVYGLV-VRMEYAEVMFGEVMKARWFRPAFYWNGKNSPEVGMMTKALHKEGQFWQG-RDRLRLSKKEGSPAESY---LVPTPG-----------SHR-LYGLCMVDKESVKVLREQMAQGVQGAAALWQYLT--DVRNHFQPGLASKKRKVFLSGKQQMTDDEGVAYCSTNQCGSDVPEKLVHPINYGYFSKTQPRQRFFLDTSSLAVTNQ-CIYLTLNKLSHHYDAAQSPPLIYFLTLLNSSTLQFFVLHHCQYDQQGRMRLFRESMAKIPFQDRDVKSSP---------QRIQYAAQL---GQLMIDLKGT----LYKVVME------WHLTGSSSRTDLGAPRLSEPFIGSVGGNQGLLDWIRRGGDPPTGV-LPKTRDQIWRMLQGHASAPTTRSPSAPSSIA-QLSTSA--PPALPALGAHFHRA-ESLSTQADIDTDTNTGTDTDTDTDDNFESGRRSRFDQEEDFEKPRREYQHPL---QE-PRASGFSP---QQYNQQHSSWLKSSNDPTPS---- RirG_248540_Rhizophagus_irregularis_DAOM_197198w_595439684 
--------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MELYSPNI--H---FAILELIESKLLILYAKHFVNVLLKE-QKNHGQFY|TLRKVMRFMWDQIGIK-GN--RKRIEIF-LR-----GTF--------QTSSNWEVFPQALNVLDNSCMKMRSFLCEFVSRLTPAA--HI-------------------EIKDLYWYQKAAEQGFDNAKFN----------------------------------------LGLWYNNEIFIEKDE------AGRF---YWCQKAAEDGDKTAQFNLGI------YYYYGVGIEKDEAKSFYWFQKAAE-SG------------------------------------------------------------------------------------------FKGAQFNLGICYQYGDCIEKDK--IKAFYWYHKAVKNGLKQ----------AQYNLGIYYEN-------GVGIDKNEDKAFYWY---Q--KAAENGVKE-------------AQ-FNLGLCYENGNGIGKDEVKAFYWY-H-RA-------AENGL--KEA-QYNLGTCYKNGDGIEKD--------DVKAFYWYQKAVENGLKEAQLNLEN-CRFNGIGIEK--D---E-VNIFYAHHKPEERSLNAIKWIENALKYEKVKFIPYKEIKNTQPLCKGRFG----------HISKVIWTK-INNYVICKKLINTIDNKNNL---------L--DAFIHELK-INLHLNYSNRIIRCLGISFDQKTSEYLLIMEYANGGDLQSYLKNNFN------Y------------LTWNDKKKLAFQIADGLNYLHNENVIHRDLHSRNIVIHENTAKITDF-GISKNQNDQISI-------AYIDNFGVVAYMEPKCLIDPNFPYTKSSDIYSFGVLMWEISSGYPPFKDNDNI-----VALAISINTDIIYYSMPLF------A--LYL------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------ GLOINDRAFT_201382_Rhizophagus_irregularis_DAOM_181602_552925964 
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MGLEFADKLRA-------WMWQKCHLVEFFQF----EPFKVWRKIQTDSLIFRL------RRRSEPILSTSVPPIQPVLT--ES-SILFLRYMNRKATLHETLQAYSNF---DP-NNAQ--YEKDM------QYKLSLPYPMTQLPSSTN------SYSFTFLMPSSAVSAYLHSITAHLPSLCDHASMKHTWV-E---N-NPLIW-HRGPNTNPVYALV-VRTTWAYSKFGPEVCRRWLRPVFYWNGKNG----------GKEAEFWQKMGDELRLEKKESSPAEAYVPFIVNNSGGTTIAKDGLEQDRS-MYSLIMVDRDAVDKVRREFGEN----SEFWKYLK--EARKYLQTGFTSRE----------------VVYCGTSKCG-------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- RMATCC62417_18770_Rhizopus_microsporus_727130617 
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------M-----------ASFAFILPTTSCLDELSRLTGHLPRLCDGEGKRD----G---Q-APLVW-HRGPNTNPVYALV-VRTAWARHRFGDKVCQQWLKPCFYWNGKSG--YASTGG--GKEGEFWKT-RDPLRLIKKEASAAEAYWPY------------RSYDEKES-FYSLIMINREDADYLRAQSAED-ESYAVLYNYLH--EARLSLQANKNDKD----------------IVYCQYSKCGTEYPVKIVHPINCGYYSRTQPRQRFFIDTTQVAVTNQ-CIYFTIQPSAPW----Q--NYDYFCGILNCTLLQYFTKVYCSYDQQGRMRFFGRSMATIPFAPPPSARFMHE--LALFVQSVTFTRTWLYTFIRHAHSGQR----LMERVRS----YEWHLDEADK-----------------------------------------------AALNHYDTLDIRPDADLA-SFS-CWQSIQWIDDFVQR-----KR--GDASHCFVVLLKMASLFQFAIDQMAYYAYGIPLHLQLEVEKELQLISQRREW----T-MQIRELEY-----NEENWSSLIINTAKSLVD---- GLOINDRAFT_201358_Rhizophagus_irregularis_DAOM_181602_552925963 
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|-----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MRLFRENMAKIPYAAPAHSDGVEW--FIRCVQRMILARQMLYEGIRICGDEDKVASTIVEKLRR----GAWQLTGREWQEKESNAYNDETSVILREEQEGNWVMVKQCRGRGDGVEYSLCPGSGGEQVARHETEETTEQTTTFHNFS-SAADAAVLKDHRAPSGNNISKT-HIMKPFFESLLYASACLQYAIDQYTYTLYGINAKFQMALEEELKLELFEAIL-N-KY-PRLNGTAG---NEDGEEKKGKVPEWGERLFE---- RO3G_16192_Rhizopus_delemar_RA_99-880_384500990 
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------|---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MIRKTGFITQPDPTLYDLQTLGG-RG------------------------------------------------------------------------------------------TQAYIYFMWVALQRIDDQQGQLCLITPSQWTVLEFAQHLRE-------WILANCKLLDMYEF----EPYKVWPKVQTDSLIFRI------CKRTSVLPN-------------TD-YTLYLRNKARNTSLTDILQQYNVF---NP-AES---HDPEL------QYRYGFCTRNYKDL-----------TSFAFILPTTSCLDELNRITGHLPRLCDGEGKKNSWYTD---Q-IPLVW-HRGPNTNPVYALV-VRTSWAKQTFGDKICQLWLKPCFYWNGKSG--SAAKGG--GKEGEFWKS-RDPLRLCKKETSAAEAYWPY------------RLLDSQDS-FYSIIMVNREDADFLKSQVEHD-SSYKAFYSYLR--EARLALQANQNDKD----------------IVYCQYSKSGTDHPVKIVHPINCGYYSRTQPRQRFFVDTTQIAGSLQ-L----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- consensus/100% 
.......................................................................................................................................................................................................................................................................................................................................................................................................................................................................................bb..................-sbh..hYs...........QKs+GQaa|...........................................................................................................................................................................................................................................................................................................................................................................................h+............ph.h..hb..........h....ppcp...bh..........p...p....................................a..a..................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................... 
consensus/95% .......................................................................................................................................................................................................................................................................................................................................................................................................................................................................................bb..................-sbh..hYs...........QKs+GQaa|...........................................................................................................................................................................................................................................................................................................................................................................................h+............ph.h..hb..........h....ppcp...bh..........p...p....................................a..a..................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................................... 
consensus/90% ....................................................................................................................s.....ppp......p.............p........LG..h...........................p..l............................pp............................................................................................................................sbhTsahsh...h.......................h.p.............h..ah.lFp.pDph........s............................h.a.................pl..bh..............h..l-olhsbhYs....p....bpQKsHGQaY|.................................................................................................................................................................................................................................h.b.s......s..ha....h................................................................................................buh.a.h.hshbb.c......ChIps..W..bEaspphR........bhh.phph.phabh....p.hpla.chpTDphha+l......p.R.p...p..............p...l.lp...........l..Y..F..................................................bs.h..p..h.s.L.php.ph...Cp.................sl.a.pp.sp..s..ul..lc..h.b..h......bhbps..hhpu+..............p..hWp...s...l.++b.ss.-sb............................h.hspc.h..l..p..........hh.Yhp...hb..hbs......................h..s...b.u.............................................................................................................................................................................................................................................................................................................................................................. 
consensus/85% ....................................................................................................................s.....ppp......p.............p........LG..h...........................p..l............................pp............................................................................................................................sbhTsahsh...h.......................h.p.............h..ah.lFp.pDph........s............................h.a.................pl..bh..............h..l-olhsbhYs....p....bpQKsHGQaY|T.p.VhpFMW.bh.............................................h.phhD.sCh.h.sFlspalpbh...h..p....................pl...ha..chs..sh..s+hs........................................h.La.sspl.l..................ap...h...c.s....h.h......YhhbKsG.hs.PDs.laDbp.Lu...............................................................................................QAY.aFhWhshbRhc...G..ChIosSbW..LEFApphR........Whh.pCcl.phabh....csaKla.+lQTDShIF+l......p.Rpp...p..............p...LaLR..s.p.sL.p.Lp.Y.sF...ps...........h.........bp.s...................shsFhh.o.sh.sbL.phT.pLsplCs.................PLha.p+GPsspPlYuLV.VRs.aAb..hG.b.h.bWh+PsFYWsGK..............Es.FWp...D..Rl.KKE.ssuEua.sh..............h..p...bYuhlhls+-shc.lc.p.sp.......ha.YLp..-hR..hQssbppcc................ls.s..pbsG.............................................................................................................................................................................................................................................................................................................................................................. 
consensus/80% ....................................................................................................................s.ss.pppp......ps..........s.p....ll..LG..L..l........................ps.l............................ps......................................................................................................................ps....sbhTsFhshh..hh...................ph.lpp.............h.pah.lFp.cDphh.......spP..........................h.aa............ph..clp.bh.p............h.sl-oIhubhYopahlp..s.cpQKDHGQaY|TPp.VhpFMWcbhh.......p....................................hsplhD.PChGhuuFLC-alpRh...h..p.......Wsss.....hL..pls.plaGlElDshshphsKhNh.hplhPlhbRh.pL.....................plpRL+LFhNsTLpL...p.........c....aEb..l..L+.s..h.FcallTNPPYMIRKTGhhopPDs.laDbp.LG...............................................................................................QAYhYFhWhslbRhc...GblClIosSQW..LEFAppLR........WhhbpCcLlphabF....EPaKlW.+lQTDSLIFRl......pbRsp.h.p..............p...LaLR..s+p.sL.c.Lp.YpsF...pP..p.....p..h........bac.s......p............SFsFlhPosuh.sbLpphT.pLsplCc................sPLla.pRGPNTpPVYuLV.VRT.WAbppFG.bshpbWh+PsFYWsGK............sbEu.FWp...D.hRL.KKEsSsAEAYhPa..............h..p...bYShlhls+-shcblc.p.sp.......hapYLp..-AR..LQssbppcc................ls.C..s+sG......l.a....shbp+...p.phhlcppph..o.b......................................................................................................................................................................................................................................................................................................................... 
consensus/75% ....................................................................................................................s.ss.pppp.p....ps..........s.cb...lV.hLG..L.pl........................ps.lb.pb....s.s...s.shL.pb....s.ps......................................................................................................................ps....sbhTuFhsVh..hh...................cl.lpp.............hcsFh.lFpscDshL.......spP..........................F.Waa...........ph..cLp.bh.p.s..........h.sI-oIhSbhYTpaFLp..s.cpQKDHGQaY|TPp.VhpFMWcbhh.......p....................................hsplhD.PChGhuuFLC-alpRh...h..p.......Wsss.....hL..pls.plaGlElDshshphsKhNh.hplhPlhbRh.pL.....................plpRL+LFhNsTLpL...p.........c....aEb..l..L+.s..h.FcallTNPPYMIRKTGhhopPDs.laDbp.LG...............................................................................................QAYhYFhWhslbRhc...GblClIosSQW..LEFAppLR........WhhbpCcLlphabF....EPaKlW.+lQTDSLIFRl......pbRsp.h.p..............p...LaLR..s+p.sL.c.Lp.YpsF...pP..p.....p.ph........ba+ho.....sp............SFsFlhPosuh.sbLpplT.cLsplCD..s.............sPLlW.pRGPNTNPVYuLV.VRT.WAbppFG.csCpbWL+PsFYWNGKs...........GKEubFWpp..D.hRL.KKEsSsAEAYhPa..............h..p.s.hYShIhls+-sh-bl+.p.spp....u.hapYLp..-AR..LQssbspcc................lsaC..sKsG....sKllHPhN.GYho+sbPR.RFFlDppphslTsQ.s................................................b.a...h..ls....s..p.............h..s..h.................l.............................................................................................................................................................................................................. 
consensus/70% ....................................................................................................................s.ss.pppp.p....ps..........s.cb...lV.hLG..L.pl........................ps.lb.pb....s.s...s.shL.pb....s.ps......................................................................................................................ps....sbhTuFhsVh..hh...................cl.lpp.............hcsFh.lFpscDshL.......spP..........................F.Waa...........ph..cLp.bh.p.s..........h.sI-oIhSbhYTpaFLp..s.cpQKDHGQaY|TPp.VhpFMW-+hh.......p.bl..h..............................hsplhD.PChGlGoFLCEalpRhlb.h..p.......Wsss..l.plL.psls.plaGlElDshsapLsKhNh.lHlhPlYbRh.pL....s................plpRL+LFCNDTLpL...p.........c...saEbp.L.bLRps.bLpFcalVTNPPYMIRKTGhlopPDs.lYDbp.LG...............................................................................................QAYhYFhWhslQRhc.p.GblCLIosSQWhsLEFAppLR........WhhbpC+Ll-habF....EPaKVW.KlQTDSLIFRl......pbRsp.l.p..............p...LaLR..s+pssL.-.Lp.YpsF...pP..ps....p.ph........ba+ho.s...sp............SFsFlhPosuh.sbLpplT.HLsplCD..s.p.........p.sPLlW.HRGPNTNPVYuLV.VRTsWAbppFG.csCpbWL+PsFYWNGKs...........GKEu-FWpp.bD.lRL.KKEsSsAEAYhPa..............h..c.s.hYSlIhls+-shDbL+.p.sps....u.hapYL+..-AR..LQssbsp+-................lsaC..sKsG..hssKIlHPINhGYao+sQPR.RFFlDppphsVTsQ.C..bs..............s....hslls....b.h....h.Ysbp.chbhF.c.hh.lPhss.s..p..p......hsp.h.hsp.hl...h............lhp.l........Wplp..p..................................................l............p...shu....p.........................h......ss..p.s.Dp...........hp...-.-b.....p..h.......ph....................p.sp........Back to Contents
GI/Gene label | Domain architecture | Pfam architecture | Gene name | Len | Taxonomy | Species name | Genbank/other annotation
Uram1000007539 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I | Uram1000007539 | 1090 | eukaryota>fungi>mucoromycotina | Umbelopsis ramanniana | fgenesh1_kg.52_#_84_#_combest_scaffold_52_106915
Uram1000001377 | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+TaqIC/RAGNYA/Methylase_S | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1 | Uram1000001377 | 655 | eukaryota>fungi>mucoromycotina | Umbelopsis ramanniana | fgenesh1_kg.5_#_213_#_combest_scaffold_5_101108
Bcir1000010321 | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+N6-MTase+TaqIC/RAGNYA/Methylase_S | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1 | Bcir1000010321 | 690 | eukaryota>fungi>mucoromycotina | Backusella circina | e_gw1.291.36.1
Bnat1000001029 | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+TaqIC/RAGNYA/Methylase_S | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1 | Bnat1000001029 | 678 | eukaryota>rhizaria | Bigelowiella natans | estExt_fgenesh1_pg.C_320094
Bnat1000018648 | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+TaqIC/RAGNYA/Methylase_S | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1 | Bnat1000018648 | 394 | eukaryota>rhizaria | Bigelowiella natans | e_gw1.71.34.1
Bcir1000008046 | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+TaqIC/RAGNYA/Methylase_S | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1 | Bcir1000008046 | 633 | eukaryota>fungi>mucoromycotina | Backusella circina | estExt_Genewise1.C_420006
384490648 | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+TaqIC/RAGNYA/Methylase_S | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1 | RO3G_06575 | 484 | eukaryota>fungi | Rhizopus delemar RA 99-880 | hypothetical protein RO3G_06575 [Rhizopus delemar RA 99-880].
595439684 | N6-MTase[]+N6-MTase[Sel1+Sel1+Sel1+Sel1+Sel1+Kinase+N6-MTase+TaqIC/RAGNYA/Methylase_S | Sel1+Sel1+Sel1+Sel1+Sel1+Pkinase | RirG_248540 | 637 | eukaryota>fungi>glomeromycota | Rhizophagus irregularis DAOM 197198w | Mkk2p [Rhizophagus irregularis DAOM 197198w].
384491727 | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+N6-MTase+TaqIC/RAGNYA/Methylase_S | Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1+Sel1 | RO3G_07628 | 644 | eukaryota>fungi | Rhizopus delemar RA 99-880 | hypothetical protein RO3G_07628 [Rhizopus delemar RA 99-880].
595493245 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I | RirG_033390 | 886 | eukaryota>fungi>glomeromycota | Rhizophagus irregularis DAOM 197198w | hypothetical protein RirG_033390 [Rhizophagus irregularis DAOM 197198w].
727145438 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I | RMATCC62417_07972 | 936 | eukaryota>fungi | Rhizopus microsporus | hypothetical protein RMATCC62417_07972 [Rhizopus microsporus].
729705342 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I | RMCBS344292_12151 | 934 | eukaryota>fungi | Rhizopus microsporus | hypothetical protein RMCBS344292_12151 [Rhizopus microsporus].
595493244 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I | RirG_033390 | 925 | eukaryota>fungi>glomeromycota | Rhizophagus irregularis DAOM 197198w | hypothetical protein RirG_033390 [Rhizophagus irregularis DAOM 197198w].
758364669 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | .N6_Mtase+Eco57I
511007562 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I | HMPREF1544_04357 | 962 | eukaryota>fungi | Mucor circinelloides f. circinelloides 1006PhL | hypothetical protein HMPREF1544_04357 [Mucor circinelloides f. circinelloides 1006PhL].
758346042 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I | MAM1_0525c10839 | 1074 | eukaryota>fungi | Mucor ambiguus | hypothetical protein MAM1_0525c10839 [Mucor ambiguus].
Ccor1000001613 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I | Ccor1000001613 | 1177 | eukaryota>fungi>entomophthoromycota | Conidiobolus coronatus | gm1.1775_g
595493243 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I | RirG_033390 | 1238 | eukaryota>fungi>glomeromycota | Rhizophagus irregularis DAOM 197198w | hypothetical protein RirG_033390 [Rhizophagus irregularis DAOM 197198w].
729710234 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I | RMCBS344292_07627 | 936 | eukaryota>fungi | Rhizopus microsporus | hypothetical protein RMCBS344292_07627 [Rhizopus microsporus].
384500990 | N6-MTase-frag | - | RO3G_16192 | 383 | eukaryota>fungi | Rhizopus delemar RA 99-880 | hypothetical protein RO3G_16192 [Rhizopus delemar RA 99-880].
Pbla1000013531 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase | Pbla1000013531 | 1204 | eukaryota>fungi>basal | Phycomyces blakesleeanus | estExt_fgeneshPB_pg.C_140177
661187564 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase | LCOR_02075.1 | 946 | eukaryota>fungi | Lichtheimia corymbifera JMRC:FSU:9682 | hypothetical protein RO3G_16192 [Lichtheimia corymbifera JMRC:FSU:9682].
672825191 | N6-MTase+TaqIC/RAGNYA/Methylase_S | - | MVEG_04886 | 2159 | eukaryota>fungi | Mortierella verticillata NRRL 6337 | hypothetical protein MVEG_04886 [Mortierella verticillata NRRL 6337].
552925964 | N6-MTase+TaqIC/RAGNYA/Methylase_S(frag) | - | GLOINDRAFT_201382 | 310 | eukaryota>fungi>glomeromycota | Rhizophagus irregularis DAOM 181602 | hypothetical protein GLOINDRAFT_201382 [Rhizophagus irregularis DAOM 181602].
Mver1000004892 | N6-MTase | - | Mver1000004892 | 2160 | eukaryota>fungi>zygomycete | Mortierella verticillata | Mortierella verticillata NRRL 6337 hypothetical protein (2160 aa)
Lhya1000010458 | N6-MTase | - | Lhya1000010458 | 392 | eukaryota>fungi>mucoromycotina | Lichtheimia hyalospora | estExt_fgenesh1_pm.C_2000009
727130617 | N6-MTase(frag)+TaqIC/RAGNYA/Methylase_S | - | RMATCC62417_18770 | 448 | eukaryota>fungi | Rhizopus microsporus | hypothetical protein RMATCC62417_18770 [Rhizopus microsporus].
Bcir1000016958 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase | Bcir1000016958 | 329 | eukaryota>fungi>mucoromycotina | Backusella circina | MIX930_10_37
552925963 | TaqIC | - | GLOINDRAFT_201358 | 274 | eukaryota>fungi>glomeromycota | Rhizophagus irregularis DAOM 181602 | hypothetical protein GLOINDRAFT_201358 [Rhizophagus irregularis DAOM 181602].
Bcir1000008318 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I | Bcir1000008318 | 992 | eukaryota>fungi>mucoromycotina | Backusella circina | estExt_fgenesh1_pg.C_1200041
671695680 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+UPF0020+Eco57I | LRAMOSA03076 | 938 | eukaryota>fungi | Absidia idahoensis var. thermophila | hypothetical protein LRAMOSA03076 [Absidia idahoensis var. thermophila].
---- Prokaryotic homologs---- # 1;
658523148 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+TaqI_C | - | 866 | bacteria | Atribacteria bacterium SCGC AAA255-G05 | hypothetical protein, partial [Atribacteria bacterium SCGC AAA255-G05].
489091935 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Methyltransf_26+Eco57I+TaqI_C | - | 1254 | bacteria>spirochaetes | Leptospira weilii | N-6 DNA methylase [Leptospira weilii].
740186027 | REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S | HSDR_N_2+N6_Mtase+Eco57I | - | 1215 | bacteria>deinococci | Thermus sp. NMX2.A1 | hypothetical protein [Thermus sp. NMX2.A1].
495592567 | REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S | HSDR_N_2+N6_Mtase+Eco57I | - | 1166 | archaea>euryarchaeota | Haloferax mucosum | type i restriction-modification system methyltransferase subunit [Haloferax mucosum].
489139504 | REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S | HSDR_N_2+N6_Mtase+Eco57I | - | 1214 | bacteria>deinococci | Thermus aquaticus | N-6 DNA methylase [Thermus aquaticus].
495849928 | REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S | HSDR_N_2+N6_Mtase+Eco57I+TaqI_C | - | 1167 | archaea>euryarchaeota | Haloferax | MULTISPECIES: type i restriction-modification system methyltransferase subunit [Haloferax].
647643842 | N6_Mtase | N6_Mtase+Eco57I | - | 1198 | bacteria>proteobacteria>betaproteobacteria | Herminiimonas sp. CN | hypothetical protein [Herminiimonas sp. CN].
652390117 | N6_Mtase | N6_Mtase+Eco57I | - | 1284 | bacteria>cyanobacteria | Planktothrix rubescens | hypothetical protein [Planktothrix rubescens].
491099787 | REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S | HSDR_N_2+N6_Mtase+Eco57I+TaqI_C+DUF4337 | - | 1164 | archaea>euryarchaeota | Haloarcula sinaiiensis | type i restriction-modification system methyltransferase subunit [Haloarcula sinaiiensis].
754792650 | N6_Mtase | N6_Mtase+Eco57I | - | 1284 | bacteria>cyanobacteria | Planktothrix agardhii | hypothetical protein [Planktothrix agardhii].
654626038 | N6_Mtase | N6_Mtase+Eco57I | - | 833 | bacteria>cyanobacteria | Dolichospermum circinale | hypothetical protein [Dolichospermum circinale].
818764894 | HSDR_N+N6_Mtase+TaqIC/RAGNYA/Methylase_S | HSDR_N+N6_Mtase+Eco57I+TaqI_C | UX10_C0029G0010 | 906 | bacteria | Parcubacteria (Magasanikbacteria) bacterium GW2011_GWA2_45_39 | hypothetical protein UX10_C0029G0010 [Parcubacteria (Magasanikbacteria) bacterium GW2011_GWA2_45_39].
808798668 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I+TaqI_C | - | 1275 | bacteria>cyanobacteria | Limnoraphis robusta | hypothetical protein [Limnoraphis robusta].
493714293 | REase+N6_Mtase++TaqIC/RAGNYA/Methylase_S | HSDR_N_2+N6_Mtase+Methyltransf_26+Eco57I+TaqI_C | - | 1171 | archaea>euryarchaeota | Natrialba aegyptia | type i restriction-modification system methyltransferase subunit [Natrialba aegyptia].
493035869 | N6_Mtase+Eco57I | N6_Mtase+Eco57I | - | 664 | bacteria>cyanobacteria | Coleofasciculus chthonoplastes | N-6 DNA methylase [Coleofasciculus chthonoplastes].
755639426 | N6_Mtase | N6_Mtase | - | 765 | bacteria>actinobacteria | Leucobacter komagatae | hypothetical protein [Leucobacter komagatae].
568633968 | REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S | HSDR_N_2+APG6+N6_Mtase+Methyltransf_26+Eco57I+TaqI_C | BN903_17 | 1254 | archaea>euryarchaeota | Halorubrum sp. AJ67 | uncharacterized protein domain protein [Halorubrum sp. AJ67].
851124860 | REase+N6_Mtase+TaqIC/RAGNYA/Methylase_S | HSDR_N_2+APG6+N6_Mtase+Methyltransf_26+Eco57I+TaqI_C | - | 1209 | archaea>euryarchaeota | Halorubrum sp. AJ67 | hypothetical protein [Halorubrum sp. AJ67].
428252835 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I+TaqI_C | Mic7113_3025 | 1277 | bacteria>cyanobacteria | Microcoleus sp. PCC 7113 | type I restriction-modification system methyltransferase subunit [Microcoleus sp. PCC 7113].
754157697 | N6_Mtase+TaqIC/RAGNYA/Methylase_S | N6_Mtase+Eco57I+TaqI_C | - | 1280 | bacteria>cyanobacteria | Microcoleus sp. PCC 7113 | hypothetical protein [Microcoleus sp. PCC 7113].
Str-1 Str-2 Str-3 Str-4 Str-5 Str-6 Str-7 Conserved residues G K D D R <------------ HTH------------------------------------D------------------> Y +S Y D FINAL -------------HHHHHHHHHHHH-----------------E---EEEEEE---HHHHHHHHH-----EEEEE----HHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHHHHHHHH---------EEEEEHHHHHHH--HH--H---HHHHHHHH-H--HHHH-------------------EEEEEHHHHHHH----HH-----------EEEEE----------HH--HH-----HHHHHHHHHHHHH--------EEEEE----HHHHHHHHHHH---------------------EEEEEEEE-EE-----E------EEEEEEEE------------- ALIGN -------------HHHHHHHHHEEE-E-------------------EEEEE----HHHHHHHH------EEEE-----HHHHHHH---HHHHHHHHHHHH-H-------------------HHEEEEEEH-----------EEEHHHHHHH---HH--H---HHHHHHHH-H--HHHHHH-----------------HEEEEHHHHHHH-----------------EEEEE-----------H--HH-----HHHHHHHHHHHHH--------EEEEE---HHHHHHHHHHHH---------HHH---------HHHHEHHH-H------------HHHHHEHHH------------- HMM --------HHHHHHHHHHHHHHHHH-H---------------E---EEEEEE---HHHHHHHHHH----EEEEE--HHHHHHHHHHHHHHHHHHHHHHHE-E----------------HHHHHHHHHHHHH-------EEEEEEEHHHHHHHHHHH--H---HHHHHHHH-H--H-HEEE----HHHHHH---HHHHEEEEHHHHHHHH----HHHH---------EEEEEE--EEEEE--HH--HH-----EE-HHHHHHHEHHH-------EEEEE----HHHHHHHHHHH----------EE---------EEEEEEEE-EE-----EEE----EEEEEEEE---E--------- FREQ -------------HHHHHHHHHHEE-----------------E---EEEEEE----HHHHHHHH-----EEEE-----HHHHHHH---HHHHHHHHHHHH--------------------HHHHHHHHHHH-----------EEEHHHHHH---HH--H---HHHHHHHH-H----E--------------------EEEEEH-HHH-------------------EEEE-----------HH--HH-----HHHHHHHHHHHHH--------EEEEE----EEEEEHHHHH-------------------------EEEEE-E------------HHH-HEEE-------------- PSSM -------------HHHHHHHHHHHH---------------------EEEEE----HHHHHHHHH----EEEEE------HHHHH--HHHHHHHHHHHHHH------------------HHHHHHHHHHHHH------------HHHHHHH-------------------------------------HHH---H---HEEEHHHHHHHH----HHH----------EEEEE------------------------HHHHHHHHHH--------EEEEE-----HHHHHHHHHH---------------------EEEEEEEE-EE------------EEEEEEE-------------- 
TVAG_007390_Trichomonas_vaginalis_G3_123421258 FT-K-PPLPFIGNKSKIRKDLIDIL-KD----I------KGDY---VFVDLFGGSLYISHLLHIMFPKATIIANDYDNYVDRLKHIHDTNEILKELKERI-N-----V--KPDEKI-PTDQKEIVREVISK----A-PYIDWDTLCSRLLYSGAYK--Y---YDLDTLMK-K--VLYLRYTNLFDENIDA---YLEGLTIVHKDWRILF----NEYKDLPN-----VFFICDPPYFHTYSIQY--GD-----EWTLKDTVETFDVM-Y-YP--SIYFSSDKSYTEELIEILSKR-----YGEKFS---------VTKHVIGR-GL-----INPNTQKNNEVIYVT---F-S-I-NGEE TVAG_056220_Trichomonas_vaginalis_G3_123471301 FT-K-PPLPFFGNKSRCKDILLREL-KK----L------PNGL---TFVDLFGGSFYISHLCHTVFPDSKIICNDFDNYMNRLKHIPDTNKILKELKEKI-P-----I--GKMERI-PLDKKNTVREVLKK----A-EYIDWDSISSRLLYSGAIR--V---HDIETLMS-K--VLYLNYTKVFKENIEK---YTEGIEFVRCHWTELY----EKYKNKEN-----VFFIVDPPFYNTWDFQY--QV-----DWTLRDSLETLDVLHN-HP--CFYFTSDKSGLETVMRWLEDI-----HDYDFR---------YDKIEYER-G---------------------------------- TVAG_557140_Trichomonas_vaginalis_G3_123207322 FT-K-PPLPFIGNKSKIRKDLIDIL-KD----I------KGDY---VFVDLFGGSLYISHLLHIMFPKATIIANDYDNYVDRLKHIHDTNEILKELKERI-N-----V--KLDEKI-PTDQKEIVREVISK----A-PYIDWDTLCSRLLYSGAYK--Y---YDLDTLMK-K--VLYLRYTNLFDENIDA---YLEGLTIVHKDWRILF----NEYKDLPN-----VFFICDPPYFHTYSIQY--GD-----EWTLKDTVETFDVL-N-YP--SIYFSSDKSYTEELIEILSKR-----YGEKFS---------VTKHVIGR-GL-----INPNTQKNNEVIYVT---F-S-I-NGEE TVAG_271330_Trichomonas_vaginalis_G3_123479010 FT-K-PPLPFIGNKSKIRKDLIDIL-KD----I------KGDY---VFVDLFGGSLFISHLLHTLFPKATIIANDYDNYVDRLKHIHDTNEILKELKERI-N-----V--KLDEKI-PTDQKEIIREVISK----A-PYIDWDTLCSRLLYSGAYK--Y---YDLVTLMK-K--VLYLRYTHLFDENIDD---YLEGLTIVHKDWRILF----NEYKDLPN-----VFFICDPPYFHTYSIQY--GD-----EWTLKDTVETFDVL-N-YP--SIYFSSDKSYTEELIEILSKR-----YGEKFS---------VTKHVIGR-GL-----INPNTQKNNEVIYVT---F-S-I-NGEE TVAG_344370_Trichomonas_vaginalis_G3_123481438 
FT-K-PPLPFIGNKSKIRKDLIDIL-KD----I------KGDY---VFVDLFGGSLFISHLLHTLFPKSTIIANDYDNYVDRLKHIHDTNEILKELKERI-N-----V--KPDEKI-PTDQKEIVREVISK----A-PYIDWDTLCSRLLYSGAYK--Y---YDLDTLMK-K--VLYLRYTNLFDENIDA---YLEGLTIVHKDWRIIF----NEYKDLPN-----VFFICDPPYFHTYSIQY--GD-----EWTLKYIVETFDVL-N-YP--SIYFSSDKSYTEELIEILSKR-----YGEKFS---------VTKHVIGR-GL-----INPNTQKNNEVIYVT---F-S-I-NGEE TVAG_039120_Trichomonas_vaginalis_G3_123484516 FT-K-PPLPFFGNKSRCKDILLREL-KK----L------PNGL---TFVDLFGGSFYISHLCHTVFPDSKIICNDFDNYMDRLKHIPDTNKILKELKEKI-P-----I--GKMERI-PLDNKNIVREVLKK----A-EYIDWDSISARLLYSGAIR--V---HDIETLMS-K--VLYLNYTNVFKEDIEK---YIEGIEFVRCHWTDLY----EKYKDKEN-----VFFIVDPPFYNTWDFQY--QV-----DWTLRDSLETLDVLHN-HP--CFYFTSDKSGLETVMRWL------------------------------------------------------------------- TVAG_051460_Trichomonas_vaginalis_G3_123976294 FT-K-PPLPFFGNKSRCKDILLREL-KK----L------PNGL---TFVDLFGGSFYISHLCHTVFPDSKIICNDFDNYMNRLKHIPDTNKILKELKEKI-P-----I--GKMERI-PLDKKNTVREVLKK----A-EYIDWDSISSRLLYSGSIR--V---HDIETLMS-N--VLYLNYTKVFKEDIEK---YTEGIEFVRCRWTELY----EKYKNKEN-----VFFIVDPPFYNTYDFQY--QV-----DWTLRDSLETLDVLHN-HP--CFYFTSDK----------------------------------------------------------------------------- BN1088_RS06390_Sphingobacterium_sp_PM2-P1-29_786219197 YT-S-SPLPFMGQKRRFLKKFKEVL-IN----N------KPDA---IYVDLFGGSGLLSHIVKQYYPKATVVYNDYDNFSERLLHIDQTNELLASIRYLI-K--D--L--PNDKAI-PVDRRQPVIDCIYA-HEKRYGYVDYVSISSNLLFAMNYA------KDMDQLSR-Q--VFYKTIRESSY-NADG---YLEGVERVSMDYKSLF----ERYKDQSN-----VVFLVDPPYLSTETSVY--KSS----HWKLSDYLDVLDVL-K-VPH-YYYFTSNKSQIVELCEWLGSK--VP-GANPFR---N-----TVLYSNVS------S-VNYSSKYTDIMIVK-------------- M573_RS10255_Prevotella_intermedia_771514766 FN-S-APLPFQGQKRKFAKEFAKVL-HQ----Y------PDDT---VFVDLFGGSGLLSHITKHQKPNATVVYNDFDNYRQRLAHISQTNELLATIREIL-K--D--V--PRGKMV-AGGERQLVIDAIKR-HEKCYGYVDYITLSSSIMFSMKYC------TNIDDLEK-Q--GIYNRVRRGDFATCDG---YLDDLTVVSVDYKQLV----EQYKDVPN-----VVFIIDPPYLSTDTASY--NM-----NWQLSDYLDVLLVL-F-KHS-FIYFTSNKSSIIELCEWIARN--SG-MNNPFE---Q-----CNKVEVDT------S-MNYNSTYTDIMLYTT-------I----- 
JCM6334_RS11905_Prevotella_disiens_545432296 FN-S-APLPFQGQKRKFAKEFAKVL-QQ----Y------PDDT---MFVDLFGGSGLLSHITKRQKPNATVVYNDFDNYRQRLAHISQTNELLAAIREIL-K--D--V--PRGKMV-AGEERQLVIDAIKR-HEKFYGYVDYITLSSSIMFSMKYC------TNIDDFEK-Q--GIYNRVRRGDFATCDG---YLDDLTVVSVDYKQLV----EQYKDVPN-----VVFIIDPPYLSTDTASY--SM-----NWQLSDYLDVLLVL-F-NHS-FVYFTSNKSSIIELCEWIARN--SG-MNNPFE---R-----CNKVEINT------S-MNYNSAYTDIMLYTT-------I----- K941_RS0107590_Moraxella_caprae_656071893 HK-T-APLPFTGQKRMFLRHFEKILKDN----I------PNDGEGFTVLDVFGGSGLLAHNAKRILPKATVIYNDFDGYVERLAHIPTTNRLRQELFEIL-K--G--E--PRSVKL-SSTAKAKVLGHLRK-SADNGTFVDVQTLAGWLLFSGRQV------GSLDEFLA-E-STFYNRIVKTDYPNADG---YLDGLILECLDFEKLL----QKYQDTPN-----CLLLLDPPYLCTAQGAY--AKH---GYFGMTKFLRLMQ-F-V-RPP-FIFFSSTKSELMDYMAYVQRY-----EPNTWQRVGD-----FTHIKVNS------S-INAKVSYEDNILAK-------------- AJF4211_000450_Avibacterium_paragallinarum_JF4211_523674289 -------MPFVGQKRLFLNAFKQVL-NE-HI-S------DDGE-GWTIVDAFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVLKF------------- AJF4211_000170_Avibacterium_paragallinarum_JF4211_523673311 -------MPFVGQKRLFLNAFKQVL-NE-HI-S------DDGE-GWTIVDAFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF------------- HSM_RS03540_Histophilus_somni_501302336 
FS-Q-APLPFVGQKRLFLNAFKQVL-ND-NI-Q------NDGE-GWTIVDAFGGSGLLSHVAKRIKPKARVIYNDFDGYADRLKHISDTNRLRAELIQIV-G-DI--V--PKNKRL-DDNKKQEIINKIND-FN---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQFL----PKFADNPK-----ALFVLDPPYLCTKQNSY--KMA-N--YFDLVDFLQLIDLT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF------------- AJF4211_RS08790_Avibacterium_paragallinarum_545595880 YK-Q-APLPFVGQKRLFLNAFKQVL-ND-NI-S------DDGE-GWTIVDAFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF------------- AJF4211_RS06820_Avibacterium_paragallinarum_545595679 YK-Q-APLPFVGQKRLFLNAFKQVL-ND-NI-P------NDGE-GWTIIDTFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FN---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF------------- AJF4211_RS12815_Avibacterium_paragallinarum_545595274 YK-Q-APLPFVGQKRLFLNAFKQVL-NE-HI-S------DDGE-GWTIVDAFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF------------- AJF4211_RS06190_Avibacterium_paragallinarum_737726850 
FN-Q-APLPFVGQKRLFLNAFKQVL-NE-HI-S------DDGE-GWTIVDAFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVLKF------------- Z012_RS09750_Avibacterium_paragallinarum_805420685 FS-Q-APLPFVGQKRLFLNAFKQVL-NE-HI-P------DDGE-GWMIVDAFGGSGLLSHVAKRIKPKARVIYNDFDGYADRLKHISDTNRLRAELIQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDPPYLCTKQNSY--KMA-T--YFDLVDFLQLIDLT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF------------- AJF4211_RS10020_Avibacterium_paragallinarum_737726745 FN-Q-APLPFVGQKRLFLNAFKQVL-NE-HI-S------DDGE-GWTIVDAFGGSGLLSHVAKRIKSKARVIYNDFDGYADRLKHISDTNRLRAELLQIV-G-DI--V--PKNKRL-DNNKKQEIINKIND-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKLADYPSAED---YLDGLEIVSEPFQQLL----PKFADNPK-----ALFVLDSPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF------------- JP34_RS00795_Gallibacterium_anatis_746011652 FS-Q-APLPFVGQKRLFLNAFKQVL-NE-HI-P------DDGE-GWTIVDAFGGSGLLSHVAKRIKPKARVIYNDFDGYADRLKHISDTNRLRAELIQIV-D-DI--V--PKNKRL-DNNKNQEIINKING-FK---GFKDLNTIASWLLFSGQQV------SSFEELFS-K--TFWNGIKQADYPSAED---YLDGLEIVSEPFQTLL----PKFADNPK-----ALFVLDPPYLCTKQNSY--KMA-N--YFDLIDFLQLIDRT-R-PP--YVFFSSTKSEFVRFIAYMVQAK-KD-NWQAFD---G-----AERIVLQT------S-LNYQVSYEDNLVFKF------------- IE01_RS08000_Gallibacterium_anatis_517157783 YT-K-APLPFTGQKRNFLKLVEKAL-IE-NI-D------NEGE-GWTIVDVFGGSGLLARNAKDICPKSTVIFNDYDNYAERLANIKQTNQLRQQLAHCL-I--D--V--KPGARL-SNEKKKEIIDIIRR-FD---GYKDIKALTSWLLFSGNDV------KSLEALFE-K--SLWNNLTKRDYPVADD---YLDGLNIVRMDFKDLI----NQHRHKEK-----VLFLLDPPYICTEQTAY--KKE-T--FFDLIDFLELMRLI-R-PP--FIMFSSVRSEFNRYIDFLIKYK-ER-NYQHFV---D-----VVEQKINV------T-VNCNVNYQDNMIYKF------------- 
UMN179_RS12515_Gallibacterium_anatis_503512750 YT-K-APLPFTGQKRNFLKLVEKAL-IE-NI-D------NDGE-GWTIIDVFGGSGLLARNAKDICPKARVIFNDYDNYAERLANIKQTNQLRQQLAHCL-I--D--V--KPEARL-SNEKKKEIIDIIRR-FD---GYKDIKALTSWLLFSGNDV------KSLEVLFK-K--SLWNNLTKRDYPVADD---YLDGLDIVRMDFKDLI----NQHRHKEK-----VLFLLDPPYICTEQTAY--KKE-T--FFDLIDFLELMRLI-R-PP--FIMFSSAKSEFNRYIDFLIKHK-EK-NYQHFV---D-----AVEQKINV------R-VNHNVNYQDNMVYKF------------- P375_RS07850_Gallibacterium_genomosp_2_746067969 YT-K-APLPFTGQKRNFLKLVEKAL-IE-NI-D------NDGE-GWTIIDVFGGSGLLAGNAKDICSKARVIFNDYDNYAERLANIKQTNQLRQQLAYCL-I--D--V--KPEARL-SNEKKKEIIDIIRR-FD---GYKDIKALTSWLLFSGNDV------KSLEALFK-K--SLWNNLTKRDYPVADD---YLNGLDIVRMDFKDLI----NQHRHKEK-----VLFLLDPPYICTEQSTY--KKE-T--YFDLIDFLELMRLI-R-PP--FIMFSSAKSEFNRYIDFLIKHK-EK-NYRHFV---D-----AVEQKINV------R-VNHNVNYQDNMVYKF------------- JP33_RS07160_Gallibacterium_anatis_746094831 YT-K-APLPFTGQKRNFLKLVEKAL-IE-NI-D------NDDE-GWTIIDVFGGSGLLARNAKDICPKAQVIFNDYDNYAERLANIKQTNQLRQQLAHCL-I--D--V--KPEARL-SNEKKKEIIDIIRR-FD---GYKDIKALTSWLLFSGNDV------KSLEALFK-K--SLWNNLTKRDYPVADD---YLDGLDIVRMDFKDLI----NQHRHKEK-----VLFLLDPPYICTEQTAY--KKE-T--FFDLIDFLELMRLI-R-PP--FIMFSSVRSEFNRYINFLIKYK-ER-NYQHFV---D-----AVQQKINV------T-VNCNVNYQDNIVYKF------------- IO46_RS12295_Gallibacterium_anatis_746089913 YT-K-APLPFTGQKRNFLKLVEKAL-IE-NI-D------NDGE-GWTIIDVFGGSGLLARNAKDICSKAQVIFNDYDNYAERLANIKQTNQLRQQLAHCL-I--D--V--KPEARL-SNEKKKEIIDIIRR-FD---GYKDIKALTSWLLFSGNDV------KSLEALFK-K--SLWNNLTKRDYPVADD---YLDGLDIVRMDFKDLI----NQHRHKEK-----VLFLLDPPYICTEQSAY--KKE-S--YFDLIDFLELMRLI-R-PP--FIMFSSVRSEFNRYIDFLIRYK-ER-NYQHFV---D-----AVEQKINV------T-VNCNVNYQDNIVYKF------------- JP28_RS09245_Gallibacterium_anatis_746010293 
YT-K-APLPFTGQKRNFLKLVEKAL-IE-NI-D------NDGE-GWTIIDVFGGSGLLARNAKDICSKAQVIFNDYDNYAERLANIKQTNQLRQQLAHCL-I--D--V--KPGTRL-SNEKKKEIIDIIRR-FD---GYKDIKALTSWLLFSGNDV------KSLEALFK-K--SLWNNLTKRDYPVADD---YLDGLDIVRMDFKDLI----NQHRHKEK-----VLFLLDPPYICTEQSAY--KKE-S--YFDLIDFLELMRLI-R-PP--FIMFSSVRSEFNRYIDFLIRYK-ER-NYQHFV---D-----AVEQKINV------T-VNCNVNYQDNIVYKF------------- ERS450003_01064_Haemophilus_influenzae_777210024 FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GNGE-GWTIIDTFGGSGLLSHTAKRLKPKARIIYNDFDGYAERLAHIDDINQLRAELYSVV-G-NA--T--PKNKRM-TKDCKAECIKIIQN-FK---GYKDLNCLASWLLFSGQQV------ATFDDLFQ-H--NFWHCIRQSDYPKADG---YLDGVEIVKESFHTLL----PKFSDDPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF------------- NTHI723_RS04270_Haemophilus_influenzae_764389671 FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GEGE-GWTIIDTFGGSGLLSHTAKRLKPKARVIYNDFDGYAERLAHIDDINQLRAELYSVV-G-NA--T--SKNKRM-TKDCKAECIRIIQN-FK---GYKDLNCLASWLLFSGQQV------ATLDDLFQ-H--NFWHCIRQSDYPKADG---YLDGVEIVKESFHTLL----PKFSDDPK-----ALFVLDPPYLCTKQESY--KQA-K--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AERITVNA------K-LNYQVAYEDNLVYKF------------- HMPREF9095_RS06800_Haemophilus_491953443 FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWIIIDTFGGSGLLSHTAKRLKPKARVIYNDFDGYAERLAHIDDINRLRAELYSVV-G-NA--T--SKNKRM-TKDCKAECIRIIQN-FK---GYKDLNCLASWLLFSGQQV------ATLDDLFQ-H--NFWHCIRQSDYPKADG---YLDGVEIVKESFHTLL----PKFSNDPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF------------- C645_RS00620_Haemophilus_influenzae_803453319 FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHTAKRLKPKARIIYNDFDGYAERLAYINDTNALRTQIFAKI-G-NA--T--PKNKRL-PKSLKAEIIKIIDQ-FK---GYKDLNCLTSWLLFSGQQV------SSLDELYK-K--DFWHCVRQSDYPKADG---YLDGVEIVRESFHTLL----PKFTDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF------------- 
C645_RS06690_Haemophilus_influenzae_803453531 FK-Q-APLPFIGQKRMFLKQFETIL-ND-NI-P------DDGE-GWTIIDTFGGSGLLSHTAKRLKPKAHVIYNDFDGYAERLVHIDDTNALRAQIFAKI-G-NT--T--PKNKRL-PKSLKAEIIQIIDE-FQ---GYKDLNCLASWLLFSGQQV------GSLEELYR-K--DFWHCVRLSDYPSADG---YLDGVEVIRESFHALL----PKFVDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF------------- HIBPF_RS02220_Haemophilus_influenzae_503290984 FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHAAKVIKPKAHVIYNDFDSYAERLAYINDTNALRTQIFAKI-G-NA--T--PKNKRL-PKSLKAEIIKIIDQ-FK---GYKDLNCLTSWLLFSGQQV------SSLDELYK-K--DFWHCVRLSDYPSAEG---YLDGVEVIRESFHTLL----PKFSDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-L-PP--YIFFSSTKSEFVRFIEYMVDDK-VH-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF------------- SU58_RS08535_Haemophilus_influenzae_756163264 FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHAAKVMKPKAHVIYNDFDGYAERLVHIDDTNALRAQIFAKI-G-NT--T--PKNKRL-PKSLKAEIIQIIDE-FQ---GYKDLNCLASWLLFSGQQV------GSLEELYR-K--DFWHCVRLSDYPGAEG---YLDGVEIVKESFHTLL----PKFSNDPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF------------- CK45_RS04150_Haemophilus_influenzae_696244941 FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHTAKRLKPKASVIYNDFDGYAERLAYINDTNALRTQIFAKI-G-NA--T--PKNKRL-PKSLKAEIIKIIDQ-FK---GYKDLNCLTSWLLFSGQQV------SSLDELYK-K--DFWHCIRQSDYPKADG---YLDGVEIVRESFHTLL----PKFTDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMVDDK-VH-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF------------- JP32_RS09880_Gallibacterium_anatis_746007528 
FA-Q-APLPFVGQKRMFLQQFRAVL-NQ-LI-T------NDGE-GWTIVDAFGGSGLLSHTAKRLKPNAHVIYNDFDGYAERLKHIDDINRLRQILSELL-A--N--Y--PRDRRL-DIAMRHKVIDAIES-FN---GYKDPHILCAWLLFSGQQV------KSINELYS-R--GFYNCIRQSDYTTADG---YLDGIEVVNESFVTLL----PKFADDSK-----AIFVLDPPYLCTKQASY--KQE-R--YFDLIDFLELIRLT-R-PP--YLFFSSTKSEFIRFVDWLIASK-GD-NWQSFV---D-----YQRIVIQT------S-ASHNGQYEDNLIYNF------------- HMPREF1199_RS02420_Prevotella_oralis_565956523 YL-S-APLPFVGQKRMFAKEFIKVL-DR----F------SDGT---TFVDLFGGSGLLSHITKCRKPNSTVVYNDFDNYRKRIENIPVTNALLSDLRKIV-R--D--I--PRKKRI-TGETREKVLACLKR-YLQQYGYVDFITISSSILFSMKYV------TIFEDVGK-E--TLYNNIRMNDYPPCSD---YLDGLQVVCCDYKNLY----DKYKDMPK-----VVFLIDPPYLNTEVGTY--NM-----YWKLPDYLDVLKIL-Q-GCS-FVYFTSDKSSIIELCEWMEKN--KE-TGNPFK---N-----CSIASVNA------H-VNHNAGYTDMMLYTV------------- HMPREF1475_RS02705_Prevotella_oralis_738993231 YN-S-APLPFVGQKRMFAKKFKKVL-EQ----F------PDGT---TFVDLFGGSGLLSHIAKWQKPNSKVVYNDFDGYRLRLEHIPQTNELLAEIREIV-R--N--V--PRHKAI-AGETRNYIFESLLR-HQERYGYLDFITVSSSVMFSMKYQ------LSINDMRK-E--TLYNNVRTTDYPICDD---YLEGLTITSVDYKQLF----NQYKDRPD-----VVFLVDPPYLSTDVGTY--KM-----YWRLSDYLDVLEVL-T-GHS-FVYFTSNKSSILELCDWIGRN--KN-MDNPFK---K-----CTKVEVNA------H-MNYNATYTDIMLYTN-------V----- HMPREF0663_11914_Prevotella_oralis_ATCC_33269_323094424 YN-S-APLPFVGQKRMFAKKFKKVL-EQ----F------PDGT---TFVDLFGGSGLLSHIAKWQKPNSKVVYNDFDGYRLRLEHIPQTNELLAEIREIV-R--N--V--PRHKAI-AGETRNYIFESLLR-HQERYGYLDFITVSSSVMFSMKYQ------LSINDMRK-E--TLYNNVRTTDYPICDD---YLEGLTITSVDYKQLF----NQYKDRPD-----VVFLVDPPYLSTDVGTY--KM-----YWRLSDYLDVLEVL-T-GHS-FVYFTSNKSSILELCDWIGRN--KN-MDNPFK---K-----CTKVEVNA------H-MNYNATYTDIMLYTN-------V----- HMPREF0645_RS12560_Prevotella_bergensis_494312007 YN-S-APLPFVGQKRMFAKEFRKVL-EQ----F------PDGT---TFVDLFGGSGLLSHITKCEKPHSKVVYNDFDGYRLRLEHIPQTNELLAKLREIV-R--K--I--PKHKPI-TGEAREQVFECLRE-HQECYGYLDFITISSSIMFSMKYR------LSIDEMRK-E--ALYNNVRSTDYPLCCD---YLDGLTIVSSDYKQVF----NLYKNTPG-----VVFFVDPPYLSTEVGTY--KM-----YWRLADYLDVLTVL-A-GHS-FVYFTSNKSSILELCDWVGRN--KT-VGNPFE---K-----CTKVEFNA------H-MNYNATYTDMMLYKK-------A----- 
L888_RS0101115_Hallella_seregens_654481515 YL-S-APLPFVGQKRMFAKEFRKVL-DQ----I------PDGT---TFVDLFGGSGLLSHIAKYDKPHSEVVYNDFDGYRRRLEHIPQTNELLAELRDIV-R--D--V--PRYKAI-TGETREHVFGCLLQ-HEKRYGYIDFITVSSSIMFSAKYC------LSIDDMRK-E--ALYNKVRSSDYSECPD---YLDGLTIVSKDYKQLF----KEYRDKPD-----VVFLVDPPYLGTEVGTY--KM-----FWKLADYLDVLKVL-Q-GHA-YIYFTSNKSSIIELCEWLGQN--RD-MGNPFE---H-----STRVEFKA------Q-MNYNASYTDMMLYKN-------A----- M082_RS01650_Bacteroides_696270804 YL-S-APLPFVGQKRMFAKEFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRPDAVVVYNDFDNYRQRLANIPATNVLLSDLRRIA-E--G--E--PRNKRI-TGEVRDKMFARIER-EEKEHGYVDYITISASLLFAMKYV------TSLEGMKK-E--AIYNRIRQTDYPEAKD---YLEGLTITGEDYKEVF----KRYKDAPG-----VVFLVDPPYLSTEVGTY--KM-----FWRLADYLDVLTVL-K-GHS-FVYFTSNKSSILELCDWMDRN--PF-VGSPFK---E-----CRRVEFSA------N-VNYQAKYTDMMLYTK-------P----- M137_RS11600_Bacteroides_494836361 NL-S-APLPFVGQKRMFAKEFIKVL-EQ----F------PEDT---VFVDLFGGSGLLSHIAKRSKPDATVVYNDFDNYRFRLKNIPQTNKLLADIRELV-G--NS-I--PKHKPI-KGELRERIFKRIEE-EELNVGYVDFITLSSSLMFSMKYK------LSVAEMRK-E--VLYNNIRKTGYPESSD---YLKGLEIVSCDYKAVF----NQYKDVPG-----VVFLIDPPYLSTDVGTY--NM-----YWRLSDYLDVLKIL-E-KHS-FVYFTSNKSSILELCEWIGAN--KT-IGNPFE---G-----CTKKEFNA------H-MNYSAEYTDMMLYKK-------Q----- BN456_01886_Prevotella_sp_CAG:1031_512184299 YL-S-APLPFVGQKRMFAKQFIEVI-RQ----Y------PADT---VFVDLFGGSGLLSHITKHFHPESRVIYNDFDNYRLRINNIPRTNSLLESIRPIA-S--Q--F--DRHKPI-TGGAREQIFSLLEQ-EEKETCFLDFITLSSSLMFSMKYK------MSIEGMRG-E--TLYNNVRKNGYEPCRD---YLAGLEIVSCDYRELF----EQYKDTPG-----VVFFVDPPYLSTDVGTY--RM-----YWRLADYLDVLSVL-P-GHN-FIYFTSEKSCIIELCEWMGRH--PS-LGDPFA---R-----CQRREFNA------T-MNYNASYKDIMLFTI-------P----- HMPREF1070_RS05245_Bacteroides_ovatus_490456001 
YL-S-APLPFVGQKRMFAKEFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRPDATVVYNDFDNYRRRLANIPATNVLLSDLRWIA-E--G--E--PRNKRI-TGEVCEKMFARIER-EEKERGYVDYITLSSSLLFAMRYM------LSLEDMRK-E--TLYNNIRQTDYPEAKD---YLEGLTITGEDYKEVF----KRYKDVPG-----VVFLVDPPYLSTEVGTY--KM-----YWRLADYLDVLTVL-K-GHP-FVYFTSNKSSILELCDWIDRN--PF-IGSPFK---N-----CRKVEFNA------H-MNYNSKYTDMMLYTK-------P----- BSFG_RS03650_Bacteroides_sp_4_3_47FAA_495946269 YL-S-APLPFVGQKRMFAKEFIKVL-DR----F------PDST---VFVDLFGGSGLLSHITKRVRPDAVVVYNDFDNYRQRLDNIPNTNQLLADLRRIT-A--E--L--PRKKRI-TGEARERILARIEK-EEKEHGYVDYITLSSSLLFSMKYV------LNLDNMRK-E--TFYNTIHRTDYSDAKD---YLEGLTIVSEDYKEVF----KRYKDVLG-----VVFLVDPPYLSTEVGTY--KM-----YWHLADYLNVLHVL-K-EHS-FVYFTSNKSSILELCSWIGDN--PS-IGNPFK---D-----CVKVEFNA------C-VNYSSCYTDIMLCKQ-------G----- BA92_RS10770_Bacteroides_490455210 YL-S-APLPFVGQKRMFAREFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRPDATVVYNDFDNYRCRLVNIPATNVLLSDLRRIA-E--G--E--PRNKRI-TGEVRDKMFARIER-EEKEHGYVDYITVSASLLFAMKYV------TSLEGMKK-E--AIYNRIRQTDYPEAKD---YLEGLTITSEDYKEVF----KRYKDVPG-----VVFLVDPPYLSTEVGTY--KM-----FWRLADYLDVLTVL-K-GHS-FVYFTSNKSSILELCDWMDRN--PF-VGSPFK---E-----CRKVEFSA------S-VNYQAKYTDMMLYTK-------P----- JCM15754_RS11780_Prevotella_aurantiaca_640570678 YY-S-APLPFVGQKRMFAKEFKKVL-EQ----F------PDGT---TFVDLFGGSGLLSHITKSEKPHSKVVYNDFDGYRLRLEHVSQTNELLSELRKIV-R--D--L--PKHKPI-VGEARKRIFECLIK-CQERYGYLDFITISSSLLFSMKYC------LNIDDMNK-E--TLYNNIRSTDYPLCDG---YLDGLTIVSTDYKQVF----NQYKDTPN-----VVFLVDPPYLSTEVGTY--KM-----YWKLADYLDVLSVL-A-GHS-FVYFTSNKSSILELCDWIGRN--KH-IGNPFE---K-----CTKVECNA------R-MNYNSTYTDMMLYKN-------A----- JCM17725_RS06630_Prevotella_scopos_647557435 YY-S-APLPFVGQKRMFAKEFKKVL-EQ----F------PDGT---TFVDLFGGSGLLSHITKSEKPHSKVVYNDFDGYRLRLEHVPQTNELLSELRKIV-R--E--L--PKHKPI-VGGARKQIFECLIK-HQDRYGYLDFITISSSLLFSMKYC------LNIDDMNK-E--TLYNNIRSTDYPLCDG---YLDGLTIVSADYKQVF----NQYKDTPN-----VVFLVDPPYLSTEVGTY--KM-----YWKLADYLDVLSVL-A-GHS-FVYFTSNKSSILELCDWIGRN--KH-IGNPFE---K-----CTKVEFNA------R-MNYNSTYTDMMLYKN-------A----- 
VK67_RS05530_Mannheimia_haemolytica_493291112 FK-Q-APLPFVGQKRMFLAQVSQIL-NE-NI-T------DDGQ-GWTIIDVFGGSGLLAHTAKHIKPKAHIIYNDYDGYAERLKHIPDTNRLRKQIYDII-G-KS--T--PKNKRL-DPDKKSQVINIIQS-FD---GYIDVNCVASWLLFSGQQI------NSLEDLFN-K--IFWNGVRQTDYPSAEG---YLDGIEVTHESFHKLL----PRFQHKDK-----VLLLLDPPYLCTRQESY--KQA-T--YFDLIDFLRLINLT-K-VP--YIFFSSTKSEFVRFVDFMVSEK-KD-NWQTFE---N-----AKWIKVKA------S-LNYQSTYEDNLVYKF------------- L278_RS124350_Mannheimia_haemolytica_544865513 FK-Q-APLPFVGQKRMFLAQVSQIL-NE-NI-T------DDGQ-GWTIIDVFGGSGLLAHTAKHIKPKAHIIYNDYDGYAERLKHIPDTNRLRKQIYDII-G-ES--T--PKNKRL-DPDKKSQVINIIQS-FD---GYIDVNCVASWLLFSGQQI------NSLEDLFN-K--IFWNGVRQTDYPSAEG---YLDGIEVTHESFHKLL----PRFQHKDK-----VLLLLDPSYLCTRQESY--KQA-T--YFDLIDFLRLINLT-K-VL--YIFFSSTKSEFVRFVDFMVSEK-KD-NWQAFE---N-----AKWIKVKA------S-LNYQSTYEDNLVYKF------------- JCM16497_RS23455_Bacteroides_sartorii_640590225 YL-S-APLPFVGQKRMFAKEYIKVL-GE----I------KGAK---VFVDLFGGSGLLSHITKQQRPDVTVIYNDFDNYSRRLEHIGHTNAILDRLRDIL-A--S--V--PRLKIV-PKPLKGRIIEMLAE-EEAETGFVDYITLSTSLLFSMKYA------TTLDEMRK-Q--TMYNRIKRTDY-DADG---YLDDVVVESCDYRELF----EKYRNRDD-----VVFLVDPPYLSTDVSTY--SM-----CWKLSDYLDVLKVL-V-GHK-YVYFTSNKSSIVELCDWLGRN--KE-IGNPFI---G-----STRKEFNA------S-MNYNSHYTDIMLYNV-------A----- J450_RS04260_Mannheimia_haemolytica_525759492 FK-Q-APLPFVGQKRMFLAQVSQIL-NE-NI-P------DDGQ-GWTIIDVFGGSGLLAHTVKHIKPKAHIIYNDYDGYAERLKHIPDTNRLRKQIYDII-G-ES--T--PKNKRL-DPDKKSQVINVIQS-FD---GYIDVNCVASWLLFSGQQI------NSLENLFN-K--IFWNGIRQTDYPSAEG---YLDGIEVTHESFHKLL----PRFQHKDK-----VLLLLDPPYLCTRQESY--KQA-T--YFDLIDFLRLINLT-K-VP--YIFFSSTKSEFVRFVDFMVSEK-KD-NWQTFE---N-----AKWIKVKA------S-LNYQSTYEDNLVYKF------------- C801_RS14355_Bacteroides_uniformis_511019476 
YL-S-APLPFVGQKRMFAKEYIKVL-GE----V------KDAK---VFVDLFGGSGLLSHITKRQCPDATVVYNDFDNYRRRIENIPRTNALLVDLRNIV-R--G--V--PKHGCI-KGTMRDEVFVRLEQ-EERTHGYIDFITISSAIMFSMKYK------LSIPEMKK-E--ALYNNIRQSDYPAASD---YLEGLTIVSCDYKVLF----EKYRDRDD-----VVFLVDPPYLSTDVGTY--NM-----YWKLSDYLDVLKVL-V-GHR-YVYFTSNKSSIIELCDWLDKN--KE-IGNPFI---G-----ATRKEFNA------S-MNYNSHYTDIMLYNV-------A----- HMPREF1181_RS12035_Bacteroides_490416379 YS-Q-APLPFVGQKRMFASEFRKVL-KR----F------SDKT---VFIDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLGNIHRTNELLEDLRETA-K--G--Y--PRHKKI-TGSMRDVFLERILQ-DEQN-GFVDYLTLSSSLLFSMKYV------LNFEELKK-Q--NLYNKLRQNDY-NCDG---YLDGLEVVCCDYKELA----DKYNADTD-----VVFLVDPPYMATDISTY--KM-----DWRLQDYLDVLLVL-S-GHP-FVYFTSGKSPILDFCEWMEQH--PG-IGNPFR---G-----TCKSTLTA------R-MNYNSSYTDIMLYKG-------T--AEA HMPREF1079_00192_Bacteroides_fragilis_CL05T00C42_392705106 YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFSGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA BSBG_02560_Bacteroides_sp_9_1_42FAA_229455867 NL-S-APLPFVGQKRMFAKEFIKVL-EQ----F------PEDT---VFVDLFGGSGLLSHIAKRSKPDATVVYNDFDNYRFRLKNIPQTNKLLADIRELV-G--NS-I--PKHKPI-KGELRERIFKRIEE-EELNVGYVDFITLSSSLMFSMKYK------LSVAEMRK-E--VLYNNIRKTGYPESSD---YLKGLEIVSCDYKAVF----NQYKDVPG-----VVFLIDPPYLSTDVGTY--NM-----YWRLSDYLDVLKIL-E-KHS-FVYFTSNKSSILELCEWIGAN--KT-IGNPFE---G-----CTKKEFNA------H-MNYSAEYTDMMLYKK-------Q----- HMPREF1079_RS0100985_Bacteroides_fragilis_695330037 YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFSGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA 
C799_RS11490_Bacteroides_thetaiotaomicron_696234173 YS-Q-APLPFVGQKRMFASEFRKVL-ER----F------GDKT---VFVDLFGGSGLLAHITKRERPDATVIYNDHDNYRGRLENIGRTNRLLADLRDMA-R--E--H--PRHKMI-TGSLRAAFLERIRQ-EEQT-GAVDYITLSSSLLFSGKYA------LNLEELGK-Q--SFYNNLRLSDY-RCGG---YLDGLEVVCCDYKVLA----DKYGCSPD-----VVFLVDPPYMATDTSTY--QM-----DWKLKDYLDVLLVL-K-GHP-FVYFTSSKSPILDFCSWMEEH--PG-SGNPFR---G-----AGRSTFAA------R-MNYASSYTDIMLYRE-------M--PGA BFAG_03319_Bacteroides_fragilis_3_1_12_313137261 YL-S-APLPFVGQKRMFAREFIKVL-EQ----F------PEDT---VFVDLFGGSGLLSHIAKCRKPDATVVYNDFDNYRRRMAHIPQTNRLIADIRGMV-G--DA-V--PRHRPI-TGELRERIFRRIEQ-EERTVGYVDFITLSSSLMFSMKYR------LSVPEMRK-E--ALYNNIRKADYPECAD---YLDGLEIVSCDYKEVF----GRYKDTPG-----VVFLVDPPYLSTDVGTY--NM-----YWHMSDYLDVLNVL-A-GHS-FVYFTSNKSSILELCEWIGRN--RD-IGNPFE---K-----CTRVEFNA------H-MNYNASYTDMMLYRK-------E----- C799_02456_Bacteroides_thetaiotaomicron_dnLKV9_507741308 YS-Q-APLPFVGQKRMFASEFRKVL-ER----F------GDKT---VFVDLFGGSGLLAHITKRERPDATVIYNDHDNYRGRLENIGRTNRLLADLRDMA-R--E--H--PRHKMI-TGSLRAAFLERIRQ-EEQT-GAVDYITLSSSLLFSGKYA------LNLEELGK-Q--SFYNNLRLSDY-RCGG---YLDGLEVVCCDYKVLA----DKYGCSPD-----VVFLVDPPYMATDTSTY--QM-----DWKLKDYLDVLLVL-K-GHP-FVYFTSSKSPILDFCSWMEEH--PG-SGNPFR---G-----AGRSTFAA------R-MNYASSYTDIMLYRE-------M--PGA HMPREF1055_02982_Bacteroides_fragilis_CL07T00C01_387775820 YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA HMPREF1018_02174_Bacteroides_sp_2_1_56FAA_335946057 
YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA M070_4300_Bacteroides_fragilis_str_A7_(UDC12-2)_596213380 YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCHWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA JCM12083_RS06185_Prevotella_shahii_647518976 YK-S-APLPFVGQKRMFVKEFIKVL-SQ----F------PADT---VFVDLFGGSGLLSHIAKRTKPESTVVYNDFDNYRCRLQHIPQTNRLIADLRDIV-A--DK-V--PRNKPI-TGELRKRIFARIEQ-EERVVGYVDFITLSSSIMFSMKYK------LSVPEMRK-E--TLYNSIRKSDYPTCPD---YLDGLEITSCDYKELF----NQHKDTPS-----VVFLVDPPYLSTEVGTY--NM-----YWKMADYLDVLNVL-A-GHS-FVYFTSNKSSILELCEWLGRN--RS-LGNPFE---N-----AVKVEFNA------H-MNYNASYTDMMLFKK-------E----- HMPREF9008_RS11875_Parabacteroides_sp_20_3_496053795 YL-S-APLPFVGQKRMFARKFMKVL-EQ----Y------PEST---VFVDLFGGSGLLSHITKRCKPEATVIYNDFDNYHKRLENIPRTNRLIADLRSMV-G--NS-V--PRHKTI-TGELRERIFSRILQ-EEHETGYVDFITLSSSLMFSMKYK------LSVPEMRK-E--ALYNNIRKADYPECTD---YLEGLEIVSCDYKELF----NRYKDTPG-----VVFLVDPPYLSTDVGTY--NM-----SWRMSDYLDVLNVL-S-GHP-FVYFTSNKSSILELCEWIGKN--KN-TGNPFE---G-----CTRMEFNA------H-INYSSSYTDMMLFKK-------E----- M118_4484_Bacteroides_fragilis_str_3783N1-2_595939381 
YL-S-APLPFVGQKRMLAKEFMKVL-EQ----Y------PDGT---LFVDLFGGSGLLSHITKSLKPHSTVIYNDFDNYRFRMKHIPQTNQLLADIREMV-G--NS-V--PRHKII-KGELRERIFSRIEQ-EENSTGYVDFITLSSSILFSMKYK------LSVQDMRK-E--ALYNNIRKTGYPECTD---YLEGLEIVSCDYKEVF----NRYKDIPG-----VVFLVDPPYLSTDVGTY--NM-----YWNMADYLDVLNVL-K-GHS-YVYFTSNKSSILELCEWIGKN--RD-LGNPFE---N-----CTKVEFNA------H-MNYNSSYTDMMLYKK-------E----- M088_0657_Bacteroides_ovatus_str_3725_D1_iv_649508868 YL-S-APLPFVGQKRMFAKEFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRPDAVVVYNDFDNYRQRLANIPATNVLLSDLRRIA-E--G--E--PRNKRI-TGEVRDKMFARIER-EEKEHGYVDYITISASLLFAMKYV------TSLEGMKK-E--AIYNRIRQTDYPEAKD---YLEGLTITGEDYKEVF----KRYKDAPG-----VVFLVDPPYLSTEVGTY--KM-----FWRLADYLDVLTVL-K-GHS-FVYFTSNKSSILELCDWMDRN--PF-VGSPFK---E-----CRRVEFSA------N-VNYQAKYTDMMLYTK-------P----- BFAG_RS07280_Bacteroides_fragilis_695344948 YL-S-APLPFVGQKRMFAREFIKVL-EQ----F------PEDT---VFVDLFGGSGLLSHIAKCRKPDATVVYNDFDNYRRRMAHIPQTNRLIADIRGMV-G--DA-V--PRHRPI-TGELRERIFRRIEQ-EERTVGYVDFITLSSSLMFSMKYR------LSVPEMRK-E--ALYNNIRKADYPECAD---YLDGLEIVSCDYKEVF----GRYKDTPG-----VVFLVDPPYLSTDVGTY--NM-----YWHMSDYLDVLNVL-A-GHS-FVYFTSNKSSILELCEWIGRN--RD-IGNPFE---K-----CTRVEFNA------H-MNYNASYTDMMLYRK-------E----- M125_RS18320_Bacteroides_492741740 YL-S-APLPFVGQKRMFAREFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRSDATVVYNDFDNYRCRLVNIPATNVLLSDLRRIA-E--G--E--PRNKRI-TGEIRDKMFARIER-EEKEHGYVDYITVSASLLFAMKYV------TSLEGMKK-E--AIYNRIRQTDYPEAKD---YLEGLTITSEDYKEVF----KRYKDVPG-----VVFLVDPPYLSTEVGTY--KM-----FWRLADYLDVLTVL-K-GHS-FVYFTSNKSSILELCDWMDRN--PF-VGSPFK---E-----CRKVEFSA------S-VNYQAKYTDMMLYTK-------P----- HMPREF9007_RS09530_Bacteroides_sp_1_1_14_496037689 YL-S-APLPFVGQKRMFAKEFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRPDAVVVYNDFDNYRQRLANIPVTNVLLSDLHRIA-E--G--E--PRNKRI-TGEVRNKMFARIER-EEKEHGYVDYITISASLLFAMKYV------TCLKEMKK-E--TIYNRIRRTDYPEAED---YLEGITVTCEDYKEVF----KRYKDVPG-----VVFLVDPPYLSTEVGTY--KM-----YWRLADYLNVLNVL-K-GHA-FVYFTSNKSSILELCDWMGRN--PF-LGNPFK---E-----CRKVEFSA------N-VNYQAKYTDMMLYTV-------P----- 
BSFG_RS03795_Bacteroides_sp_4_3_47FAA_495946224 YS-Q-APLPFVGQKRMFASEFRKVL-KR----F------SDRT---VFVDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLENISRTNALLSDLRRLS-E--G--I--PRHRML-SRDMHGIFLERIRR-EENT-GFVDYLTISSSLLFSGKYA------RNIGELGK-L--NFYNNIRLSDY-SCEG---YLDGLEVVCCDYRELT----DKYRDSPD-----VVFLIDPPYMATDISTY--RM-----DWKLTDYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PE-TGNPFK---G-----AGMSTLTA------R-MNYSSSYTDIMLYKE-------T--TEA BSBG_RS01095_Bacteroides_sp_9_1_42FAA_495950973 YL-S-APLPFVGQKRMFARKFMKVL-EQ----Y------PEST---VFVDLFGGSGLLSHITKRCKPEATVIYNDFDNYHKRLENIPRTNRLIADLRAMV-G--NS-V--PRHKTI-TGELRERIFSRILQ-EEHETGYVDFITLSSSLMFSMKYK------LNVPEMRK-E--ALYNNIRKADYPECTD---YLEGLEIVSCDYKELF----NRYKNTPG-----VVFLVDPPYLSTDVGTY--NM-----SWRMSDYLDVLNVL-S-GHP-FVYFTSNKSSILELCEWIGKN--KN-IGNPFE---G-----CTRMEFNA------H-INYSSSYTDMMLFKK-------E----- HMPREF1981_RS13185_Bacteroides_pyogenes_545407693 HL-S-APLPFVGQKRMFAKEFVKVL-EQ----F------SEKT---VFVDLFGGSGLLSHITKCVRPDAVVVYNDFDNYRKRLGNIPRTNRLLSDLREIS-N--G--T--PKHKPI-TGEKREKVFARIQK-EEKEYGYVDYITLSSSLLFSMKYK------ICLEEMKK-E--TIYNKIRVSDYPEAGD---YLQGLTITCDDYKKVF----NQYKDVPG-----VLFLIDPPYLSTEVGTY--NM-----SWRLADYLDVLGVL-K-EHS-FVYFTSNKSSILELCDWIGRN--HS-IGNPFK---K-----CRKVEFNA------S-MNYSAKYIDIMLYTV-------P----- BVU_RS04850_Bacteroides_vulgatus_500646766 YS-Q-APLPFVGQKRMFASEFRKVL-KR----F------SDRT---VFVDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLENISRTNALLSDLRRLS-E--G--I--PRHRML-SKKMHGMFLERIRR-EEST-GFVDYLTISSSLLFSGKYA------RNIGELGK-L--NFYNNMRLSDY-CCEG---YLDGLEVVYCDYRELV----DRYRDSPD-----VVFLIDPPYMATDISTY--RM-----DWKLTDYLDVLLVL-E-GHP-FIYFTSGKSPILDFCRWMEEH--PE-TGNPFK---G-----AGISTLTA------R-MNYSSSYTDIMLYKE-------M--TEA M137_RS14330_Bacteroidales_492281093 
YL-S-APLPFVGQKRMLAKEFMKVL-EQ----Y------PDGT---LFVDLFGGSGLLSHITKSLKPHSTVIYNDFDNYRFRMKHIPQTNQLLADIREMV-G--NS-V--PRHKII-KGELRERIFSRIEQ-EENSTGYVDFITLSSSILFSMKYK------LSVQDMRK-E--ALYNNIRKTGYPECTD---YLEGLEIVSCDYKEVF----NRYKDIPG-----VVFLVDPPYLSTDVGTY--NM-----YWNMADYLDVLNVL-K-GHS-YVYFTSNKSSILELCEWIGKN--RD-LGNPFE---N-----CTKVEFNA------H-MNYNSSYTDMMLYKK-------E----- M080_1486_Bacteroides_fragilis_str_3397_T10_595910038 YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA M116_RS19700_Bacteroides_fragilis_492352476 YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRGRLENIGRTNTLLGDLRKIV-G--I--Y--PHNQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEH-L--CFYNKIRQADY-RCDG---YLDGLEVVCYDYKELA----ETYRVLPG-----VVFLVDPPYMGTDISTY--RM-----DWKLGDYLDVLPVL-K-GHP-FVYFTSSKSPILDFCKWMEEH--PG-TGNPFK---G-----TGRSAITA------R-MNYNSSYTDIMLYNN-------M--ACT M098_0958_Bacteroides_vulgatus_str_3775_SR(B)_19_649521449 YS-Q-APLPFVGQKRMFASEFRKVL-KH----F------SDRT---VFVDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLENISRTNALLSDLRRLS-E--G--I--PRHRML-SKKMHSIFLERIRR-EEST-GFVDYLTISSSLLFSGKYA------RNIGELEK-L--NFYNNMRLSDY-CCEG---YLDGLEVVCCDYRELV----DRYRDSPN-----VVYLIDPPYMATDISTY--RM-----DWKLTDYLDVLLVL-K-GHP-FIYFTSGKSPILDFCRWMEEH--PE-TGNPFK---G-----AGMSTLTA------R-MNYSSSYTDIMLYKE-------M--TEA M098_RS09095_Bacteroides_vulgatus_696374681 YS-Q-APLPFVGQKRMFASEFRKVL-KH----F------SDRT---VFVDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLENISRTNALLSDLRRLS-E--G--I--PRHRML-SKKMHSIFLERIRR-EEST-GFVDYLTISSSLLFSGKYA------RNIGELEK-L--NFYNNMRLSDY-CCEG---YLDGLEVVCCDYRELV----DRYRDSPN-----VVYLIDPPYMATDISTY--RM-----DWKLTDYLDVLLVL-K-GHP-FIYFTSGKSPILDFCRWMEEH--PE-TGNPFK---G-----AGMSTLTA------R-MNYSSSYTDIMLYKE-------M--TEA 
BN535_00547_Bacteroides_492444862 YL-S-APLPFVGQKRMFAKDFIRVL-GQ----F------PGST---VFVDLFGGSGLLSHITKCVRLDAAVVYNDFDNYRRRLANIPATNVLLSDLRRIA-E--G--E--PRNKRI-TGEVRDKMFARIER-EEKEHGYVDYITVSASLLFAMKYV------TSLEGMKK-E--AIYNRIRQTDYPEAKD---YLEGLTITSEDYKEVF----KRYKDVPG-----VVFLVDPPYLSTEVGTY--EM-----SWRLADYLDVLTVL-K-GHS-FVYFTSNKSSILELCDWMDRN--PF-VGSPFK---E-----CRKAEFSA------S-VNYQAKYTDMMLYTK-------P----- M116_4685_Bacteroides_fragilis_str_3719_A10_596095999 YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRGRLENIGRTNTLLGDLRKIV-G--I--Y--PHNQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEH-L--CFYNKIRQADY-RCDG---YLDGLEVVCYDYKELA----ETYRVLPG-----VVFLVDPPYMGTDISTY--RM-----DWKLGDYLDVLPVL-K-GHP-FVYFTSSKSPILDFCKWMEEH--PG-TGNPFK---G-----TGRSAITA------R-MNYNSSYTDIMLYNN-------M--ACT BSEG_RS20295_Bacteroides_dorei_696373063 YS-Q-APLPFVGQKRMFASEFRKVL-KC----F------SDRT---VFVDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLENISRTNALLSDLRRLS-E--G--I--PRHRML-SREMHGIFLERIRR-EEST-GFVDYLTISSSLLFSGKYA------RNIGELGK-L--NFYNNMRLSDY-SCEG---YLDGLEVVCCDYRELV----DKYRDSPD-----VVFLIDPPYMATDISTY--RM-----DWKLTDYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PE-TGNPFK---G-----AGMSTLTA------R-MNYSSSYTDIMLYKE-------M--TEA BSEG_01570_Bacteroides_dorei_5_1_36/D4_345456400 YS-Q-APLPFVGQKRMFASEFRKVL-KC----F------SDRT---VFVDLFGGSGLLSHITKRERPDATVIYNDHDNYRERLENISRTNALLSDLRRLS-E--G--I--PRHRML-SREMHGIFLERIRR-EEST-GFVDYLTISSSLLFSGKYA------RNIGELGK-L--NFYNNMRLSDY-SCEG---YLDGLEVVCCDYRELV----DKYRDSPD-----VVFLIDPPYMATDISTY--RM-----DWKLTDYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PE-TGNPFK---G-----AGMSTLTA------R-MNYSSSYTDIMLYKE-------M--TEA HMPREF9441_RS15520_Paraprevotella_clara_495898225 
YL-S-APLPFVGQKRMFAREFIKVL-EQ----F------NDKT---VFVDLFGGSGLLSHITKCQRPDATVVYNDFDGYRERLQAIPQTNILLADFRRLA-A--G--V--PKDKPI-RGAVRERILERIAM-AEREWGYVDYITVSSALMFSMKYA------TSLEAMRK-E--TLYNNIRKTDYPPCPD---YLDGLTITSCDYKELY----EKYKDVPG-----VVFFVDPPYLSTEVGTY--KM-----YWRLSDYLDVLNVL-R-DKP-FVYFTSNKSSIIELCEWLGEN--KT-LGNPFK---N-----CGKVEFNA------H-MNYSAKYTDIMLYKK-------Q----- M080_RS26780_Bacteroides_fragilis_499301742 YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA M070_RS00960_Bacteroides_fragilis_695540882 YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCHWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA HMPREF1057_RS0113675_Bacteroides_finegoldii_495041624 YS-S-APLPFVGQKRMFAKEFIKVL-GQ----F------PDST---VFVDLFGGSGLLSHITKCVRPDATVVYNDFDNYRCRLANIPATNVLLSDLRRIA-E--G--E--PKNKRI-TGEVRDKMFARIER-EEKEHGYVDYITISASLLFAMKYV------ASLEEMKK-E--AIYNRIRRADYSKAED---YLEGIMVTCKDYKEVF----KCYKDVPG-----VVFLVDPPYLSTEVGTY--KM-----YWRLADYLDVLTVL-K-GHS-FVYFTSNKSSILELCDWMDRN--PF-VGSPFK---E-----CRKVEFSA------S-VNYQAKYTDMMLYTK-------P----- D11S_2165_Aggregatibacter_actinomycetemcomitans_D11S-1_261414145 
FK-Q-APLPFVGQKRQFLKHFKAIL-NE-QI-P------GDGE-AWTIIDTFGGSGLLAHTAKQLKPCARVIYNDFDGYADRIKHIDDINRLRGQIAALL-S-GV--P--RQKRVT-DKAIKTEIVKTIEA-FD---GYVDLASLASWLLFSGQQV------GSFDELCR-K--DFWHCVCASDYPSADG---YLDGVEVVSESFHTLL----PRFTADPQ-----AVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIDYMQEDK-VD-NWQAFA---N-----AKRIAIKT------K-LNYHGEYEDNLVYKF------------- HMPREF9011_RS05730_Bacteroides_sp_3_1_40A_496057734 YL-S-APLPFVGQKRMFAKEFMKVL-EQ----Y------PDGT---LFVDLFGGSGLLSHITKSLKPHSTVIYNDFDNYRFRMKHIPQTNQLLADIREMV-G--NS-V--PRHKII-KGELRERIFSRIEQ-EENTTGYVDFITLSSSLLFSMKYK------LSVQDMRK-E--ALYNNIRKTGYPECTD---YLEGLEIVSCDYKEVF----NRYKDIPG-----VVFLVDPPYLSTDVGTY--NM-----YWNMADYLDVLNVL-K-GHS-YVYFTSNKSSILELCEWIGKN--KD-LGNPFE---N-----CTKVEFNA------H-MNYNSSYTDMMLYKN-------E----- M117_RS13145_Bacteroides_fragilis_695509259 YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRGRLENIGRTNTLLGDLRKIV-R--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QSMEELEH-L--CFYNKIRQADY-RCDG---YLDGLEVVCYDYKELA----DTYRVLPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLPVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA M127_RS12840_Bacteroides_695294566 YS-R-APLPFVGQKRMFVSEFKKIL-KH----F------DDKT---IFVDLFGGSGLLSHITKRERPDAVVIYNDHDNYRERLENIDRTNTLLRDLRKIV-G--I--Y--PRHQKI-TGKMREAFLERIRL-EETT-GFVDYLTLSTSLLFSGKYA------QNMEELEG-L--YFYNKIRQSDY-RCDG---YLDGLEVVCYDYKELA----DTYGVFPG-----VVFLVDPPYMGTDISTY--KM-----DWKLADYLDVLLVL-K-GHP-FVYFTSGKSPILDFCRWMEEH--PG-IGNPFK---G-----AGRSTLTA------R-MNYNSSYTDIMLYKD-------L--PRA HMPREF9952_RS06315_Haemophilus_pittmaniae_494451533 YK-Q-APLPFVGQKRQFLKHFEVVL-NE-NI-P------GDGD-DWTIIDTFGGSGLLSHAAKQLKPKARVIYNDFDGYAERIKHIDDINRLRAQIAALL-V-DI--P--RQKRIT-DKALKAQIIDTIKA-FD---GYIDLATLTSWLLFSGQQV------GTFEELFA-K--DFWHCIRQSDYPSADG---YLDGIEVVSESFHTLL----PRFSADQQ-----AVFVLDPPYLCTRQESY--KQA-H--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMVEDK-VD-NWEAFY---N-----SERVVVKA------S-ASYSGKYEDNMVYKF------------- 
I926_RS02325_Pasteurella_multocida_810414634 FK-Q-APLPFVGQKRMFIKHFEHIL-NE-NI-K------GDGK-DWTIIDVFGGSGLLSHTAKRVKPKARVIYNDFDRYVERLNHITETNQLREILYHSV-S-EI--I--PKNKLI-SKQAKEEIINKIKA-FN---GYKDVNCLSSWLLFSGQQV------DSLDELFK-Q--RFYNCIRQSNYALADG---YLDGLEVINESFHQLL----PRFIDKKK-----VLLVLDPPYLCTRQESY--KQS-T--YFDLIDFLRLIHLT-K-PP--FIFFSSTKSEFIRFIEAMVEDK-WD-NWQAFN---E-----INRITVNA------S-ASYSGKYEDNLIYKF------------- SALWKB29_RS09160_Snodgrassella_alvi_739635808 FN-K-APLPFVGQKRNFLKQFAKVL-NE-NI-D------GQGE-DWTIIDVFGGSGLLAHHAKRLKPNARVIYNDFDNYADRLAYIEDINRLRQQLAVTV-K--D--I--PIKSKL-DKRTQSLVIEQIRS-FK---GYIDLGSLQTWLLFSGDIL------ENLDQIYKES--VLYNRVKKSDYPLATG---YLDGVEVVSKSFDVLL----PDYVDNEN-----TLLVLDPPYLFSEQKAY--RKA-E--EFRFISFIKLMTLI-R-PP--FILFSNKHSEILEYLDYAIEQK-D----ERFA---G-----YDFCAINA------T-ISKKRTYKDYMIYKF------------- L11_RS07795_Neisseria_weaveri_750388073 HA-K-APLPFVGQKRNFIKHYLGVL-DK--I-P------GSGS-GWNIVDVFGGSGLLAHVAKRIKPDARVIYNDFDNYAARVKAIPDINRLRRLISGYL-A--G--Y--VKKQRI-PDDVKQVIIGEIER-FD---GYKCHVVLASWFLFSGRQA------ANLERFYR-S--EWYFNLPLSDYPVADD---YLDGLEIIRQSYETLI----PQFSDDPQ-----ALLVLDPPYLSTTQAAY--AQD-G--RFGLVDYLKLVNLV-R-PP--YLFFSSTRSEFIDYIDAVVSMQ-LD-NWHVFD---H-----STRLTVQA------K-VSKYASYEDNLVYKL------------- PSYCG_RS09460_Psychrobacter_sp_G_521257429 QH-K-APLPFVGQKRFFLKHFRQVI-DE-HI-P------DNGE-GWTIIDVFGGSGLLSNNAKHLKPAATVIFNDFDNYCERLKHVDDSNRLRRQLMDVL-A--D--Y--PRQTLL-DRNIKSQVIDVIEK-FK---GHIDLRVLSTWLLFAGKHA------TSLDELYA-S--HLYNSLRRTDFTAVDD---YLTGLDIVCESYDTLI----PQYANQPK-----TLLVFDPPYVNTQQGAY--AQK-E--YFGMVQFLKLMQCV-R-PP--YIFFSSTRSELPAYLDFLREHD-SC-AWQRVG---N-----YETISLKA------Q-MNKNSSYEDHMIYRF-Q----------- l11_17040_Neisseria_weaveri_LMG_5135_343968128 
HA-K-APLPFVGQKRNFIKHYLGVL-DK--I-P------GSGS-GWNIVDVFGGSGLLAHVAKRIKPDARVIYNDFDNYAARVKAIPDINRLRRLISGYL-A--G--Y--VKKQRI-PDDVKQVIIGEIER-FD---GYKCHVVLASWFLFSGRQA------ANLERFYR-S--EWYFNLPLSDYPVADD---YLDGLEIIRQSYETLI----PQFSDDPQ-----ALLVLDPPYLSTTQAAY--AQD-G--RFGLVDYLKLVNLV-R-PP--YLFFSSTRSEFIDYIDAVVSMQ-LD-NWHVFD---H-----STRLTVQA------K-VSKYASYEDNLVYKL------------- PMCN03_RS01910_Pasteurella_multocida_492125251 YK-Q-APLPFVGQKRLFLNHYINII-NE-HI-P------DDGE-GWTIIDAFGGSGLLSHVTKHIKPKARVIYNDFDGYSERLKHIRDLNKLRRILLELL-K--N--E--PRSKQL-SCDMKYKVIQAIEA-FT---GYKDPHVLSTWLLFSGQQV------RTLSELYR-L--SFYNRIRLSDYSEAQD---YFNGFEVANESFHSLL----PRFVDKQK-----TLFVLDPPYLCTHQAAY--SMD-T--YFDLIDFLRLINLT-R-PP--FIFFSSTKSEFIRFVDFMLETK-TH-NWESFT---D-----YKKISINT------S-TNYSGKYEDNLVYKF------------- C228_RS0112985_Actinobacillus_capsulatus_517482436 YK-Q-APLPFVGQKRQFLAQYAAIL-NQ-YI-P------NDGQ-GWTIIDAFGGSGLLSHTAKQLKPAARVIYNDFDGYATRLKHIDDINQLRGKIYTLL-D-GV--P--RQKRIT-DHSIKIKIIETIEA-FD---GYKDLNCLASWLLFSGQQV------ATLSDLYH-K--DFWNCIRLSDYQNADG---YLDGIEITNESFHTLL----PRFINDQR-----TVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLINIT-R-PP--YIFFSSTKSEFVRFIEYMQTDK-VD-NWQAFD---G-----AQRIAIKT------S-LNYQGEYEDNLVYKF------------- GGE_RS05405_Haemophilus_haemolyticus_763375484 FK-Q-APLPFVGQKRMFLKHFETIL-NE-NI-K------DDGE-GWTIIDTFGGSGLLSHAAKRLKPKACVIYNDFDGYAERLAHIDDINALRSQLFTIV-G-NA--T--PKNKRM-PKELKAECVKIIQA-FD---GYKDLNCLASWLLFSGQQV------ATIDELFQ-N--DFWHCIRQSDYPKADG---YLDGVEIVRESFHTLL----PKFADNPK-----ALFVLDPPYLCTRQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMQEDK-ID-NWEAFD---N-----TKRIVVNA------S-ASYSGKYEDNMVYKF------------- MOMA_RS09420_Moraxella_macacae_750343301 YQ-K-APLPFVGQKRMFLKEFRKIL-EK--I-P------NDGE-SWTIVDVFGGSGLLANNAKACKPKATVIYNDFDGYTKRLAHIDDINRLRTILFGLT-K--D--V--PRQKRI-PDGLKGRILQVIAD-FD---GYIDTRSVSTWLLFSGKQI------AHIDELAD-N--QMYNTVRTNDYDRADG---YLNGILVTNESFEILI----PKYADRPN-----TMLLLDPPYICTEQKAY--AMT-G--YFGMTKFLRLMKLV-R-PP--YLLFSSTRSELLDYMDYLKDCE-PV-MWERIG---G-----FEKVSVQS------Y-VNYTSEYEDNMIFKF------------- 
HMPREF9065_RS02985_Aggregatibacter_sp_oral_taxon_458_545363364 FK-Q-APLPFVGQKRMFLNHFKAIL-NE-QI-P------GDGE-GWTIIDTFGGSGLLSHTAKQLKPRARVIYNDFDGYAERIKHIDDINRLRAQIAALL-A-GV--P--RQKRVT-DKALKAQIIDTIKA-FD---GYVDLASLTSWLLFSGQQV------GSFDELCK-K--DFWHCVRASDYPSADG---YLDGVEVVSESFHTLL----PRFTADPQ-----AVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLINIT-R-PP--YIFFSSTKSEFVRFIEYMLQDK-VD-NWQAFD---G-----AQRIAIKT------S-LNYQGEYEDNLVYKF------------- E9K_RS05470_Moraxella_catarrhalis_489757769 HH-K-APLPFVGQKRMFLKEFRKIL-DK--I-P------SDGE-NWTIIDVFGGSGLLANNAKAYKPNATVIYNDFDGYTKRLAHIDDINRLRAILFEMT-K--D--V--PRQKRI-SDELKGRILQAIDD-FD---GYVDARSVSTWLLFSGKQI------NHISELTD-H--SMYNTVRTSDYDNAQD---YLDGLVITHESFDTLI----PKFADKPN-----ALLLLDPPYVCTEQKAY--ALK-G--YFGMTKFLRLMKLV-R-PP--YLFFSSTRSELLDYMDYLKDCE-PV-MWELVG---D-----FEKVSVNS------H-VNYNAEYEDNMIFKF------------- RHAA1_RS00240_Aggregatibacter_actinomycetemcomitans_491746110 FK-Q-APLPFVGQKRKFLKHFNAIL-NR-HI-A------GDGQ-GWTIIDTFGGSGLLAHAAKQLKPRARVIYNDFDGYFERIKHIDDINRLRGQIAALL-S-GV--P--RQKRVT-DKALKADIIKTIEA-FD---GYVDLASLASWLLFSGQQV------GSFDELCG-K--DFWDCVRASDYPSAEG---YLDGVEVVCESFHTLL----PRFTADPQ-----AVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMQEDK-VD-NWQAFA---G-----AQRIAIKT------S-LNYQGEYEDNLVYKF------------- E9U_RS07920_Moraxella_catarrhalis_489767871 HH-K-APLPFVGQKRMFLKEFRKIL-DK--I-P------NYGE-NWTIIDVFGGSGLLANNAKAYKPNATVIYNDFDGYTKRLAHIDDINRLRAILFEMT-K--D--V--PRQKRI-SDELKGRILQAIDD-FD---GYVDARSVSTWLLFSGKQI------NHISELTD-H--SMYNTVRTGDYDNAQD---YLDGLVITHESFDTLI----PKFADKPN-----ALLLLDPPYVCTEQKAY--ALK-G--YFGMTKFLRLMKLV-R-PP--YLFFSSTRSELLDYMDYLKDCE-PV-MWELVG---D-----FEKVSVNS------H-VNYNAEYEDNMIFKF------------- E9G_RS00410_Moraxella_catarrhalis_489754581 
HH-K-APLPFVGQKRMFLKEFRKIL-DK--I-P------SDGE-NWTIIDVFGGSGLLANNAKAYKPNATVIYNDFDGYTKRLAHIDDINRLRGILFEMT-K--D--V--PRQKRI-SDELKGRILQAIDD-FD---GYVDARSVSTWLLFSGKQI------NHISELTD-H--SMYNTVRTSDYDNAQD---YLDGLVITHESFDTLI----PKFADKPN-----ALLLLDPPYVCTEQKAY--ALK-G--YFGMTKFLRLMKLV-R-PP--YLFFSSTRSELLDYMDYLKDCE-PV-MWELVG---D-----FEKVSVNS------H-VNYNAEYEDNMIFKF------------- HMPREF1053_RS00285_Haemophilus_haemolyticus_491876509 FK-Q-APLPFVGQKRMFLKHFETIL-NE-NI-E------DDGE-GWTIIDTFGGSGLLSHAAKAIKPKARVIYNDFDGYAERLAHIDDINKLRAELYSVV-G-NA--T--PKNKRM-TKDCKAECIRIIQN-FK---GYKDLNCLASWLLFSGQQV------ATLDDLFQ-H--DFWHCIRQSDYQKADG---YLDGVEIVQESFHTLL----PKFSDDPK-----ALFVLDPPYLCTRQESY--KQA-T--YFDLIDFLRLINIT-R-PP--YIFFSSTKSEFVRFIKYMVDDK-VH-NWQNFD---N-----AQRIVVNA------S-ASYSGKYEDNMVYKF------------- HMPREF9016_RS00975_Neisseria_sp_oral_taxon_014_496464403 HS-T-APLPFVGQKRYFIKHFTKVL-SQ--I-P------ADGK-HWTIVDVFGGSGLLAHVAKRIKPQARVIYNDYDNYSDRLRHIPDYNRLREQIAQIV-G--G--I--PKGSRL-DPERTRSVQQTITN-FQ---GHIDVRVLSSWLLFSAKQA------NSLEQLLG-F--EFYNKVRQSPYSIAAD---YLDGLEITRQDYNLLM----AEHQHNPN-----TLLVLDPPYVSTAQGAY--AAD-K--YFNMVSFLRMIQYM-R-PP--FILFSSTRSEALDYFQFLQECE-PD-KYRRFS---G-----YNIVSLDA------K-MGKGIEYQDNMIYKI---D--------- P1062_RS06555_Pasteurella_multocida_514429666 FK-Q-APLPFVGQKRMFLKHFEQVL-DE-NI-Q------GDGE-GWTIIDVFGGSGLLSHTAKRLKPKARVIYNDFDRYTERLNHIAETNQLREILYQTV-N-GI--I--PKNKLI-NKRLKEEIINKINR-FN---GYKDVNCLSSWLLFSGQQV------GSLHELFK-R--RFYNCVRKTDYVLTEG---YLEGLEVVSESFHQLL----PKFQNKEK-----VLIVLDPPYLCTRQESY--RQA-T--YFDLIDFLRLIHLT-K-PP--FIFFSSTKSEFIRFLDYTQEDK-TD-NWQTFE---G-----YKRIIVNT------S-ASYSGKYEDNLIYKF------------- P1062_RS03850_Pasteurella_multocida_514429566 FK-Q-APLPFVGQKRMFLKHFEQVL-DE-NI-Q------GDGE-GWTIIDVFGGSGLLSHTAKRLKPKSRVIYNDFDGYSERLNHIAEINQLREILYQTV-N-GI--I--PKNKLI-NKRLKEEIINKINR-FN---GYKDVNCLSSWLLFSGQQV------GSLHELFK-R--RFYNCVRKTDYVLTEG---YLEGLEVVSESFHQLL----PKFQNKEK-----VLIVLAPPYLCTRQESY--RQA-T--YFDLIDFLRLIHLT-K-PP--FIFFSSTKSEFIRFLDYTQEDK-TD-NWQTFE---G-----YKRIIVNT------S-ASYSGKYEDNLIYKF------------- 
NMA510612_RS09285_Neisseria_meningitidis_488141552 HS-T-APLPFVGQKRYFIKHFTKVL-SQ--I-P------ADGK-HWTIVDVFGGSGLLAHVAKRIKPQARVIYNDYDNYSDRLRHIPDYNRLREQIAQIV-G--G--I--PKGSRL-DPERTRSVQQTITN-FQ---GHIDVRVLSSWLLFSAKQA------NSLEQLLG-F--EFYNKVRQSPYSIAAD---YLDGLEITQQDYNLLM----AEHQHNPN-----TLLVLDPPYVSTAQGAY--AAD-K--YFNMVSFLRMIQYM-R-PP--FILFSSTRSEALDYFQFLQECE-PD-KYRRFS---G-----YNIVSLDA------K-MGKGIEYQDNMIYKI---D--------- NM70082_RS106455_Neisseria_meningitidis_488182095 HS-T-APLPFVGQKRYFIKHFTKVL-SQ--I-P------ADGK-HWTIVDVFGGSGLLAHVAKRIKPQAQVIYNDYDNYSDRLRHIPDYNRLREQIAQIV-G--G--I--PKGSRL-DPERTRSVQQTITN-FQ---GHIDVRVLSSWLLFSAKQA------NSLEQLLG-F--EFYNKVRQSPYSIAAD---YLDGLEITQQDYNLLM----AEHQHNPN-----TLLVLDPPYVSTAQGAY--AAD-K--YFNMVSFLRMIQYM-R-PP--FILFSSTRSEALDYFQFLQECE-PD-KYRRFS---G-----YNIVSLDA------K-MGKGIEYQDNMIYKI---D--------- K941_RS0107980_Moraxella_caprae_656071953 FK-T-APLPFVGQKRQFIGRFEKLLLNN----I------PNDGEGWTVIDVFGGSGLLAHNAKRLLPKTTVIYNDFDDYTNRLKHIPTTNALRQALSDIL-K--H--E--PRSLKL-SSTVKQQVLDIVKD-FQSQGKFIDVQTIAGWLLFSGRQV------ADLDEFMA-E-STLYNRITKTDYELADG---YLDGLVITCESFEQLL----TKHQATPN-----CLLLLDPPYVCTTQSAY--NLHERGGYFGMTKFLTLMH-Y-V-KPP-YIFFSSTRSELLDYMSYVEQY-----EPHTWERIGG-----FERIVVKV------T-VNKGLGYEDNILAK-------------- IO48_RS11150_Gallibacterium_anatis_746098344 FA-Q-APLPFVGQKRMFLQQFRSVL-NQ-MI-A------DNGD-GWTIVDAFGGSGLLSHTAKRLKPNARVIYNDFDGYAERLKHIDDINRLRQQLSNLL-T--G--Y--PRQKRL-DIAMRHKVIDAIES-FD---GYKDPHILCAWLLFSGQQI------KSLNELYR-H--GFYNCVRQSDYDTADG---YLDGIEVVSESFSELL----PKFANDKK-----AIFVLDPPYLCAHQASY--KQE-S--YFGLINFLELIRLT-R-PP--YLFFSSTKSEFVRFVDWLVETR-SD-NWQSFA---D-----YQRIIVRT------S-ASYIGKYEDNLIYKC------------- JL04_RS11025_Gallibacterium_anatis_746100920 
FA-Q-APLPFVGQKRMFLQQFRSVL-NQ-MI-A------DNGD-GWTIVDAFGGSGLLSYAAKQLKPKARVIYNDFDGYAERLKHIDDINRLRQQLSDLL-T--G--C--PRQKRL-DIAMRHKIIDAIES-FD---GYKDPHILCAWLLFSGQQI------KSLNELYR-H--GFYNCVRQSDYDTADG---YLDGIEVVSESFSELL----PKFANDKK-----AIFVLDPPYLCTHQASY--KQE-S--YFGLINFLELIRLT-R-PP--YLFFSSTKSEFVRFIDWLIATK-GD-NWQSFA---D-----YQRIVIQT------S-ASHNGQYEDNLIYNF------------- P375_RS00515_Gallibacterium_genomosp_2_746064999 FS-Q-APLPFVGQKRMFLNQFKSVL-NE-MI-T------NDGE-GWTIVDAFGGSGLLSHTAKRLKQKATVIYNDFDGYVEWLKHIDDINHLRQQLSDLL-A--D--Y--PRRKRL-DIAMRHKIIDVIES-FD---GYKDPHILCAWLLFSGQQV------KSVDELYR-H--GFYNCVRQSDYPTADG---YLDGIEVVHESFSTLL----PQFSDDKR-----VIFVLDPPYLCTHQASY--KQD-G--YFDLINFLGLIRLT-R-PP--YIFFSSTKSEFVRFVDWLIATK-GD-NWQSFI---D-----YKRIIVQI------S-TSYSGKYEDNLIYRA------------- GCWU000324_01234_Kingella_oralis_ATCC_51147_237868315 YR-K-APLPFVGQKRNFLKHLIPVL-QQ-NI-P------NDGA-GWTIVDVFGGSGLLAHTTKRTLPKARVIYNDFDGYAERIKNIPDTNRLRDRLAEVL-D--K--Q--PRDKAL-NADAKAAVVVIIRG-FG---GYKDLNCLRSWLLYSGKEA------AT-PDCLYGE--TLYNRLRLSPYPEAAD---YLDGLDITSQSFETLL----PRYAGQDN-----TLLILDPPYVCTQQGMY--ANQ-T--YFGMVPFLTLAQMV-R-PP--FVFFSSTRSEFLDYLGFLRQYK-PQ-EWANWA---G-----FGQVTIRG------T-LSKGSCYEDNMVYRF--ER--------- JP30_RS08890_Gallibacterium_anatis_746079094 FS-Q-APLPFVGQKRMFLKQFKAVL-NQ-MI-D------NDGE-GWTIVDAFGGSGLLSHTAKQLKPKAKVIYNDFDGYAERLKHIDDINHLRQQLSDLL-A--D--Y--PRRKRL-DIAMRHKIIDVIES-FD---GYKDPHILCAWLLFSGQQV------KSVDELYR-H--GFYNCVRQSDYPTADG---YLDGIEVVHESFSTLL----PQFSDDKR-----VIFVLDPPYLCTHQASY--KQD-G--YFDLINFLELIRLA-R-PP--YIFFSSTKSEFVRFVDRLITTK-GD-NWQSFV---D-----YHRIVVQT------S-TSYSGKYEDNLIYKS------------- JP36_RS09335_Gallibacterium_genomosp_1_746108169 FT-Q-APLPFVGQKRMFLNHFKTVL-NE-MI-T------NDGE-GWTIVDAFGGSGLLSHTAEQLKPQARVIYNDFDGYAERLKHIDDINRLRQILSKLL-A--N--Y--PRQKRL-DIAMRHKVIDAIES-FD---GYKDPHILCTWLLFSGQQV------KSIDELYR-H--GFYNCVRQSDYPEADG---YLDGIEVVNESFSDLL----PKFFDDSK-----AIFVLDPPYLCTHQDSY--KQE-S--YFDLINFLELIRLT-R-PP--YLFFSSTKSEFVRFVDWLIAAK-GD-NWQSFE---D-----YHRIVVQT------S-TSYSGKYEDNLIYKC------------- 
HMPREF0669_RS04845_Prevotella_sp_oral_taxon_299_496519123 HM-K-APLPFVGQKRNFIKALTPII-ER----Q------PDNT---IFVDLFGGSGLLSNLVKELKPNARVIYNDFDNYSERLAHIKETEELRHMIGEKL-K--D--V--PKCSKV-SEELKAEICDLIED-FKAKKGFVDIVTVASWLLFSNRTA------GDIDDIRA-KRNTFYNSVIKAPLK-ADG---YLEGAERVCKDFQKLI----DEFKNVPN-----VLFICDPPYMLTEKAHY--KKT----YWGLGKYLNLLKDM-T-GLN-SIYFTSSKSGLLDFYHWWEKN-----MPQAIKK--P-----YKIISNNV------GYFHESREYEDIMMYN-------------- UMN179_RS08310_Gallibacterium_anatis_762905187 FA-Q-APLPFVGQKRMFLQQFRSVL-NQ-MI-A------DNGD-GWTIVDAFGGSGLLSHTAKRLKPNARVIYNDFDGYAERLKHIDDINRLRQQLSNLL-T--G--Y--PRQKRL-DIAMRHKVIDAIES-FD---GYKDPHILCAWLLFSGQQI------KSLNELYR-H--GFYNCVRQSDYDTADG---YLDGIEVVSESFSELL----PKFANDKK-----AIFVLDPPYLCTHQARY--KQE-S--YFDLINFLELIKLT-R-PP--YLFFSSTKSEFVRFVDWLIATK-GD-NWQSFA---D-----YQRIIVQT------S-TSYSGKYEDNLIYKC------------- JP35_RS03815_Gallibacterium_anatis_746003746 FA-Q-APLPFVGQKRMFLQQFRSVL-NQ-MI-A------DNGD-GWTIVDAFGGSGLLSHTAKHLKPNARMIYNDFDGYAERLKHIDDINRLRQILSELL-A--N--C--PRDKRL-DIAMRHKVIDAIEA-FE---GYKDPHILCAWLLFSGQQV------SSINELYR-R--GFYNCVRQNDYPTADG---YLDGIEVVNESFVTLL----PKFADDSK-----AIFVLDPPYLCTKQASY--RQD-S--YFDLIAFLDLIKLT-R-PP--YLFFSSTKSEFIRFVDWLIASK-GD-NWQSFV---D-----YQRIIVQT------S-TSYSGKYEDNLIYKC------------- ASUC_RS07260_Actinobacillus_succinogenes_501020456 FN-Q-APLPFVGQKRMFLQHFRKIL-NQ-HI-A------NNGD-GWTIVDVFGGSGLLSHTARNTKSAATVIYNDFDGYSERLQHIGDINRLRADLYALV-D-NA--A--PKNKRL-SKALKSDVIHVIQN-FD---GYKDLNCLSSWLLFSGQQV------GNFTELFN-R--DFWHCIRKSDYPEADG---YLEGITVTNESFDSLI----PRFAGKDK-----VLLILDPPYLCTRQDSY--KQA-T--YFDLIDFLKLINLT-K-PP--YIFFSSTKSEFIRFIDYMIDSK-AD-NWRSFD---N-----CHRIAVNA------S-ASYSGQYEDNLVFKF------------- HMPREF1054_1309_Haemophilus_paraphrohaemolyticus_HK411_385696246 
FK-Q-APLPFVGQKRMFLKHVQAVL-DK-HI-D------GEGE-GWTIVDVFGGSGLLSHTAKHIKPKATVIYNDFDGYAKRLKYIDDINRLRQIIFNHL-H-GI--V--PKNGRL-SKEIKAEIINKIND-FQ---GYKDLNCLASWLLFSGQQV------SSFEALFA-K--DFWHCVRQSDYPSAEG---YLDGIEIVSESFHKLI----PRYKDQEK-----VLLLLDPPYLCTRQESY--KQS-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRYLDYMQESK-TD-NWQAFE---N-----YERIVVKA------S-TSKDGIYEDNMIYKF------------- NEIPOLOT_RS06080_Neisseria_polysaccharea_489846667 YR-K-APLPFTGQKRNFLKLFKQVL-ND-HI-P------GDGE-DWTILDAFGGFGLLSHTAKQCKPAARVIYNDYDGYSERLQHIPDINHLRRLLAGLL-T--P--V--PRSKPV-PPAIKAAIVAAIRS-FG---GYIDLDCLVSWLLFSGNTA------ADLDELGR-K--TMYNCISLSDYPEVQD---YLQGVEIVDQSYRELL----PQHIGNPR-----TLLVLDPPYVCTQQGNY--RKA-A--YFGMVEFLRLMAMV-R-PP--FIFFSSTRSELPAYLDLVAELR-LP-GWERFT---G-----SQTLTVSS------T-INRNSSYDDHLIYKF------------- Q321_RS0105900_Conchiformibius_steedae_652666097 YT-Q-APLPFTGQKRRFLNHFKSLL-NQ-HL-D------GDGA-GWTIIDAFGGSGLLAHTAKRCKPAARVIYNDFDGYAERLAHIPDTNRLRQILADLL-Q--H--Q--PRQKRL-PDAVKKAAEAAING-FG---GYLDLDCLAAWLLFSGKTA------RDLPDLYS-H--QFYHCVRQHDYPDATD---YLDGLEIVSQPYHELL----PPHLDDPK-----TLLVLDPPYVCTQQGSY--RKA-G--YFGMVQFLRLMRLV-R-PP--FVFFSSTRSELLEYLDLVIGDK-ME-GWDRFK---D-----YQKISLTT------H-INQQAVYEDNLVYRF------------- HMPREF9021_RS11285_Simonsiella_muelleri_488719230 YT-Q-APLPFTGQKRRFLNHFKTLL-KQ-HI-P------NDGD-GWTIVDAFGGSGLLAHTAKQVLPKARVIYNDFDGYTERLKNINDTNQLRKIIFDLT-R--D--Y--PLKQKL-PETLKKTIQATLQT-FG---GYIDLDCVASWILFSSRQT------TDLNDLIYNH--TYFNNVRLSDYPSADG---YLDSVEVVSKSYHELL----PEFQDNSK-----ALLVLDPPYVSTAQGAY--RKA-G--YFGMVEFLRLMRLV-R-PP--FVFFSSTRSELLDYLDLIVNEK-AE-GWQNLQ---D-----YQKISVHI------T-MNKTAHYEDNMIYKF-----QAA----- Q338_RS03200_Alysiella_crassa_736171011 FY-Q-APLPFTGQKRRFLTAFKQVL-NQ-CI-A------DDGV-GWTIIDVFGGSGLLAHTAKRGKPAARVIYNDFDDYATRLQNINDTNRLRQIIFDLT-Q--H--I--PRNQKL-PENTQKQVQAALKS-FD---GYVDLHSVASWLLFSGRQT------TDLDDLLHNH--TYYNGVRISDYPQADG---YLDGLEITSQSYADLL----PQWIGRDK-----VLFLFDPPYVCTAQGAY--RNE-T--YFGMVEFLKLMRLV-R-PP--FVFFSSTRSEFLDYLNLVIGYK-LD-GWEHFA---G-----YRKINVQV------H-LNKNAQYEDNLVYKL-NSAFEIA----- 
Q338_RS01810_Alysiella_crassa_736169879 YH-Q-APLPFTGQKRNFLKAFKQVL-NE-QI-S------GDGD-GWTIIDVFGGSGLLAHTAKRCKPAARVIYNDFDGYATRLQHIDDTNRLRQVIFNLL-Q--D--C--PRKTKL-TPALKAVVQATLQT-FG---GYVDVASVASWILFSGQQA------RTLDDLMQ-H--NFYNRVRLSGYPSADG---YLDGLEIVSQSYVDLL----PQFINQDK-----VLLLLDPPYVCTAQGAY--YND-V--YFGMVEFLKLMSMV-R-PP--FVFFSSTRSEFLDYLDLVIGYK-LD-GWERFA---G-----YHKVSLMA------G-LNYSARYEDNMVYKF------------- HMPREF9021_RS03005_Simonsiella_muelleri_488717644 YS-Q-APLPFTGQKRNFLKFFQQVL-KE-NI-S------NQGQ-GWTIIDAFGGSGLLAHTAKQTLPEARVIFNDFDGYTERLAHIDDTNRLRELIFNRL-N--D--LNVPKNQGL-IPQEKAEIEVIIHD-FG---GYKDVISLGSWLLFSGRQV------NQLSDLFN-Q--NWYRKIRETPYPSAVG---YLDDVEIVRRNAHELL----PDFVDSPR-----VLLVLDPPYVCTEQGSY--RQD-D--YFGMVQFLRLMSVV-R-PP--FVFFSSTRSEFLAYLDFVIETK-QA-GWERFV---D-----YRKISIHT------S-LNKQSRYEDNLVFKF--E---------- NELON_RS04600_Neisseria_elongata_489871056 YR-K-APLPFTGQKRNFLKLFKQVL-NE-HI-P------GDGE-DWTILDAFGGSGLLSHTAKQCKPAARVIYNDYDGYSERLQHIPDINRLRRLLAGIL-E--P--V--PRSKPV-PPAIKAAIVAAIRS-FG---GYVDLDCLVSWLLFSGNTA------ADLDELCR-K--TMYNCISLSDYPEAQD---YLQGVEIVGQSYRELL----PQHIGNPR-----TLLVLDPPYVCTQQGNY--RKA-A--YFGIVEFLRLMAMV-R-PP--FVFFSSTRSELPAYLDLVPELR-LP-GWERFA---N-----SQTLTVSS------T-INRNSSYDDHLIYKF------------- KKB_RS07455_Kingella_kingae_489886467 FY-Q-APLPFTGQKRRFLTAFKQVL-NQ-CI-A------DDGA-DWTIVDVFGGSGLLAHTAKRCKPAARVIYNDFDGYSERLQNINDTNKLRTIIADLL-A--H--Y--PRNQKL-PDTLKKTVQATLNS-FG---GYIDLDCVASWLLFSGRQT------TDLHDLLHNH--TYYNGVRLSNYPSADG---YLDNVEVVSKSYHELL----PEFQDNPK-----ALLVLDPPYICTAQGSY--RKA-G--YFGMVQFLRLMRLV-Q-PP--FIFFSSTRSELIDYLDFIVNEK-AE-GWQNLQ---D-----YQKISVHV------T-MNKTAQYEDNMVYKF-----QAA----- EIKCOROL_RS01150_Eikenella_corrodens_489918676 
YR-K-APLPFTGQKRNFLKLFKQVL-NE-HI-P------GDGE-DWTILDAFGGSGLLSHAAKQCKPAARVIYNDYDGYSERLQHIPDINRLRRLLAGIL-E--P--V--PRSKLV-PPAIKAAIVAAIRS-FG---GYVDLDCLVSWLLFSGNTA------ANLDELCR-K--TMYNCISLSDYPEAQD---YLQGVEIVSQSYRELL----PQHISNPC-----TLLVLDPPYVCTQQGNY--RKA-A--YFGMVEFLRLMAMV-R-PP--FIFFSSTRSELPAYLDLVAELR-LP-GWERFA---G-----SQTLTVSS------T-INRNSGYDDHLIYKF------------- HD_RS00615_Haemophilus_ducreyi_499246665 FK-Q-APLPFTGQKRMFLNHFKAVL-NE-HI-V------GDGE-GWTIVDVFGGSGLLSHTAKQLKPAARVIYNDFDGYAERLKHIDDINRLRGQIHALL-R-DV--P--SQKRIT-DKALKTKIIATINA-FD---GYKDLASLSSWLLFSGQQV------ATFDDLFK-K--DFWCCIRQSDYPRAEG---YLDGIEVTSESFHTLL----PQFIADKK-----TLFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMQTDK-VH-NWQAFD---G-----AQRIAIKT------N-LNYHGEYEDNLVYKF------------- HMPREF9021_RS06670_Simonsiella_muelleri_750347144 HT-Q-APLPFTGQKRRFLNHFKTLL-KQ-QI-P------NDGD-GWTIVDAFGGSGLLAHTAKQVLPKARVIYNDFDGYTERLQNINDTNQLRKIIFDLT-R--D--Y--PRNQKL-PETLKKTIQTTLQT-FG---GYIDLDCVASWILFSSRQP------TDLHDLIYNH--TYFNNVRLSDYPSADD---YLDGVEVVSKSYHELL----PEFQDNSN-----ALLVLDPPYVSTAQGAY--RKA-G--YFGMVEFLRLMRLV-R-PP--FVFFSSTRSELLDYLDLIVNEK-AE-GWQNLQ---D-----YQKISVHI------T-MNKTAHYEDNMIYKF-----QAA----- L278_RS122210_Mannheimia_haemolytica_544865770 FK-Q-APLPFSGQKRMFLSHFKRVL-ND-NI-K------DDGK-GWTIIDVFGGSGLLSHTAKYEKSLAKVIYNDFDNYTERLEHIKDTNQLRQEIYRIV-D-RI--I--PKNKRI-SNEVKAKIINKIND-FE---GFKDLKCLSSWLLFSGEQV------ATLEELFK-H--DFWNCVRQSDYPEATG---YLDDIEVVSESFHQLL----PRFHDKEK-----VLLILDPPYLCTRQESY--KQA-N--YFDLIDFLRLIDLT-K-PP--YIFFSSTKSEFIRFIHYMVEGK-KD-NWKSFE---G-----AKRIVVNA------S-ASYNGQYEDNMVYKF------------- D650_21760_Mannheimia_haemolytica_USDA-ARS-USMARC-183_472258915 FK-Q-APLPFSGQKRMFLSHFKQVL-NA-NI-E------ADGK-DWTIIDVFGGSGLLAHTAKREKPLARVIYNDFDNYAERLNHIKETNQLRQEIYQIV-D-EI--I--PKNKRI-SNEIKAKIINKIND-FE---GFKDLKCLSSWLLFSGEQV------ATLDELFK-H--DFWHCVRQSDYPEATG---YLDDIEVVSESFHQLL----PRFHDKEK-----VLLILDPPYLCTRQESY--KQE-N--YFDLIDFLRLIDLT-K-PP--YIFFSSTKSEFIRFIHYMVQNR-KE-NWQAFE---G-----AERIVVNA------S-ASYNGKYEDNMVYKF------------- 
BN741_01478_Prevotella_stercorea_CAG:629_548211070 FS-S-SPLPFRGSKRYYVRRFREVL-AQ----T------QDID---TVVDLFGGSGLLSRVAKDTLPNCRVIYNDFDHYDTRLANAANTNALLRSIAPLL-V--N--V--PDNKKV-PTETKIKILELCAE-EEKR-HAVDYITLSGSLLFSGNWA------QSYEELSK-Q--TMYNRMVKTDY-NVAN---YLSGLEVTHCDYRELF----NAHKANKK-----ALFLLDPPYLQTEHSAYKADT-----YWQLKDYLDVLTLL-D-DTK-YVLFTSGKSQIIELCDWINQS--FG--GKLLK---D-----AQKYVQNS------R-INDFAAYKDIMIAK-------------- HMPREF9148_RS11290_Prevotella_sp_F0091_545434898 YF-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVIYNDFDGYQKRLTMLSETNALLSELRKIV----D--A--PRYKAI-LGVQREKVLECVRK-YERIYGCVDYITLSSSILFSMKYV------TTYADLEK-E--TLYNNIKSTDYPPCDD---YLDGLTITSCDYKEVF----EKYKDVPN-----VVFLVDPPYLSTDSTTY--KM-----YWKLSDYLDVLTIL-A-GHR-FIYFTSNKSSIVELCEWIGKN--KL-IGNPFE---S-----CQRKEFNA------R-MNYNSSYTDIMLYTD-------V----- P150_RS0104410_Prevotella_sp_HUN102_655515586 YL-S-APLPFQGQKRMFAKEYIKVL-RQ----F------PDCT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRKRLEALPQTNALLAELRAIM----D--V--PRHKAI-MGEQRKQVLSCIRR-HEREYGYVDYITLSSSILFSMKYA------TEYADLEK-E--TLYNKIKGVDYPPCDD---YLDGLTITSCDYKEVF----ERYKDVAN-----VVFLVDPPYLSTDSKTY--RM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIIELCDWMGRH--PN-LGDPFR---N-----CHRREFNA------H-MNYSSSYTDIMLYTD-------A----- WEEVI_RS00470_Weeksella_virosa_503362551 YT-Q-SPLPFQGQKRKFINHVKQVL-AN----S------SEDA---TYVDLFGGSGILSHTVKQLKPNAKVVYNDYDDFSKRLAAVAQTNVLLDKIRAIT-K--E--L--PKDKLI-PEVHKLKLLELIKD-EECRLGYVDYITLSSSLLFSAKYV------TNYNDLTK-Q--TFYNNVRQSNY-VTDD---YLAGVEIVHQDYKELF----DQYKDLDN-----VVFLVDPPYLSTDTSTY--TSD---KYWKLKDYLNVLDVL-V-GTN-YLYFTSNKSQIVELCQWMEDRTSMA-DVNPFS---G-----STTVCINT------T-LNHSAKYTDMMLYKL-------K----- BZARG_RS04055_Bizionia_argentinensis_495910797 
FN-T-SPLPFQGQKRRFVKQFKEAL-NV----F------SDSA---TYVDLFGGSGLLAHTVKQKYPNAKVIWNDYDNFQNRLESISETNLLLTELRSFL-I--D--L--PRKQRM-EAIDRERVLRVVKA-HETKYGYVDYVTLSGSLLFSAKYA------TNYKEFAN-E--SFYNRIKLSDY-NATG---YLSGVERVQNDYKALF----DSYKS-DT-----TVFLVDPPYLSTDTSSY--NKD---NYWKLRDYLDVLSVL-D-GSK-YFYFTSNKSQIVELCEWIETR--TM-TGNPFQ---G-----STMTTTAG------T-INHTASYTDIMLFK-------------- P150_RS0110495_Prevotella_sp_HUN102_655516580 YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------LDGT---TFVDLFGGSGLLSHIAKYQKPNSTVVYNDFDGYRKRLEVLPQTNALLAELRTIV----D--V--PRHKAI-MGEQREQVLSCIRR-HEREHGYVDYITLSSSILFSMKYA------IEYADLEK-E--TLYNNIKGVDYPPCND---YLDGLIITSCDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--RM-----YWKLSDYLDVLTVL-S-SHH-FIYFTSNKSSIIELCDWMGRH--PN-LGDPLR---N-----CHRREFNA------H-MNYSSSYTDIMFYTD-------A----- HMPREF1651_RS08825_Prevotella_bivia_739005860 YL-S-APLPFQGQKRMFAKEYIKVL-RQ----F------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRKRLDALPQTNALLAELRAIV----D--V--PRHKPI-MGEQREQVLSCIRR-HERVHGCVDYITLSSSILFSMKYA------TEYAELEK-E--TLYNNIKGVDYPPCED---YLDGLTITSCDYKEVF----ERYKNVPG-----VVFLVDPPYLNTDSKTY--RM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIVELCDWMGRH--PN-FGDPFK---N-----CHRREFNA------H-MNYNSSYTDIMLYTD-------V----- P150_RS0109795_Prevotella_sp_HUN102_655516468 YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRKRLEALPQTNALLAELRAIV----D--V--PRHKAI-MGEQREQVLSCIRR-HERERGYVDYITLSSSILFSMKYA------TEYADLEK-E--TLYNNIKGVDYPPCDD---YLDGLTITSCDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--RM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIIELCDWMGRH--PN-LGDPFR---N-----CHRREFNA------H-MNYSSSYTDIMLYTD-------A----- ORNRH_RS05640_Ornithobacterium_rhinotracheale_504603820 FK-S-APLPFQGQKRNFVKHFKEAL-KG----F------PSNA---IYVDLFGGSGLLSHTVKCVHPEAKVVYNDFDNFQKRLKAIPETNKILEELRALN-L--K--T--PRGKII-QGEEKEKVLEVLKR-ADKR-GFVDWITLSGSLKFSMNYG------LKLEDFTN-D--TLYNTIRKTNFDEASD---YLAGIEVVSEDYRHLF----QKYKDLDN-----VVFLADPPYLSTDTATY--AND---KYWKLTDYLEVLETL-Q-GSN-FFYFTSNKSQVVELCQWLGTRT-NE-SLNPFK---D-----ATCTAMTN------C-PTHKTSYQDIMYHYKK------------ 
D468_RS0112575_Prevotella_oris_648594256 YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKYQKPNSTVIYNDFDGYRKRLEHIPQTNALLSRLRSIL-C--D--Y--PRKKAI-AGAMRRSVLSCIHE-YEHTFGYVDYITLSGSLLFSMKYA------TCYEELSK-E--TLYNRIKATDYPLADT---YLDGLTVTSCDYRQLF----EQYKNIPD-----VVFLVDPPYLSTDSKTY--KM-----YWKLSDYLDILTIL-A-GHR-FIYFTSNKSSITELCEWIGKN--KL-IGNPFE---N-----CHRREFKA------H-MNYNASYTDIMLYKN-------T----- K334_RS0105170_Prevotella_baroniae_647603997 YL-S-APLPFQGQKRMFAKEYIKAL-RQ----F------PDDT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRRRLEALSQTNALLAELRAIV----D--V--PRHKAI-VGAQRERVLSCIRK-HEQDYGYVDYITLSSSILFSMKYA------TEYAGLEK-E--TLYNNIKGVDYPPCED---YLDGLTITSCDYKKVF----ERYKNVPG-----VVFLVDPPYLSTDIKTY--RM-----CWKLSDYLDVLTIL-A-GHR-FIYFTSNKSSIIELCEWIGKN--KL-IGNPFE---N-----CRCQEFNA------H-MNHNASYTDIMLYTD-------V----- HMPREF0654_RS11780_Prevotella_disiens_739003412 YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKHQKPNSTVVYNDFDGYRHRLERIAQTNELLEELRAIV----D--V--PRSKPI-LGEVRKRVLDCIRK-HEQKYGCVDYITLSTSLLFSMKYA------TCFAEMEK-E--ILYNRIKSTNYLLCTD---YLDGLTITSYDYKEVF----EKYKDVPN-----VVFLVDPPYLSTDIKTY--RM-----NWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIVELCDWMGRH--PN-LGDPFK---N-----CHRREFNA------H-INYNSSYTDIMLYTD-------V----- PREBIDRAFT_RS00610_Prevotella_bivia_490468432 YL-S-APLPFQGQKRMFAKEYIKVL-RQ----F------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRKRLDALPQTNALLAELRAIV----D--V--PRHKPI-MGEQREQVLSCIRR-HERVHGCVDYITLSSSILFSMKYA------TEYAELEK-E--TLYNNIKGVDYPPCED---YLDGLTITSCDYKEVF----ERYKNVPG-----VVFLVDPPYLNTDSKTY--RM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIVELCDWMGRH--PN-FGDPFK---N-----CHRREFNA------H-MNYNSSYTDIMLYTD-------V----- HMPREF0665_RS09490_Prevotella_oris_490512514 
YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKYQKPNSTVIYNDFDGYRKRLEHIPQTNALLSRLRSIL-C--D--Y--PRKKAI-AGAMRRSVLSCIHE-YEHTFGYVDYITLSGSLLFSMKYA------TCYEELSK-E--TLYNRIKATDYPLADT---YLDGLTVTSCDYRQLF----EQYKNIPD-----VVFLVDPPYLSTDSKTY--KM-----YWKLSDYLDILTIL-A-DHR-FIYFTSNKSSITELCEWIGKN--KL-IGNPFE---N-----CHRREFKA------H-MNYNASYTDIMLYKN-------T----- B739_RS09680_Riemerella_anatipestifer_504751701 WK-S-APLPFQGQKRGFIRHFSQAV-KE----Y------PNNA---IYIDLFGGSGLLSHTVKCVHPNAKVIYNDYDNFRKRLEAIPKTNQILDELRALN-L--Q--T--PRGKKI-EGAEREAVFKILKK-ADER-GFVDWISLSSSLKFSMNYG------TKLKDFTE-D--TLYNSVRRSNYDPADD---YLEGIEVVSEDYKKLF----DIYRGKNN-----VVFLVDPPYLSTDTSTY--NKE---SYWKLSDYLEVLETL-Q-GSN-YFYFTSNKSQIVELCQWLETRT-SN-NSNPFK---G-----ATRTAVNN------K-TTHNTGYTDLMYHLKKN----------- M949_RS00775_Riemerella_anatipestifer_740907932 WK-S-APLPFQGQKRGFIRHFSQAV-KE----Y------PNNA---IYIDLFGGSGLLSHTVKCVHPNAKVIYNDYDNFRKRLEAIPKTNQILDELRALN-L--Q--T--PRGKKI-EGAEREAVFKILKK-ADER-GFVDWISLSSSLKFSMNYG------TKLKDFTE-D--TLYNSVRRSNYDPADD---YLEGIEVVSEDYKKLF----DIYRGKNN-----VVFLVDPPYLSTDTSTY--NKE---SYWKLSDYLEVLETL-Q-GSN-YFYFTSNKSQIVELCQWLETRT-SN-NSNPFK---G-----ATRTAVNN------K-TTHNTGYTDLMYHLKKIKIELNALL--- CCYN49044_RS09840_Capnocytophaga_cynodegmi_517090945 YK-Q-APLPFQGQKRRFLKEFEKAL-QE----Y------PSKG---FYVDLFGGSGLLSHTVKRLYPNATVIYNDFDDYHKRLEAIPQTNAILSELRNLN-L--T--T--PREKRI-AGLEREAVLAVLKR-ADES-GFVDWITISSSLKFSMNYG------FSYEDFEG-D--TLYNCVHTSNYELATD---YLQGIEIVKLDYKTLF----KQYKDVPN-----VVFLIDPPYLSTDTATY--NSK---DYWRLRDYLDVLDCL-H-GQS-FFYFTSNKSQIVELCEWLETRT-SD-NANPFK---G-----ANISTTAN------A-PSHNTKFTDIMYHIKR------------ LS70_RS01430_Helicobacter_sp_MIT_11-5569_736161659 FK-A-PPLPFMGNKKNALKLVESLI-KEIRAKY------NEQD-L-IFLDCFGGSGFLSHTFKYHLPNARVIYNDYDDYLDRVKNAKTTEEILGRISALV-T--S-----PKNAKI-TEEKKQKIISILEE-YEQRGQKIDYVSISSFVLFQGNYA------KDLTKLKK-A--QFYYKFGSIKK-ETRG---YLTGVEAVKMDFKAMI----EKYKAEAKISGKIAFLILDPPYLQTNTDVY--NT--E--FYRLPQFLELIDRI-E-KP--FMLFSSLKSDIVDFLAWYDRL-------NP-----K-----LKGRKIRS--------YNLCDVYSSVIPKTDFCFYE-------- 
HMPREF9420_RS08510_Prevotella_salivae_494223274 YL-S-APLPFQGQKRMFAKEYIKIL-QQ----F------PDNA---TFVDLFGGSGLLSHIAKHQKPNSTIVYNDFDGYRKRLEALPQTNALLAELRAIV----N--V--PRHKPI-LGGTRERVLSCIRR-HECTYGYVDYITLSSSLMFSMKYA------TEFSDFEK-E--TLYNNIKAVDYPSCSD---YLDGLVITSCDYKELF----EKYKDVPG-----VVFLVDPPYLSTDSKTY--KM-----YWKLSDYLDVLTIL-T-GHR-FIYFTSNKSSIIELCKWIGKN--KI-IGNPFE---N-----CHHKEFNA------H-MNYNSSYTDIMLYTD-------A----- HMPREF9304_RS12585_Prevotella_timonensis_739058226 HL-S-APLPFQGQKRMFAREYIKVL-QQ----Y------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRKRLEALPQTNALLSELRTIV----D--V--PRHKAI-IGTQREQVLSCIRK-HERTHGYVDYITLSSSILFSMKYA------TEYSDLEK-D--TLYNNIKGVDYPPCDD---YLDGLTITSCDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--RM-----YWKLSDYLDVLTVL-A-GHR-FIYFTSNKSSVIELCDWMGNH--PN-LGNPFR---N-----CHRKEFNA------H-MNYSSSYTDIMLYTD-------V----- X919_RS0112015_Prevotella_sp_HJM029_655519412 YL-S-APLPFQGQKRMFAREFKNVL-KQ----F------PDTA---TFVDLFGGSGLLSHIAKHEKPNATVVYNDFDGYRDRLAHIPQTNKLLAQLRLIL-N--D--Y--SRGKAI-IGEHRQRVLQCIEE-HQVRYGYVDYVTLSSSIMFAMHYK------QSLNEMRR-E--TLYNRIRQTDYPLCND---YLEGLTIVSADYKQIF----HQYKDVPG-----VVFLVDPPYLSTDCKTY--KM-----SWNLADYLDVLHVL-H-GHR-FIYFTFNKSSILELCDWMGKN--RN-LGNPFE---G-----CTKATFNA------H-ANFNATYTDMMLCKN-------D----- JCM14966_RS06695_Prevotella_oulorum_640643393 YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRKRLEALPQTNALLAELRAIV----D--V--PRHKAI-MGEQREQVLSCIRW-HESEHGYVDYITLSSSILFSMKYA------MGYADLEK-E--TLYNNIKGVDYPPCDD---YLEGLTITSSDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--RM-----FWKLSDYLDVLTIL-A-GHR-FIYFTSNKSSIIELCDWMGRH--PN-LGDPFR---N-----CHRREFNA------H-MNYNSSYTDIMLYTD-------V----- CAPSP0001_RS11720_Capnocytophaga_sputigena_488758369 
HT-T-SPLPFQGQKRKFVKHFKEAL-KH----F------PANA---TYIDLFGGSGLLSHTVKTTHPNARVIWNDYDDFAHRLALIPTTNEIIAQLRPIV-A--N--H--PKGTRI--NEVKPTILEVLRQ-YPPE--ALDYITLSANLLFSGKYA------TSLEALAK-D--GFYAKVSQTPY-NADG---YLAGVERRQTDYRKLI----AEFEYTPN-----TVFILDPPYLSTDISSY--RGA---QDWKLKDYLHIVKAL-N-TMPRYIYFGSNKGQLLDLFDFLANE--YN-LPSPFN---N-----TTRVTVST------S-VNYSSSYEDLMIYK-------------- TMA01S_RS05515_Tenacibaculum_maritimum_639782857 YG-S-SPLPFQGQKRNFIKQFKEAL-KT----Y------PEDA---VYVDLFGGSGLLSHTVKQEKPKAKVIYNDYDSFKNRIAAIPKTNNILRKLRELL-S--D--Y--PKSKKI-NGDKRKAVLELLKL-ENNK-GYVDFITISSSILFSMNYV------QTYEELEK-Q--TFYNRIRKSDF-NAEG---YLNNVEFVYGEYKEVF----KQYKNVPN-----VVFLVDPPYLSTDCTTY--K-----NYWKLTDYLDVLKVL-Y-SNN-YFYFTSNKSSVIELCKWIENN--TG-GVNPFN---K-----AKVVYQYN------K-TTHNTGYTDIMLHKC-------Y-TTDT HMPREF9141_RS12020_Prevotella_multiformis_494610799 YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDNT---TFVDLFGGSGLLSHIAKCQKPDSTVVYNDFDGYRLRLEHIPQTNELLAELREIV-Q--G--T--PRYKPI-TGDAREKVFECLQK-YQDRYGYLDFITISSSIMFSMKYR------LNIEEMRK-E--ALYNTIRKTDYPLCSD---YLDGLNIVSADYKQVF----NQYKDKPG-----VVFLVDPPYLSTEVGTY--KM-----YWRLADYLDVPTVL-A-GHS-FVYFTSNKSSILELCDWIGRN--KT-IGNPFE---K-----CTKVEFNA------H-MNYNATYTDMMLYKK-------A----- HMPREF9420_2325_Prevotella_salivae_DSM_15606_315663782 YL-S-APLPFQGQKRMFAKEYIKVL-RQ----F------PDNT---TFVDLFGGSGLLSHIAKCQKPDSTVVYNDFDGYRLRLEHIPQTNELLAELRKIV----D--V--PRHKPI-LGEARERVLSCILR-HERTHGYVDYITLSSSVMFSMKYA------TEFSDFEK-E--TLYNNIKAADYPSCSD---YLDGLTITSCDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--KM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIVELCEWIGKN--KL-IGNPFE---N-----CHRREFNA------H-MNYNASYTDIMLYTD-------V----- JCM15124_RS09550_Prevotella_falsenii_640568267 YN-S-APLPFQGQKRRFAREFAKVL-RH----F------PDDA---VFVDLFGGSGLLSHITKCQKPNATVVYNDFDGYRQRLAHVAETNELLAQLRVIL-K--D--V--PHHKLV-PAGTKELAIKCIEK-HEARYGYVDYITLSSSLMFSAEYA------TSLNGIAK-E--SMYNRVRKVDYSTGED---YLGGLTVVSEDYKELF----EQYKNVPN-----VVFLADPPYLNTEADSY--KM-----TWRLNDYLDVLLVL-L-KNS-FIFFTSNKSSVVELCEWLARN--GG-MPNPFE---R-----CDKVELDA------L-LNHHASYTDIMYYTT-------L----- 
H526_RS0116665_Aquimarina_latercula_737095939 YM-A-APLPFQGQKRNLAKQFKAAL-NK----R------TPPK---VYVDLFGGSGLLSRIAKDVHPQATVVYNDYDNYRQRIQHIPATNRVLNDIRKMV-V--D--L--PAKTRI-PQIIKNKIIERIEQ-ET---GYLDYITLSTYLLFSMNFV------NSLEELKK-Q--TFYNRVRHTEIPEAKD---YLKGLEVVFYDYRELF----ERYNGSDQ-----VVFIVDPPYLSTDCGSY--KN-----YWKLKEHLDVLKVL-K-GGR-YIYFTSNKSNVVELCEWIETN--TG-GVNPFY---N-----AETISRTN------I-VNYQSSYSDIMLID-------------- ATCC51562_RS05210_Campylobacter_concisus_544657538 FN-A-APLPFQGQKRNFIKQFRELIKDE----F------RAYRNG-IFIDAFGGSGLLSHNIKQIYPNARVIYNDYDNYSERLANIETTNEILQTIEPIT-K--K--Y--KKNEKV-SEEDREKIIKIIDE-YIKRGYFIDWLTLSSNLLFSAKYA------HNKDEFKK-E-KTFFATSPKMPLYQKNS---YLKGVEIAHKEAMELI----KEFENK-D-----VVLVLDPPYLQTNKAGY--K-----CFWGLRDFLKLIR-L-V-REP-FIFFSSENSDILPYIDDLVEY-----GDEAFK---G-----YSLKQARL------N-NNNEQAKIDYMIYK-------------- PIN17_RS06195_Prevotella_intermedia_763168088 YL-S-APLPFQGQKRMFAKEYIKVL-QQ----F------PDGT---TFVDLFGGSGLLSHIAKCQKPNSTVVYNDFDGYRHRLERIAQTNELLEELRAIV----D--V--PRSKPI-LGEIRKRVLDCIRK-HEQKYGYVDYITLSASLLFSMKYA------TCFADLEK-E--TLYSRVKSTNYPLCTD---YLDGLTITSCDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--KM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIMELCEWIGKN--KI-IGNPFE---N-----CHRREFNA------H-MNYNASYTDIMLYTD-------V----- HMPREF9420_RS10145_Prevotella_salivae_763205581 YL-S-APLPFQGQKRMFAKEYIKVL-RQ----F------PDNT---TFVDLFGGSGLLSHIAKCQKPDSTVVYNDFDGYRLRLEHIPQTNELLAELRKIV----D--V--PRHKPI-LGEARERVLSCILR-HERTHGYVDYITLSSSVMFSMKYA------TEFSDFEK-E--TLYNNIKAADYPSCSD---YLDGLTITSCDYKEVF----ERYKDVPG-----VVFLVDPPYLSTDSKTY--KM-----YWKLSDYLDVLTVL-S-GHR-FIYFTSNKSSIVELCEWIGKN--KL-IGNPFE---N-----CHRREFNA------H-MNYNASYTDIMLYTD-------V----- LS72_RS05710_Helicobacter_apodemus_736539564 
FK---PPLPFMGSKIAMLKRVKEAL-HS----MAFTQAIRKDT---IFYDVFGGSGLLSHYIKQLYPQNEVIWNDFDNFKERLDNIEKTENLRFRLHNLC-K--D--Y--STKIKL-PQEIIDKIKQILEQ----E-QYLDLTTLSSYLCFAGNYA------ITKEQLFN-N--IKYHRIPAKPL-NAKG---YLRGVMRVSKDFKQLLEEIPQEEKDKQQ-----AFLILDPPYLQTQKGNY----R-D--FYTLKDFCLLVENI-F-KP--YLFFSSKNSDILPFIDFYKKYN----PV-------------FEDYKIDK--------ASLKIGGEDYMICSS----------TQT JCM21142_RS20860_Saccharicrinis_fermentans_763356669 YT-S-APLPFMGQKRRFLTLFKSAL-NE----F------KTAN---TFIDLFGGSGLLSHTAKSVRPDAQVVYNDYDDYHTRLLHVDKTNRMLEHIRVLV-K--D--C--PPDKKI-PNEIKQKVIDYIQN-EEQK-GFVDYITLSSSLLFSMNYS------KSLKDLEK-Q--TMYNCVRKSNY-NVAG---YLDGLHIVKYDYKELF----NKYKGIDN-----VVFFVDPPYLSTEVGTY--NN-----YWRLADYLDVLNTL-K-DTS-YFYFTSDKSSIIELCDWLQKN--LE-ANNPFD---G-----AIKYEMPV------K-VNHNAGYTDIMLCK-------------- C506_RS0110745_Alistipes_497282263 YT-S-APLPFMGQKKRFIREFRKAL-RE----F------DHAT---VFVDLFGGSGLLSHVTKRERPDARVIYNDFDDYHVRLENIKRTNALLHDIRSIV-G--D--Y--PTAKRL-TPQMRTTILDTVRS-AEKT-GYVDYITLSSSLLFSSKYV------TDYTELQN-A--GLYNNLRASDY-TCEG---YLDGIEVVHADYRELF----NQYKDIPG-----VVFLVDPPYLSTEVGVY--KC-----RWRLSDYLDVLTLL-S-STS-YFYFTSNKSSIIELCEWISEA--KV-NANPFL---N-----AVRKEMGA------Q-LNYNSRYTDIMIYRR------------- BN590_01677_Alistipes_sp_CAG:29_547931225 YN-S-APLPFMGQKRRFVGEFKKAL-GQ----F------PNAT---VFVDLFGGSGLLSHVAKQERPDVQVVYNDFDDFHLRLQNIPRTNALLADIRNFVGG--G--I--SKNQRL-SEALRKQILDRVAE-EEKT-GFVDYITLSGSLLFSGKYV------TSFDELAK-D--GFYNTVRQTDF-SAEG---YLDGVEIVKQDYHELF----EQYKDVSG-----AVFLVDPPYLSTEVGAY--KC-----YWRLADYLDVLKIL-Q-GKS-YVYFTSNKSQIVELIDWFTRT--QF-NSNPFE---G-----AERREFNV------S-INHNSKYTDIMLYKQ-------A----- ATHG_RS03835_Alistipes_timonensis_497946642 YN-S-APLPFMGQKRRFVGEFRKAL-GH----F------PGAT---VFVDLFGGSGLLSHVAKQERPDVQVVYNDFDDFHLRLQNIPRTNALLADIRNFVGG--G--I--SRNQRL-SEALQKQILDRVAE-EEKT-GLVDYITLSGSLLFSGKYV------TSFDELAK-D--GFYNTVRQTDF-SAEG---YLDGVEIVKQDYRELF----EQYKDVPG-----AVFLVDPPYLSTEVGPY--KC-----YWRLADYLDVLKVL-Q-GKS-YVYFTSTKSQIVELIDWFTRT--QF-DSNPFE---G-----AERREFNV------T-INHNSKYTDIMLYRR-------I----- 
JCM21142_114604_Saccharicrinis_fermentans_DSM_9555_=_JCM_21142_588488100 YT-S-APLPFMGQKRRFLTLFKSAL-NE----F------KTAN---TFIDLFGGSGLLSHTAKSVRPDAQVVYNDYDDYHTRLLHVDKTNRMLEHIRVLV-K--D--C--PPDKKI-PNEIKQKVIDYIQN-EEQK-GFVDYITLSSSLLFSMNYS------KSLKDLEK-Q--TMYNCVRKSNY-NVAG---YLDGLHIVKYDYKELF----NKYKGIDN-----VVFFVDPPYLSTEVGTY--NN-----YWRLADYLDVLNTL-K-DTS-YFYFTSDKSSIIELCDWLQKN--LE-ANNPFD---G-----AIKYEMPV------K-VNHNAGYTDIMLCKG-------NTSVSG SU65_11745_Flavobacterium_psychrophilum_806965486 FT-K-APLPFMGQKRNFIKQFKPAL-NK----Y------SESA---TYVDLFGGSGLLSHTVKSIYPGAKVVYNDFDNYKIRLENIGKTNQLIADLRVIL-K--D--S--PKDKII-LGEFRSKVLERVLL-EENS-GYVDYITLSSSILFSMKYA------LSFEALQK-E--TLYNTMRQSEY-TADG---YLDGLEVVSLCYKELF----AKYKDLPN-----VVFLVDPPYLSTESGTY--KS-----FWKLRDYLDVLQVL-D-GTK-YFYFTSKKSSIIELCEWIETK--MP-MSNPFT---G-----ASLETMNA------T-VTYQSSYTDIMLYKY-------E----- JCM12083_RS12170_Prevotella_shahii_647521559 YQ-Q-APLPFMGQKRKFVKAFRQIL-RG----Y------PDDV---TIVDLFGGSGLLSHVAKREKPNATVVYNDFDNYQWRIATIPRTNALLARIREVT-D--S--L--PRGKVI-RHPHRDRILEIIAE-EEQC-GFVDYITLSPSLLFSMKYA------NNMDELVK-Q--TFYNTVRRNDY-CADG---YLDGLTIVHKDYKALF----AEYRDKPN-----VLFLVDPPYLSTEVGTY--TM-----SWRLADYLDVLTVL-Q-GHD-YVYFTSNKSQIIELCEWIGRS--RI-NRNPFE---C-----AHRVEVNT------T-MNYNSNYTDIMLYRK-------N----- HMPREF0670_RS03300_Prevotella_sp_oral_taxon_317_496521463 YQ-Q-APLPFMGQKRKFVKAFRQIL-KS----Y------PDNV---TIVDLFGGSGLLSHVAKREKPNATVVYNDYDNYHRRIAAIPRTNALLARIREVT-E--S--L--PRGKVI-RQPHRDRILEIIAE-EEQR-GFVDYITLSPSLLFSMKYA------NKMDELVK-Q--TFYNTVRRNDY-CADG---YLDGLTIVHKDYKALF----NEYRDKPN-----VLFLVDPPYLSTEVGTY--TM-----TWKLADYLDVLTIL-Q-GHD-YVYFTSNKSQIIELCEWIGQS--RI-DRNPFE---C-----AHRVEVNT------T-MNYNSSYTDIMLYRK-------N----- BN863_RS14255_Formosa_agariphila_740746518 
YN-T-APLPFMGQKRKFIKSFKDAL-HN----Y------PPDG---IYVDLFGGSGLLAHTAKQHYPNATVVYNDFDNYRKRINAIPETNNLLEKLRMLI-S--E--W--PKDKRI-TGVTRENVLKAIKR-HEDQYNYVDYITLSSSLLFSMKYV------LNYEDLVK-S--TLYNCIRMSDY-KAEG---YLNGLDIVSLDYKVLF----EQYKDSDK-----VVFLIDPPYLQTTSVTY--KN-----YWNLTDYLDVLSVL-E-GHR-YFYFTSNKSSIIELCEWVGNR--TL-TTNPFA---H-----ATKLEVNT------S-VNYNSSYTDIMLFK-------------- IW16_RS16985_Chryseobacterium_vrystaatense_736743227 YV-Q-APLPFQGQKRRFLKSFKEAL-KD----F------PEDA---IYVDLFGGSGLLSHTVKQFYPNSEVIYNDFDGYTFRLENVQKTNSLLSDVREIC-S--K--S-IDRKGKL-SNELHSEIIGRISK---EK-GFVDWVTISSSLLFSMNYA------TSFEQLKK-E--TFYNKVRLSDY-CVDG---YLEGVSKVREDYQCLF----AKYQHYPK-----AVFLIDPPYLSTNCSTY--TNP---DYWKLSDYLNVLNTV-D-NTS-YFYFTSNKSQIIELCDWMSKKK-CF-K-NPFS---C-----STTVSINT------S-LTHNAKYDDIMIYRYKNV---------- HMPREF9715_RS04510_Myroides_odoratimimus_493305395 FC-A-SPLPFLGQKRKYLKEVKQVL-NH----T------NPRG---TYVDLFGGSGLLSHTIKRHYPDATVIYNDYDGFSDRISNITTTNNLLERIRLLL-V--D--I--DSKTKV-PDTIKQQILQLIKA-DEEANVYVDYITLSSTLLFTMKYE------QTYEGFAK-Q--TLYNRLTKTPY-NADG---YLEGLIIESSDYKALF----EKYKHIPG-----VCFLVDPPYLSTEVSGY--KMN----YWKLKDYLNVLNVL-D-GHK-YLYFTSNKSQIVELCEWVESR--KD-KGNPFN---H-----SRTVSMTN------K--SKNTTYEDILIHN-----------ITP ANH9381_RS06760_Aggregatibacter_actinomycetemcomitans_503933737 YK-Q-APLPFIGQKQQFLTHYTTIL-NQ-HI-Q------DEGK-GWTIIDAFGGSGLLSHTAKQLKPAARVIYNDFDDYVMRLKHIDDINRLRGKIYTLL-D-GV--P--RQKRIT-DHLLKTKIIKVIET-FD---GYKDLNCLASWLLFSGQQV------ATLSDLYH-K--DFWNCIRKSDYPNAHG---YLDGIEITNESFHTLL----PRFINDER-----AVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLINIT-R-PP--YIFFSSTKSEFVRFIEYMCEDK-VN-NWQAFD---G-----AQRITIKT------N-LNYQGQYEDNLVYKF------------- HMPREF0198_0362_Cardiobacterium_hominis_ATCC_15826_258520722 
HS-K-APLPFIGQKRAFLNQFATVL-HQ-II-P------DDGD-GWTILDAFGGSGLLSHAAKHHKPAARVIYNDYDGYAERLRHIPDINRLRRILEDVL-R--H--H--PRGVHL-KTAKRAEVVAAIRA-FD---GYTDLNCLISWLLFSGNQA------SSIEELCG-K--HMYHAVRRSDFPAADG---YLDGLEITRESYTTLL----PQHTANPR-----CLLILDPPYICTMQGAY--KQQ-G--YFGMVEFLRLMLHV-R-PP--FIFFSSTRSELPAYLQLVIGDR-LA-GWERFI---D-----YQTISINT------V-LNSTARYEDNLIYKC------------- SCC393_RS02190_Aggregatibacter_actinomycetemcomitans_491717013 YK-Q-APLPFIGQKQQFLTHYTTIL-NQ-HI-Q------DEGK-GWTIIDAFGGSGLLSHTAKQLKPAARFIYNDFDDYVMRLKHIDDINRLRGKIYTLL-D-GV--P--RQKRIT-DHLLKTKIIKAIET-FD---GYKDLNCLASWLLFSGQQV------ATLSDLYH-K--DFWNCIRKSDYPNAHG---YLDGIEITNESFHTLL----PRFINDER-----AVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLINIT-R-PP--YIFFSSTKSEFVRFIEYMCEDK-VN-NWQAFD---G-----AQRITIKT------N-LNYQGQYEDNLVYKF------------- HMPREF0198_RS01770_Cardiobacterium_hominis_750049800 HS-K-APLPFIGQKRAFLNQFATVL-HQ-II-P------DDGD-GWTILDAFGGSGLLSHAAKHHKPAARVIYNDYDGYAERLRHIPDINRLRRILEDVL-R--H--H--PRGVHL-KTAKRAEVVAAIRA-FD---GYTDLNCLISWLLFSGNQA------SSIEELCG-K--HMYHAVRRSDFPAADG---YLDGLEITRESYTTLL----PQHTANPR-----CLLILDPPYICTMQGAY--KQQ-G--YFGMVEFLRLMLHV-R-PP--FIFFSSTRSELPAYLQLVIGDR-LA-GWERFI---D-----YQTISINT------V-LNSTARYEDNLIYKC------------- HPNK_00382_Haemophilus_parasuis_str_Nagasaki_598907105 FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-S------GDGE-GWTIVDVFGGSGLLSHTAKQLKPRARVIYNDFDNYAERLQHIPDINQLRQQLAIAL-A--D--C--SKGKRL-DKAKKLQLIEIIEA-FK---GYKDPHILCSWLLFSGQQV------KSIEELYT-Q--DFWHCLRQSDYPSAEG---YLDGVEIVCESFHQLV----PRFSGKEK-----VLLVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRFIEYMQEDK-VD-NWQAFD---G-----AKRIVINT------S-TSYSGKYEDNLVYKF------------- GGE_RS03480_Haemophilus_haemolyticus_491864737 
FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHTAKQLKPKAHVIYNDFDGYAERLVHIDDTNALRTQIFAKI-G-NT--T--PKNKRL-PKSLKAEIIQIIDE-FQ---GYKDLNCLASWLLFSGQQV------GSLEELYR-K--DFWHCVRLSDYPSADG---YLDGVEVVHESFHTLL----PKYANDPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF------------- HPS42_05865_Haemophilus_parasuis_ST4-2_633956025 FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-E------GDGE-GWIIVDVFGGSGLLSHTAKQLKPQARVIYNDFDNYAERLQHIPDINQLRQQLAVAL-A--D--C--PKDKRL-DKTKKLQLIEIIEA-FK---GYKDPHILCSWLLFSGQQV------KSIEELYT-Q--DFWHCLRQSDYPSAEG---YLDGVEIVCESFHQLV----PRFSGKEK-----VLLVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRFIEYMQEDK-VD-NWQAFD---G-----AKRIVVNT------S-TSYSGKYEDNLVYKF------------- HMPREF1128_RS04375_Haemophilus_sputorum_494789040 FK-Q-APLPFIGQKRMFLKHFQNIL-NE-HI-K------DDGE-GWIIIDAFGGSGLLSHVAKAIKPKARVIYNDFDGYSERLAHIGDINTLRSQLFTAV-G-SA--V--PKNKRM-PKEVKAKCVKIIQE-FD---GYKYLNCLASWLLFSGQQV------ATTDELFQ-N--DFWNCIRQSDYPKADC---YLDDIEIIRESFHTLL----PKFSGNRK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFIRFIEYMQKDK-VD-NWQSFD---G-----AKRIVVNG------S-ASYSGKYEDNLVYKF------------- HPS9_RS04300_Haemophilus_parasuis_737511689 FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-S------GDGE-GWTIVDVFGGSGLLSHTAKQLKPRARVIYNDFDNYAERLQHIPDINQLRQQLAVAL-A--D--C--PKDKRL-DKAKKLQLIEIIEA-FK---GYKDPHILCSWLLFSGQQV------KSIEELYT-Q--DFWHCLRQSDYPSADG---YLEGVEIVCESFHQLV----PRFSGKEK-----VLLVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRFIEYMQEDK-VD-NWQAFD---G-----AKRIVINT------S-TSYSGKYEDNLVYKF------------- SASC598J21_017980_Snodgrassella_alvi_SCGC_AB-598-J21_662243730 
FK-Q-APLPFIGQKRYFIKSFCEVL-ND-NI-Q------GLGD-EWTIVDVFGGSGLLSHHAKRLKPHARVIYNDFDNYAQRLKHIDDINRLRQQLAEVL-K--G--I--PRKNKI-DPKTHALIIETIKN-FD---GFIDIDCVWAWLLFSGNQA------ESLNQIYTQP--VLYNRLRKSDYPDAKD---YLTGVEVVSKSFDELL----PEYVNNEK-----TLLVLDPPYLFSEQKGY--RKA-K--DFGLASFLQLMELI-R-PP--FIMFSNYRSEILDYFDYQIKRN-D----ERFL---N-----YKYTSISA------P-LNNVGFYRDNMIYKF------------- SASC598J21_RS08380_Snodgrassella_alvi_739535589 FK-Q-APLPFIGQKRYFIKSFCEVL-ND-NI-Q------GLGD-EWTIVDVFGGSGLLSHHAKRLKPHARVIYNDFDNYAQRLKHIDDINRLRQQLAEVL-K--G--I--PRKNKI-DPKTHALIIETIKN-FD---GFIDIDCVWAWLLFSGNQA------ESLNQIYTQP--VLYNRLRKSDYPDAKD---YLTGVEVVSKSFDELL----PEYVNNEK-----TLLVLDPPYLFSEQKGY--RKA-K--DFGLASFLQLMELI-R-PP--FIMFSNYRSEILDYFDYQIKRN-D----ERFL---N-----YKYTSISA------P-LNNVGFYRDNMIYKF------------- hia5_Haemophilus_influenzae_359359006 FK-Q-APLPFIGQKRMFLKQFEQIL-NE-NI-S------DNGE-GWTILDTFGGSGLLSHTAKRLKPKARVIYNDFDGYAERLAHIDDINQLRAELYSVV-G-NA--T--SKNKRM-TKDCKAECIRIIQN-FK---GYKDLNCLASWLLFSGQQV------ATLDDLFQ-H--NFWHCIRQSDYPKADG---YLDGVEIVKESFHTLL----PKFSNDPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF------------- SU55_RS07055_Haemophilus_influenzae_756154060 FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTVIDTFGGSGLLSHAAKRLKPKARVIYNDFDGYAERLAYINDTNALRTQIFAKI-G-NA--T--PKNKRL-PKSLKAEIIKIIDQ-FK---GYKDLNCLTSWLLFSGQQV------SSLDELYK-K--DFWHCVRLSDYPSAEG---YLDGVEVIRESFHTLL----PKFTDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMVDDK-VH-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF------------- A160_0967_Aggregatibacter_actinomycetemcomitans_serotype_a_str_A160_443550816 
YK-Q-APLPFIGQKQQFLTHYTTIL-NQ-HI-Q------DEGK-GWTIIDAFGGSGLLSHTAKQLKPAARFIYNDFDDYVMRLKHIDDINRLRGKIYTLL-D-GV--P--RQKRIT-DHLLKTKIIKAIET-FD---GYKDLNCLASWLLFSGQQV------ATLSDLYH-K--DFWNCIRKSDYPNAHG---YLDGIEITNESFHTLL----PRFINDER-----AVFVLDPPYLCTKQESY--KQA-H--YFDLIDFLRLINIT-R-PP--YIFFSSTKASSCGLLSICVKIK-SI-TGRHLM---V-----HKELRLRQ------T-STTKGNMKIIWFINFRD----------- W820_RS02320_Haemophilus_influenzae_748782878 FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHTAKRLKPKARIIYNDFDGYAERLAHIDDINQLRAELYSVV-G-NA--T--PKNKRM-TKDCKAECIRIIQN-FK---GYKDLNCLASWLLFSGQQV------ATLDDLFQ-H--NFWHCIRQSDYPKADG---YLDGVEVIRESFHTLL----PKFADNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFMNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF------------- SU30_RS04070_Haemophilus_influenzae_756154896 FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GEGE-GWTIIDTFGGSGLLSHTAKRLKPKARVIYNDFDGYAERLAHIDDINQLRTELYSVV-G-NA--T--PKNKRM-TKDCKAECIRIIQN-FK---GYKDLNCLASWLLFSGQQV------ATLDDLFQ-H--NFWHCIRQSDYPKADG---YLDGIEIVKESFHTLL----PKFSDDPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YVFFSSTKSEFIRFVNYMLEDK-VD-NWQAFE---N-----AKRITVNA------K-LNYQVAYEDNLVYKF------------- SU30_RS01820_Haemophilus_influenzae_756151906 FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGK-DWTIIDVFGGSGLLSNTAKRVKPKSRVIYNG---YSERLNHITEINQLREILYQTV-N-GI--I--TKNKLI-SKRLKEEIINKINN-FS---GYKDVNCLSSWLLFSGQQV------NSLDELFK-Q--RFYNCIRKSDYELADG---YLNGLEVINESFHQLL----PKFIDKEK-----VLLILDPPYLCTRQESY--KQA-S--YFDLIDFLRLIHLT-K-PP--FIFFSSTKSEFIRFIEAMVEDK-WD-NWQVFN---E-----VNRITVNA------S-ASYNGKYEDNLIYKF------------- COK_0640_Mannheimia_haemolytica_serotype_A2_str_BOVINE_261312126 FK-Q-APLPFIGQKRMFLSHFKQVL-NA-NI-E------ADGK-DWTIIDVFGGSGLLAHTAKREKPLARVIYNDFDNYAERLNHIKDTNQLRQEIYQIV-D-GV--I--PKNKRI-SNEVKAKIINKIND-FE---GFKDPKCLSSWLLFSGEQV------ATLDELFK-H--DFWNCVRQSNYPEATG---YLDDIEVVSESFHQLL----PRFHDKEK-----VLLILDPPYLCTRQESY--KQA-N--YFDLIDFLRLIDLT-K-PP--YIFFSSTKSEFIRFIHYMVQNK-KE-NWQAFE---G-----AERIVVNA------S-PSYNGKYEDNMVFKF------------- 
J450_RS10910_Mannheimia_haemolytica_493294268 FK-Q-APLPFIGQKRMFLSHFKQVL-NA-NI-E------ADGK-DWTIIDVFGGSGLLAHTAKREKPLARVIYNDFDNYAERLNHIKDTNQLRQEIYQIV-D-GV--I--PKNKRI-SNEVKAKIINKIND-FE---GFKDPKCLSSWLLFSGEQV------ATLDELFK-H--DFWNCVRQSNYPEATG---YLDDIEVVSESFHQLL----PRFHDKEK-----VLLILDPPYLCTRQESY--KQA-N--YFDLIDFLRLIDLT-K-PP--YIFFSSTKSEFIRFIHYMVQNK-KE-NWQAFE---G-----AERIVVNA------S-PSYNGKYEDNMVFKF------------- K941_RS0100010_Moraxella_caprae_738435815 HS-T-APLPFIGQKRQIIQVFRNTL-DR-IV-P------DDGQ-GWTIIDVFGGSGLLAHNAKYLKPKARVIYNDFDNFSQRLYHLCDTNRLRQQLYAIL-E--P--L--PRAKRI-DEPTKQKLLEIIQN-FD---GFVDCHSVSTWLLFSGNQI------SHIDELPK-H--EFYNTIRRSDYPTADG---YLDGLEIVSESFEILM----PKFYHQDK-----TLFILDPPYLRTKQEAY--GLG-E--YFGMIQFLKLMKWV-R-PP--YLFFSSTKSEFLSYLDYVKEYE-PV-MWERLG---G-----FEKLSFTS------Y-VNKSSTYEDNMIFKI------------- MS_RS00345_[Mannheimia]_succiniciproducens_499512619 FK-Q-APLPFIGQKRMFLKHFERLL-ED--I-P------NDGE-GWTIIDAFGGSGLLSHVAKHLKPEATVIYNDFDGYAERLAHIDDINRLRQAIYPLL-A--N--C--AKSKKV-PNDIKTQIIDVIKG-FD---GYINEHILCSWLCFSGQQV------KTLDELFK-E--DFWNCIRKSDYPSADG---YLDGIEVVSESFHTLL----PKYQTDPK-----ALFVLDPPYLCTQQASY--KQE-N--YFDLIDFLRLVHLT-R-PP--YVFFSSSKSEFVRFIEAMIEDK-WD-NWQAFE---N-----YERVIVKT------S-SSYSGKYEDNMVFKF------------- ACEE_RS02875_Actinobacillus_equuli_746131177 FK-Q-APLPFIGQKRMFLKQVESVL-NQ-HI-D------GDGK-DWIIVDVFGGSGLLSHTAKRVKPNATVIYNDFDGYSDRLKHIDDINALRRIIYNIC-V-DI--I--PKNSRL-SKELKAKIINEINQ-FK---GYKDLNCLATWLLFSGQQI------GSFDELYA-K--EFYNCVRMTDYPQATG---YLDGLEIMSESFHTLI----PKFANKTN-----VLLLLDPPYLCTRQESY--KQK-N--YFDLVDFLRLVNLT-R-PP--YIFFSSTKSEFIRFIDTAIEDK-WN-NWQAFD---E-----YKRIVVHV------S-ASYTGKYEDNMIYKF------------- HMPREF1052_RS08385_Pasteurella_bettyae_492143056 
FK-Q-APLPFIGQKRMFLKHFQNIL-NE-HI-K------DDGE-DWIIIDAFGGSGLLSHVAKAIKPKARVIYNDFDGYSERLAHIGDINTLRSQLFTAV-G-SA--V--PKNKRM-PKEVKAKCVKIIQE-FD---GYKDLNCLASWLLFSGQQV------ATTDELFQ-N--DFWNCIRQSDYPKADC---YLDDIEIIRESFHTLL----PKFSDNRK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFIRFIEYMQKDK-VD-NWQSFE---G-----AKRIVVNG------S-ASYSGKYEDNLVYKF------------- HICG_RS06205_Haemophilus_influenzae_696250595 FK-Q-APLPFIGQKRMFLKHVEIVL-NK-HI-D------GEGE-GWTIVDVFGGSGLLSHTAKQLKPKATVIYNDFDGYAERLNHIDDINRLRQIIFNCL-H-GI--I--PKNGRL-SKEIKEEIINKIND-FK---GYKDLNCLASWLLFSGQQV------GSVEALFA-K--DFWNCVRQSDYPTAEG---YLDGIEVISESFHKLI----PRYQNQDK-----VLLLLDPPYLCTRQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRYLNYMQESK-TD-NWRAFE---N-----YKRIVVKA------S-ASKDGIYEDNMIYKF------------- WQG_17550_Bibersteinia_trehalosi_USDA-ARS-USMARC-192_469489659 FK-Q-APLPFIGQKRMFLKHFEQVL--A-HI-P------DDGN-GWTIVDVFGGSGLLSHTAKRLKPKARVIYNDYDNYSERLQHIDDINRLRRIIADLM-A--D--T--PKYKRL-DNAKKLQIIEAIEA-FQ---GYKDLHILCSWLAFSGQQV------SSFDELYK-Q--NFWHCIRQSDYLTADG---YLDGVEIVRESFHQLV----PRFTGQPN-----TLLVLDPPYLCTHQESY--KQE-R--YFDLVDFLRLIHLT-K-PP--YVFFSSTKSEFVRFIDAMVEDK-WD-NWQAFD---D-----AQRIVVQT------S-ASYNGKYEDNMVYKF------------- IO45_RS00140_Gallibacterium_anatis_746088554 FA-Q-APLPFIGQKRMFLQQFRSVL-NQ-MI-A------DNGD-GWTIVDAFGGSGLLSHTAKCLKPNARVIYNDFDGYAERLKHIDDINRLRQILSELL-A--N--C--PRDRRL-DIAMRHKVIDAIES-FN---GYKDPHILCAWLLFSGQQV------KSINELYS-R--GFYNCIRQSDYTTADG---YLDGIEVVNESFVTLL----PKFADDSK-----AIFVLDPPYLCTKQASY--KQE-R--YFDLIDFLELIRLT-R-PP--YLFFSSTKSEFIRFVDWLIASK-GD-NWQSFV---D-----YQRIIVQT------S-TSYSGKYEDNLIYKC------------- HD_RS06430_Haemophilus_ducreyi_753848096 YK-N-APLPFIGQKRQFLTHYTEIL-NQ-YI-S------GDGQ-GWTIIDAFGGSGLLSDVAKRIKPAARVIYNDFDNYAERLLHIDEINELRLKISDTI-G-NT--I--PKNKKL-TPDVKSKVINVIQS-FQ---GYKDLNCLASWLLFSGNQV------GSLEDLFN-K--DFWHCVRQSDYPRADG---YLDGIEIIQESFHQLL----PKFRDEPN-----TLFVLDPPYLCTRQESY--RQA-S--YFDLIGFLRLIHLT-R-PP--YIFFSSSKSEFVRFIDAMVEDK-WD-NWQAFE---N-----YGKISINT------S-ASYSGKYEDNMVFKF------------- 
HI1523_Haemophilus_influenzae_491961424 FK-Q-APLPFIGQKRMFLKHVEIVL-NK-HI-D------GEGE-GWTIVDVFGGSGLLSHTAKQLKPKATVIYNDFDGYAERLNHIDDINRLRQIIFNCL-H-GI--I--PKNGRL-SKEIKEEIINKIND-FK---GYKDLNCLASWLLFSGQQV------GSVEALFA-K--DFWNCVRQSDYPTAEG---YLDGIEVISESFHKLI----PRYQNQDK-----VLLLLDPPYLCTRQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRYLNYMQESK-TD-NWRAFE---N-----YKRIVVKA------S-ASKDGIYEDNMIYKF------------- IE01_RS11495_Gallibacterium_anatis_517158409 FS-Q-APLPFIGQKRMFLNQFKTVL-NQ-MI-A------NDGE-GWTIVDAFGGSGLLSYAAKQLKPKARVIYNDFDGYAERLKHIDDINRLRQQLSDLL-T--G--C--PRQKRL-DIAMRHKVIDVIES-FN---GYKDPHILCAWLLFSGQQI------KSLNELYR-H--GFYNCVRQSDYDTADG---YLDGIEVVSESFSKLL----PKFANDKK-----AIFVLDPPYLCTHQASY--KQE-N--YFDLIHFLELIRLT-R-PP--YIFFSSTKSEFVRFVDWLVATK-GN-NWQSFV---D-----YKRIIVQT------S-TSYSGKYEDNLIYKC------------- HD_1581_Haemophilus_ducreyi_35000HP_33148844 YK-N-APLPFIGQKRQFLTHYTEIL-NQ-YI-S------GDGQ-GWTIIDAFGGSGLLSDVAKRIKPAARVIYNDFDNYAERLLHIDEINELRLKISDTI-G-NT--I--PKNKKL-TPDVKSKVINVIQS-FQ---GYKDLNCLASWLLFSGNQV------GSLEDLFN-K--DFWHCVRQSDYPRADG---YLDGIEIIQESFHQLL----PKFRDEPN-----TLFVLDPPYLCTRQESY--RQA-S--YFDLIGFLRLIHLT-R-PP--YIFFSSSKSEFVRFIDAMVEDK-WD-NWQAFE---N-----YGKISINT------S-ASYSGKYEDNMVFKF------------- HMPREF9095_RS07250_Haemophilus_aegyptius_494053240 FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHAAKVIKPKAHVIYNDFDSYAERLAYINDTNALRTQIFAKI-G-NA--T--PKNKRL-PKSLKAEIIKIIDQ-FK---GYKDLNCLTSWLLFSGQQV------SSLDELYK-K--DFWHCVRLSDYPSAEG---YLDGVEVIRESFHTLL----PKFTDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-R-PP--YIFFSSTKSEFVRFIEYMVDDK-VH-NWQTFD---N-----AERIVVNA------S-ASYSGKYEDNMVYKF------------- HPS41_RS06910_Haemophilus_parasuis_737547081 
FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-D------GNGE-GWTIVDVFGGSGLLSHTAKQLKPQARVIYNDFDNYAERLQHIPDINQLRQQLAVAL-A--D--C--PKGKRL-DKTKKLQLIEIIEA-FK---GYKDPHILCSWLLFSGQQV------KSLEELYT-Q--DFWHCLRQSDYPSAES---YLDGVEIVCESFHQLV----SRFSGKEK-----VLLVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRFIEYMQEDK-VD-NWQAFD---G-----AKRIVVNT------S-TNCRGKYEDNLVYKF------------- A3G3_RS0107195_Moraxella_boevrei_518349657 YK-S-APLPFIGQKRFFISHFVKLL-KD-KI-P------NDGE-NWTIIDVFGGSGLLAHNAKRLLPKARVIYNDFDGYAQRLQNIADTERLRQKLFDLL-A--N--A--EDERKL-NDQQRKQIIETINQ-FD---GYLDINAIATWILFSGKQA------KTLDELYQ-N--TFYNTVRKTPYKTADG---YLDGLEITHESFETLI----PKFANQPN-----TLLLLDPPYVFTEQTAY--HQA-K--YFGMVEFLTLMSLV-R-PP--YIFFSSTKSELLDYLAYVQKHQ-PH-DWDRLG---G-----FDRLFLQS------Q-VNYHKSYQDNMIWKF------------- F543_RS02755_Bibersteinia_trehalosi_644530078 FK-Q-APLPFIGQKRMFLKHFEQVL--A-HI-P------DDGN-GWTIVDVFGGSGLLSHTAKRLKPKARVIYNDYDNYSERLQHIDDINRLRRIIADLM-A--D--T--PKYKRL-DNAKKLQIIEAIEA-FQ---GYKDLHILCSWLAFSGQQV------SSFDELYK-Q--NFWHCIRQSDYLTADG---YLDGVEIVRESFHQLV----PRFTGQPN-----TLLVLDPPYLCTHQESY--KQE-R--YFDLVDFLRLIHLT-K-PP--YVFFSSTKSEFVRFIDAMVEDK-WD-NWQAFD---D-----AQRIVVQT------S-ASYNGKYEDNMVYKF------------- SVR5_RS07195_Haemophilus_parasuis_491999424 FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-D------GNGE-GWTIVDVFGGSGLLSHTAKQLKPQARVIYNDFDNYAERLQHIPDINQLRQQLAVAL-A--D--C--PKDKRL-DKTKKLQLIEIIEA-FK---GYKDPHILCSWLLFSGQQV------KSIEELYT-Q--DFWHCLRQSDYPSAEG---YLDGVEIVCESFHQLV----PRFSGKEK-----VLLVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRFIEYMQEDK-VD-NWQAFD---G-----AKRIVVNT------S-TSYSGKYEDNLVYKF------------- HPS41_RS09445_Haemophilus_parasuis_737547480 FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-S------GDGE-GWTIVDVFGGSGLLSHVTKRLKPKATVIYNDFDGYAERLAHIDDINRLRRLIYPLL-A--A--C--EKQKKV-PNDVKAQIIEVIKN-FD---GYINEHILCSWLCFSGQQV------ATLDELFK-E--DFWHCIRQSDYSSADG---YLDDIEVVSESFYTLL----PKYQNDPK-----ALFVLDPPYLCTHQASY--KQA-T--YFDLVDFLRLIHLT-R-PP--FVFFSSTKSEFVRYVDAMIEDK-WD-NWQAFQ---D-----YERIVVNT------S-TSYSGKYEDNLVYKF------------- 
F543_5700_Bibersteinia_trehalosi_USDA-ARS-USMARC-189_575451422 FK-Q-APLPFIGQKRMFLKHFEQVL--A-HI-P------DDGN-GWTIVDVFGGSGLLSHTAKRLKPKARVIYNDYDNYSERLQHIDDINRLRRIIADLM-A--D--T--PKYKRL-DNAKKLQIIEAIEA-FQ---GYKDLHILCSWLAFSGQQV------SSFDELYK-Q--NFWHCIRQSDYLTADG---YLDGVEIVRESFHQLV----PRFTGQPN-----TLLVLDPPYLCTHQESY--KQE-R--YFDLVDFLRLIHLT-K-PP--YVFFSSTKSEFVRFIDAMVEDK-WD-NWQAFD---D-----AQRIVVQT------S-ASYNGKYEDNMVYKF------------- BN1226_RS02290_Mannheimia_sp_MG13_764738267 FK-Q-APLPFIGQKRMFLQHFERLL-ND-NI-P------NDGD-GWTILDAFGGSGLLSHVAKRLKPKATVIYNDFDGYAERLQHIDDINRLRRQIAPLL-A--E--Q--PKQKRL-SPELKAQIIDVIKA-FD---GYINVHVLCSWLLFSGQQV------KTLDELFT-Q--DFWHCLRQSDYPSADG---YLDGLTVVSESFHTLL----PKYQHDPK-----ALFVLDPPYLCTHQESY--GQQ-R--YFDLIDFLRLIHLT-R-PP--FVFFSSTKSEFVRFIDAMITDQ-WD-NWQSFA---N-----YERIAVKT------S-TSYSGKYEDNMVFKF------------- HPS41_07110_Haemophilus_parasuis_ST4-1_633953678 FK-Q-APLPFIGQKRMFLKHFSQIL-ND-NI-D------GNGE-GWTIVDVFGGSGLLSHTAKQLKPQARVIYNDFDNYAERLQHIPDINQLRQQLAVAL-A--D--C--PKGKRL-DKTKKLQLIEIIEA-FK---GYKDPHILCSWLLFSGQQV------KSLEELYT-Q--DFWHCLRQSDYPSAES---YLDGVEIVCESFHQLV----SRFSGKEK-----VLLVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLINLT-K-PP--YIFFSSTKSEFIRFIEYMQEDK-VD-NWQAFD---G-----AKRIVVNT------S-TNCRGKYEDNLVYKF------------- HICON_RS02315_Haemophilus_influenzae_503292691 FK-Q-APLPFIGQKRMFLKHFETVL-NE-NI-K------GDGE-GWTIIDTFGGSGLLSHAAKVIKPKAHVIYNDFDSYAERLAYINDTNALRTQIFAKI-G-NA--T--PKNKRL-PKSLKAEIIKIIDQ-FK---GYKDLNCLTSWLLFSGQQV------SSLDELYK-K--DFWHCVRLSDYPSAEG---YLDGVEVIRESFHTLL----PKFSDNPK-----ALFVLDPPYLCTKQESY--KQA-T--YFDLIDFLRLVNIT-L-PP--YIFFSSTKSEFVRFIEYMVDDK-VH-NWQTFD---N-----AERIVVNA------S-ASYSGKYEDNMVYKF------------- APPSER11_RS04990_Actinobacillus_pleuropneumoniae_491784781 
YK-N-APLPFIGQKRQFLTHYTEIL-NQ-YI-P------GDGQ-GWTIIDAFGGSGLLSHVAKRIKPAARVIYNDFDNYAERLLHIDEINELRLKISDTI-G-NA--I--PKNKKL-TPDVKSKVINAIQS-FQ---GYKDLNCLASWLLFSGNQV------GSLEDLFN-K--DFWHCVRQSDYPRADG---YLDGVEIIQESFHQLL----PKFRDEPN-----TLFVLDPPYLCTRQESY--RQA-S--YFDLIDFLRLIHLT-R-PP--YIFFSSSKSEFVRFIDAMVEDK-WD-NWQAFE---N-----YGKISINT------S-ASYSGKYEDNMVFKF------------- POREN0001_0004_Porphyromonas_endodontalis_ATCC_35406_229317608 YT-T-APLPFAGQKRRWLKQLEPII-RS----L------PSNT---IFVDVFGGSGLVSRLCKDVHPAARVIYNDYDNYSERLRHIKETEQLRQEIVSIL-A--P--L--KHNSRV-PEEYKALVLRAVQA-HEKRCQYVDWVTLSGWLLFTNNFA------YSPKDLAS-R--GLYAHPSRTALSDAGASERYLCGLEIVSVDYRELL----SAYKDASN-----TILILDPPYLSTECGGY--RGN----SWSCDDYLDLLTLM-P-TNN-YLYFSSTKFDFVPFL----RR-----TSAAW----G-----YQHPFIDAQVLYRSSPFGSGGANPEVLYYK-------------- L13_RS00005_Neisseria_weaveri_738545947 HA-K-APLPFAGQKRNFIKHYLGVL-DK--I-P------GSGS-GWTIVDVFGGSGLLTHVAKRVKPDARVIYNDFDNYAARVKAIPDINRLRRLISGYL-A--G--Y--VKKQRI-PDDVKQVIIGEIER-FD---GYKCHVVLASWFLFSGRQA------ANLERFYR-S--EWYFNLPLSDYPVADD---YLDGLEITRQSYETLI----PQFSDDPQ-----ALLVLDPPYLSTTQAAY--AQD-G--RFGLVDYLKLVNLV-R-PP--YLFFSSTRSEFIDYIDAVVSMQ-LD-NWHVFD---H-----STRLTVQA------K-VSKYASYEDNLVYKL------------- NM70021_RS109520_Neisseria_meningitidis_488180979 HS-T-APLPF-------IKHFTKVL-SQ--I-P------ADGK-HWTIVDVFGGSGLLAHVAKRIKPQARVIYNDYDNYSDRLRHIPDYNRLREQIAQIV-G--G--I--PKGSRL-DPERTRSVQQTITN-FQ---GHIDVRVLSSWLLFSAKQA------NSLEQLLG-F--EFYNKVRQSPYSIAAD---YLDGLEITQQDYNLLM----AEHQHNPN-----TLLVLDPPYVSTAQGAY--AAD-K--YFNMVSFLRMIQYM-R-PP--FILFSSTRSEALDYFQFLQECE-PD-KYRRFS---G-----YNIVSLDA------K-MGKGIEYQDNMIYKI---D--------- CUP_RS09075_Campylobacter_upsaliensis_490401642 YK-T-PPLAFNGNKKNMLKLYREAL-ED----MKCY--VNKDT---IFYDVFGGSGLLAHETKRQFQNNKVIWNDFDNFQKRLNMLDKTEALRLKIVNII-K--DKRF--QKEERI-KRIERQKIEKLLKE----E-GEFDYIQLSSWLRFGGSYAKDCEDFFRAKEFYN-K--IAYDKV----L-DKKD---YLKGVIRVQKDYKELL----KEAKERGN-----FFFILDPPYIQTDKAHY----E-G--FFGLCEFLELIISI-E-MP--FIFFSSAKSEILNFMDFCKEPK----NLQNLQ---NLQNLHFKRVNLNK--------CIKNKDNSDFIFYKQ----------R-- 
HMPREF1052_RS06075_Pasteurella_bettyae_492141771 FK-Q-APIPFIGQKRMFLQHFERLL-ND-NI-P------NDGD-GWTIVDAFGGSGLLSHIAKRLKPKATVIYNDFDGYAERLQHIDDINRLRRQIAPLL-ALAE--Q--PKQKRL-SPELKAQIIDVIKA-FD---GYINVHVLCSWLLFSGQQV------KTLDELFT-Q--DFWHCLRQSDYPSADG---YLDGLTVVSESFHTLL----PKYQHDPK-----ALFVLDPPYLCTHQESY--GQQ-R--YFDLIDFLRLIHLT-R-PP--FVFFSSTKSEFVRFIDAMITDQ-WD-KWQSFA---N-----YERIAVKT------S-TSYSGKYEDNMVYKF------------- consensus/100% .......hsF..........h...l......................hhD.FuG..hls...c.....s.hlhNs...a..bh..h...p.hb..h............................h...h...............l.s.h.a................h........a................Y..sh......h..h...................hhhhsssah.s....Y..........a.h...h..............hhFs..p............................................................................. consensus/95% a..p.sPLPF.GpKp.hhp.h..hL..p...................hlD.FGGSGLLuc.sK...s.u.llaNDaDsa..Rl..l.p.N.lb..l...................h.....+..hb..l........sa.D...lss.lhFu.p.........ph..h.......ha..hp..sh..s.s...YLpsl.h.p.sap.lh.....pa..........shhllDPPYh.T...sY..........a.h.paLplh..........ahhFoSs+S.h..hh.hh..........p.h...............h...........s....hpD.hhh............... consensus/90% a..p.APLPF.GQKR.Fhp.h.phL..p...........sp......hlDhFGGSGLLuH.sKp.bP.upVlYNDaDsY..Rl..I.p.N.Lb..l...............p.p.l.....+..hh..lp.......Ga.D..slss.lhFS.pb........sh.ph.......happlp.ssa..sps...YLpGlplsp.sap.lh.....pa.s........slhllDPPYhsTp..sY..........a.h.paLplh......s...alaFoSs+S.hhchhphh.p........p.a..........h.b..hps......p..s.ps.YpD.hlap.............. consensus/85% a..p.APLPF.GQKRbFhppa.plL.pp...........sssp....hlDhFGGSGLLuH.sKp.+PpupVlYNDaDsY.pRL.pIsp.N.Lb..l..hh..........s+pp.l.s...+..hhp.lp...p...Ga.Dh.slso.LLFS.pb........shpph.p.p..shappl+.ssY..sps...YL-Glplsp.sapplh.....pa.s........slhllDPPYlsTp..sY..p.......aph.-aLclhph..p.s...alaFoSs+Sphlchhphh.p........p.F..........h.b..hps......p.hs.ps.YpD.hlap.............. 
consensus/80% a..p.APLPF.GQKRbFhppabplL.pp...........sssp...shlDhFGGSGLLSH.sKp.+PpApVlYNDaDsY.pRL.pIsp.N.Lb.pl..hl..........P+pp.l.s...+..lhp.Ip...p...Ga.Dh.slso.LLFS.pbh.......shp-hbp.p..shappl+bsDY..sps...YL-GlplsppsacpLh.....pa.s........slhllDPPYlsTc..sY..pb.....bapL.DaLcllpl..p.s...alaFoSsKSphlchhcah.pp.......psF....s.....hpc..hps......p.hsbpspYpD.hlap.............. consensus/75% a..p.APLPFhGQKRbFhppabplL.pp...........s-sp...shlDhFGGSGLLSH.sKpb+PpApVIYNDaDsY.pRL.pIsp.NpLb.pl..hl..........P+pcbl.s.p.+.bllp.Ipp.bp...Ga.Dh.sluo.LLFS.pbh.......shp-lbp.p..shaspl+bsDY.pscs...YL-Glplsppsa+pLh....spa.s.sp.....slhllDPPYlsTc..oY..pb.....aacL.DaLcllpl..p.s...alaFoSsKSphlchhcah.pp.......psFp...s.....hp+..hps......p.hsapupYpD.hlY+.............. consensus/70% ap.p.APLPFhGQKRbFlpcFbplL.pp...........s-up...shlDlFGGSGLLSH.sKpb+PpApVIYNDFDsY.cRLppIsc.NpLb.plbpll....s..h..P+p+bl.s.pb+.bllp.Ipp.bc...GY.Dh.sLuS.LLFSupbh.......shc-Lbp.p..shaNslRboDYspscs...YLDGl-llppsa+pLh....scapspsp.....slFllDPPYLsTcp.oY..+b.....YacL.DaLcllpl..c.s...alaFoSsKSphlchhcah.cp.......psFp...s.....hp+h.lss......p.hsasupYpD.hlYK..............
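The gene-neighborhood strings in the table that follows (for example `598907103_DUF4376->598907104_?->598907105_N6-MTase*->`) can be read programmatically. The sketch below is only an illustration and not the authors' pipeline. It assumes that `name->` denotes a gene on the forward strand, `<-name` a gene on the reverse strand, `||` a boundary between divergently oriented genes, a trailing `*` the query gene, and a leading number before the first underscore the GI of that gene. The names `Gene`, `parse_neighborhood` and `same_strand_run` are hypothetical helpers introduced here. The same-strand run mirrors only the "same direction" part of the operon criterion given in the methods; the intergenic-distance cutoff cannot be recovered from these strings.

```python
import re
from typing import List, NamedTuple


class Gene(NamedTuple):
    gene_id: str    # leading numeric identifier, or "" when the string has none
    label: str      # domain/architecture label; "?" when unannotated
    strand: str     # "+" for `name->`, "-" for `<-name` (assumed reading)
    is_query: bool  # True when the label carried a trailing "*"


# A reverse-strand gene is written `<-name`; a forward-strand gene is `name->`.
# Names may contain "-", "_" and "+", but never "<", ">" or "|".
_TOKEN = re.compile(r"<-(?P<rev>[^<>|]+)|(?P<fwd>[^<>|]+)->")


def parse_neighborhood(text: str) -> List[Gene]:
    """Split one neighborhood string into genes, in the order written."""
    genes = []
    for match in _TOKEN.finditer(text):
        raw = match.group("rev") or match.group("fwd")
        strand = "-" if match.group("rev") else "+"
        is_query = raw.endswith("*")
        raw = raw.rstrip("*")
        gene_id, sep, label = raw.partition("_")
        if not (sep and gene_id.isdigit()):
            # No numeric prefix: the whole token is the label (e.g. "Tail_P2_I").
            gene_id, label = "", raw
        genes.append(Gene(gene_id, label, strand, is_query))
    return genes


def same_strand_run(genes: List[Gene]) -> List[Gene]:
    """Return the contiguous block of genes sharing the query gene's strand."""
    idx = next((i for i, g in enumerate(genes) if g.is_query), None)
    if idx is None:
        return []
    lo = hi = idx
    while lo > 0 and genes[lo - 1].strand == genes[idx].strand:
        lo -= 1
    while hi < len(genes) - 1 and genes[hi + 1].strand == genes[idx].strand:
        hi += 1
    return genes[lo:hi + 1]


if __name__ == "__main__":
    example = (
        "598907103_DUF4376->598907104_?->598907105_N6-MTase*->"
        "<-598907106_?<-598907107_?||598907108_?->598907109_?->"
    )
    for gene in same_strand_run(parse_neighborhood(example)):
        print(gene.gene_id, gene.label, gene.strand, gene.is_query)
```

Keeping the parser as a single regular expression makes the assumed grammar explicit, so it is easy to adjust if the notation turns out to differ from the reading above.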
GI Gene neighborhood Architecture Pfam architecture Gene name Len Taxonomy Species name Genbank description # 1; Eukaryotic versions 123207322 <-N6-MTase* N6-MTase MethyltransfD12 TVAG_557140 280 eukaryota>parabasalia Trichomonas vaginalis G3 hypothetical protein [Trichomonas vaginalis G3]. <-123207322_N6-MTase* 123481438 MULE-transposase->?->?->?-><-?||?-><-N6-MTase*<-?||?-><-?<-?<-?<-A32-like_ATPase N6-MTase MethyltransfD12 TVAG_344370 280 eukaryota>parabasalia Trichomonas vaginalis G3 hypothetical protein [Trichomonas vaginalis G3]. <-123481412_?||123481416_MULE-transposase->123481420_?->123481423_?->123481427_?-><-123481431_?||123481435_?-><-123481438_N6-MTase*<-123481442_?||123481445_?-><-123481449_?<-123481453_?<-123481456_?<-123481460_A32-like_ATPase||123481463_?-> 123421258 <-N6-MTase* N6-MTase MethyltransfD12 TVAG_007390 273 eukaryota>parabasalia Trichomonas vaginalis G3 hypothetical protein [Trichomonas vaginalis G3]. <-123421244_?||123421247_?->123421249_?->123421251_?-><-123421254_?||123421256_?-><-123421258_N6-MTase*<-123421261_?||123421265_?->123421267_?-><-123421269_?<-123421272_?<-123421274_?||123421276_?-> 123479010 T4gp10-like-baseplate->?-><-?||?-><-?||?-><-N6-MTase* N6-MTase MethyltransfD12 TVAG_271330 273 eukaryota>parabasalia Trichomonas vaginalis G3 hypothetical protein [Trichomonas vaginalis G3]. <-123478996_?||123478998_T4gp10-like-baseplate->123479000_?-><-123479002_?||123479004_?-><-123479006_?||123479008_?-><-123479010_N6-MTase*||123479012_?->123479014_?-><-123479016_?<-123479018_?||123479020_?->123479022_?->123479024_?-> 123471301 <-N6-MTase+Phage-tailfib<-?<-?||?->?-><-?||?->N6-MTase+Phage-tailfib*->?->DUF3839-> N6-MTase+Phage-tailfib PTR TVAG_056220 566 eukaryota>parabasalia Trichomonas vaginalis G3 hypothetical protein [Trichomonas vaginalis G3]. 
<-123471287_N6-MTase+Phage-tailfib<-123471289_?<-123471291_?||123471293_?->123471295_?-><-123471297_?||123471299_?->123471301_N6-MTase+Phage-tailfib*->123471303_?->123471305_DUF3839-> 123976294 N6-MTase+Phage-tailfib*->DUF3839->?->?-><-A32-like_ATPase<-DUF4108 N6-MTase+Phage-tailfib PTR TVAG_051460 527 eukaryota>parabasalia Trichomonas vaginalis G3 hypothetical protein [Trichomonas vaginalis G3]. <-123976292_?||123976294_N6-MTase+Phage-tailfib*->123976296_DUF3839->123976298_?->123976300_?-><-123976302_A32-like_ATPase<-123976304_DUF4108<-123976306_?||123976308_?-> 123484516 <-MULE-transposase<-?<-?||?*->DUF3839-><-A32-like_ATPase||?-><-DUF4108 - PAT1 TVAG_039120 451 eukaryota>parabasalia Trichomonas vaginalis G3 hypothetical protein [Trichomonas vaginalis G3]. 123484490_?-><-123484493_?||123484497_?-><-123484501_?<-123484505_MULE-transposase<-123484509_?<-123484512_?||123484516_?*->123484521_DUF3839-><-123484525_A32-like_ATPase||123484529_?-><-123484533_DUF4108||123484536_?-><-123484539_?<-123484543_? # 218; Prokaryotic homologs 472258915 <-N6-MTase*<-?<-?<-?<-?<-?<-?<-Tail_P2_I N6-MTase SP D650_21760 323 bacteria>proteobacteria>gammaproteobacteria Mannheimia haemolytica USDA-ARS-USMARC-183 hypothetical protein D650_21760 [Mannheimia haemolytica USDA-ARS-USMARC-183]. <-472258908_?||472258909_?-><-472258910_?<-472258911_?||472258912_?->472258913_?->472258914_?-><-472258915_N6-MTase*<-472258916_?<-472258917_?<-472258918_?<-472258919_?<-472258920_?<-472258921_?<-472258922_Tail_P2_I 469489659 GP46->Baseplate_J->DUF2313->?->DUF4376->?->N6-MTase*-> N6-MTase SP WQG_17550 317 bacteria>proteobacteria>gammaproteobacteria Bibersteinia trehalosi USDA-ARS-USMARC-192 D12 class N6 adenine-specific DNA methyltransferase [Bibersteinia trehalosi USDA-ARS-USMARC-192]. 
469489652_?->469489653_GP46->469489654_Baseplate_J->469489655_DUF2313->469489656_?->469489657_DUF4376->469489658_?->469489659_N6-MTase*->469489660_?->469489661_?->469489662_?->469489663_?->469489664_?->469489665_?-><-469489666_? 345456400 <-VirD4-FtsK<-?<-LPD3+N4ART+N4ART+MPTase+MPTase<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC-> N6-MTase MethyltransfD12 BSEG_01570 316 bacteria>bacteroidetes Bacteroides dorei 5_1_36/D4 D12 class N6 adenine-specific DNA methyltransferase [Bacteroides dorei 5_1_36/D4]. <-345456398_?||345456399_?-><-229435357_VirD4-FtsK<-229435356_?<-229435355_LPD3+N4ART+N4ART+MPTase+MPTase<-229435354_?<-229435353_?<-345456400_N6-MTase*<-229435351_?||229435350_ParA->229435349_?->229435348_?->229435347_DOC-><-229435346_?<-345456401_? 598907105 DUF4376->?->N6-MTase*-> N6-MTase Methyltransf_26 HPNK_00382 315 bacteria>proteobacteria>gammaproteobacteria Haemophilus parasuis str. Nagasaki hypothetical protein HPNK_00382 [Haemophilus parasuis str. Nagasaki]. 598907103_DUF4376->598907104_?->598907105_N6-MTase*-><-598907106_?<-598907107_?||598907108_?->598907109_?->598907110_?->598907111_?->598907112_?-> 507741308 <-VirD4-FtsK<-?<-?<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC-> N6-MTase SP+MethyltransfD12 C799_02456 314 bacteria>bacteroidetes Bacteroides thetaiotaomicron dnLKV9 hypothetical protein C799_02456 [Bacteroides thetaiotaomicron dnLKV9]. 507741301_?->507741302_?-><-507741303_VirD4-FtsK<-507741304_?<-507741305_?<-507741306_?<-507741307_?<-507741308_N6-MTase*<-507741309_?||507741310_ParA->507741311_?->507741312_?->507741313_DOC->507741314_?-><-507741315_? 335946057 STN+Cna_B_2+Plug->?->?->?-><-?<-N6-MTase*<-?||ParA-> N6-MTase MethyltransfD12 HMPREF1018_02174 313 bacteria>bacteroidetes Bacteroides sp. 2_1_56FAA hypothetical protein HMPREF1018_02174 [Bacteroides sp. 2_1_56FAA]. 
335946050_?->335946051_?->335946052_STN+Cna_B_2+Plug->335946053_?->335946054_?->335946055_?-><-335946056_?<-335946057_N6-MTase*<-335946058_?||335946059_ParA->335946060_?->335946061_?->335946062_?-><-335946063_?<-335946064_? 387775820 STN+Cna_B_2+Plug->?->?->?-><-?<-N6-MTase*<-?||?->ParA-> N6-MTase MethyltransfD12 HMPREF1055_02982 313 bacteria>bacteroidetes Bacteroides fragilis CL07T00C01 hypothetical protein HMPREF1055_02982 [Bacteroides fragilis CL07T00C01]. 387775813_?->387775814_?->387775815_STN+Cna_B_2+Plug->387775816_?->387775817_?->387775818_?-><-387775819_?<-387775820_N6-MTase*<-387775821_?||387775822_?->387775823_ParA->387775824_?->387775825_?-><-387775826_?<-387775827_? 392705106 STN+Cna_B_2+Plug->?->?->?-><-?<-N6-MTase*<-?||?->ParA-> N6-MTase MethyltransfD12 HMPREF1079_00192 313 bacteria>bacteroidetes Bacteroides fragilis CL05T00C42 hypothetical protein HMPREF1079_00192 [Bacteroides fragilis CL05T00C42]. 392705099_?->392705100_?->392705101_STN+Cna_B_2+Plug->392705102_?->392705103_?->392705104_?-><-392705105_?<-392705106_N6-MTase*<-392705107_?||392705108_?->392705109_ParA->392705110_?->392705111_?->392705112_?-><-392705113_? 575451422 <-N6-MTase*<-?<-DUF4376<-?<-DUF2313<-Baseplate_J<-GP46 N6-MTase SP F543_5700 313 bacteria>proteobacteria>gammaproteobacteria Bibersteinia trehalosi USDA-ARS-USMARC-189 D12 class N6 adenine-specific DNA methyltransferase [Bibersteinia trehalosi USDA-ARS-USMARC-189]. 575451415_?-><-575451416_?<-575451417_?<-575451418_?<-575451419_?<-575451420_?<-575451421_?<-575451422_N6-MTase*<-575451423_?<-575451424_DUF4376<-575451425_?<-575451426_DUF2313<-575451427_Baseplate_J<-575451428_GP46<-575451429_? 596213380 <-ParA||?->N6-MTase*-><-?||?->?-><-?<-?<-?<-STN+Cna_B_2+Plug N6-MTase MethyltransfD12 M070_4300 313 bacteria>bacteroidetes Bacteroides fragilis str. A7 (UDC12-2) D12 class N6 adenine-specific DNA methyltransferase family protein [Bacteroides fragilis str. A7 (UDC12-2)]. 
596213373_?->596213374_?-><-596213375_?<-596213376_?<-596213377_?<-596213378_ParA||596213379_?->596213380_N6-MTase*-><-596213381_?||596213382_?->596213383_?-><-596213384_?<-596213385_?<-596213386_?<-596213387_STN+Cna_B_2+Plug 695509259 <-ParA||?->N6-MTase*->?-><-METHYLASE N6-MTase MethyltransfD12 M117_RS13145 313 bacteria>bacteroidetes Bacteroides fragilis DNA methyltransferase [Bacteroides fragilis]. <-695509272_ParA||695509275_?->695509259_N6-MTase*->695509264_?-><-695509267_METHYLASE 33148844 tail_3->N6-MTase*-> N6-MTase - HD_1581 308 bacteria>proteobacteria>gammaproteobacteria Haemophilus ducreyi 35000HP conserved hypothetical protein [Haemophilus ducreyi 35000HP]. 33148837_?->33148838_?->33148839_?->33148840_?->33148841_?->33148842_?->33148843_tail_3->33148844_N6-MTase*-><-33148845_?<-33148846_?<-33148847_?<-33148848_?<-33148849_?||33148850_?->33148851_?-> 595910038 <-ParA<-?||?->N6-MTase*-><-?||?-><-?<-?<-?<-STN+Cna_B_2+Plug N6-MTase MethyltransfD12 M080_1486 302 bacteria>bacteroidetes Bacteroides fragilis str. 3397 T10 D12 class N6 adenine-specific DNA methyltransferase family protein [Bacteroides fragilis str. 3397 T10]. 595910031_?->595910032_?-><-595910033_?<-595910034_?<-595910035_ParA<-595910036_?||595910037_?->595910038_N6-MTase*-><-595910039_?||595910040_?-><-595910041_?<-595910042_?<-595910043_?<-595910044_STN+Cna_B_2+Plug<-595910045_? 261312126 N6-MTase*-> N6-MTase SP COK_0640 301 bacteria>proteobacteria>gammaproteobacteria Mannheimia haemolytica serotype A2 str. BOVINE hypothetical protein COK_0640 [Mannheimia haemolytica serotype A2 str. BOVINE]. 261312126_N6-MTase*-><-261312127_?||261312128_?->261312129_?->261312130_?->261312131_?->261312132_?->261312133_?-> 588488100 Collar->?->?->?->?->N6-MTase*-> N6-MTase - JCM21142_114604 298 bacteria>bacteroidetes Saccharicrinis fermentans DSM 9555 = JCM 21142 site-specific DNA methylase [Saccharicrinis fermentans DSM 9555 = JCM 21142]. 
588488094_?->588488095_Collar->588488096_?->588488097_?->588488098_?->588488099_?->588488100_N6-MTase*-> 491961424 GP46->Baseplate_J->DUF2313->?->?->N6-MTase*-> N6-MTase - HI1523 296 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. 16273417_?->16273418_?->16273419_GP46->16273420_Baseplate_J->30995455_DUF2313->16273422_?->16273423_?->491961424_N6-MTase*-><-16273425_?<-16273426_?||30995456_?->16273428_?->16273429_?->16273430_?-><-16273431_? 633956025 <-N6-MTase* N6-MTase - HPS42_05865 292 bacteria>proteobacteria>gammaproteobacteria Haemophilus parasuis ST4-2 hypothetical protein HPS42_05865, partial [Haemophilus parasuis ST4-2]. 633956024_?-><-633956025_N6-MTase* 656071953 <-N6-MTase* N6-MTase - K941_RS0107980 292 bacteria>proteobacteria>gammaproteobacteria Moraxella caprae hypothetical protein [Moraxella caprae]. <-738436646_?<-656071949_?<-656071950_?<-656071951_?<-738436648_?<-738436650_?<-656071952_?<-656071953_N6-MTase*<-656071954_?<-656071955_?<-656071956_?<-656071957_?<-738436652_?||656071958_?->656071959_?-> 736161659 N6-MTase*-> N6-MTase - LS70_RS01430 291 bacteria>proteobacteria>epsilonproteobacteria Helicobacter sp. MIT 11-5569 hypothetical protein [Helicobacter sp. MIT 11-5569]. 736161636_?->736161640_?->736161643_?->736161646_?->736161652_?->736164915_?->736161656_?->736161659_N6-MTase*->736161663_?->736161667_?->736161671_?->736161675_?->736161679_?->736161683_?->736161687_?-> 740907932 <-N6-MTase* N6-MTase - M949_RS00775 290 bacteria>bacteroidetes Riemerella anatipestifer DNA methyltransferase [Riemerella anatipestifer]. 504751695_?->491057476_?->504751696_?->504751697_?->504751698_?->504751699_?->504751700_?-><-740907932_N6-MTase*<-504751703_?<-740908888_?<-740907934_?<-504751706_?<-504751707_?<-504751709_?<-504751710_? 
493294268 Phage_Mu_Gp45->GP46->Baseplate_J->DUF2313->?->DUF4376->?->N6-MTase*-> N6-MTase - J450_RS10910 289 bacteria>proteobacteria>gammaproteobacteria Mannheimia haemolytica hypothetical protein [Mannheimia haemolytica]. 493293624_Phage_Mu_Gp45->525964514_GP46->525964517_Baseplate_J->493293627_DUF2313->525964520_?->525759286_DUF4376->493294352_?->493294268_N6-MTase*->665824783_?->493291229_?-><-493292924_?<-525964527_?<-493292928_?<-493292929_?<-525964533_? 237868315 <-Phage_sheath_1<-?<-?<-?<-?<-?<-N6-MTase*<-?<-?<-Collar<-Tail_P2_I<-Baseplate_J<-?<-Phage_base_V N6-MTase MethyltransfD12 GCWU000324_01234 288 bacteria>proteobacteria>betaproteobacteria Kingella oralis ATCC 51147 D12 class N6 adenine-specific DNA methyltransferase [Kingella oralis ATCC 51147]. <-237868308_?<-237868309_Phage_sheath_1<-237868310_?<-237868311_?<-237868312_?<-237868313_?<-237868314_?<-237868315_N6-MTase*<-237868316_?<-237868317_?<-237868318_Collar<-237868319_Tail_P2_I<-237868320_Baseplate_J<-237868321_?<-237868322_Phage_base_V 323094424 N6-MTase*-> N6-MTase - HMPREF0663_11914 288 bacteria>bacteroidetes Prevotella oralis ATCC 33269 D12 class N6 adenine-specific DNA methyltransferase [Prevotella oralis ATCC 33269]. 323094417_?->323094418_?->323094419_?->323094420_?->323094421_?-><-323094422_?||323094423_?->323094424_N6-MTase*-><-323094425_?||323094426_?->323094427_?-><-323094428_?<-323094429_?<-323094430_?<-323094431_? 491876509 N6-MTase*-> N6-MTase - HMPREF1053_RS00285 288 bacteria>proteobacteria>gammaproteobacteria Haemophilus haemolyticus hypothetical protein [Haemophilus haemolyticus]. 696248627_?->696248628_?->491876467_?->491876208_?->491876315_?->491876522_?->491876174_?->491876509_N6-MTase*-><-491876092_?<-491876286_?<-491876146_?<-491876339_?||491876477_?-><-491876085_?<-491849867_? 
656071893 GPW_gp25->Baseplate_J->Tail_P2_I->?->DUF4376->?->?->N6-MTase*-> N6-MTase MethyltransfD12 K941_RS0107590 288 bacteria>proteobacteria>gammaproteobacteria Moraxella caprae hypothetical protein [Moraxella caprae]. 656071886_GPW_gp25->656071887_Baseplate_J->656071888_Tail_P2_I->738436601_?->656071890_DUF4376->656071891_?->656071892_?->656071893_N6-MTase*->656071894_?->656071895_?->738436603_?->656071897_?->656071898_?-><-656071899_?<-656071900_? 738435815 <-N6-MTase*<-?<-tail_3 N6-MTase - K941_RS0100010 288 bacteria>proteobacteria>gammaproteobacteria Moraxella caprae hypothetical protein [Moraxella caprae]. <-738435815_N6-MTase*<-738435817_?<-738435838_tail_3<-738435841_?<-656070712_?<-656070713_?<-656070714_?<-738435843_? 762905187 <-Phage_integrase||?->?->N6-MTase*-> N6-MTase - UMN179_RS08310 288 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. 762905530_?->503511886_?-><-503511887_?<-762905184_?<-503511888_Phage_integrase||503511889_?->503511890_?->762905187_N6-MTase*-><-503511892_?<-503511893_?<-503511894_?<-503511895_?<-503511896_?<-503511897_?<-503511898_? 494836361 Thymidylate_synthase->N6-MTase*-> N6-MTase - M137_RS11600 286 bacteria>bacteroidetes Bacteroides MULTISPECIES: DNA methyltransferase [Bacteroides]. 757767589_?->495949220_?->494836371_?->494836369_?->494836367_?->494836365_Thymidylate_synthase->494836361_N6-MTase*->695408851_?->695399977_?->499301954_?->492242375_?->695399987_?-><-695399991_?||492348593_?-> 496521463 STN+Cna_B_2+Plug->?-><-Thymidylate_synthase<-N6-MTase* N6-MTase Methyltransf_26 HMPREF0670_RS03300 286 bacteria>bacteroidetes Prevotella sp. oral taxon 317 DNA methyltransferase [Prevotella sp. oral taxon 317]. 496521457_?-><-763204112_?<-763204113_?||496521459_?->496521460_STN+Cna_B_2+Plug->496521461_?-><-763204334_Thymidylate_synthase<-496521463_N6-MTase*<-496521464_?<-496521465_?<-496521466_?<-496521467_?<-496521468_?<-496521469_?<-763204114_? 
521257429 N6-MTase*->Phage_base_V->?->Baseplate_J->?->?->?->Phage_sheath_1-> N6-MTase - PSYCG_RS09460 286 bacteria>proteobacteria>gammaproteobacteria Psychrobacter sp. G hypothetical protein [Psychrobacter sp. G]. 521257422_?->521257423_?->521257424_?->521257425_?->521257426_?->521257427_?->521257428_?->521257429_N6-MTase*->754143218_Phage_base_V->521257431_?->521257432_Baseplate_J->754143219_?->521257434_?->521257435_?->521257436_Phage_sheath_1-> 647521559 N6-MTase*->Thymidylate_synthase-><-?<-ABC-ATPase N6-MTase - JCM12083_RS12170 286 bacteria>bacteroidetes Prevotella shahii DNA methyltransferase [Prevotella shahii]. 647521551_?->647521553_?->647521557_?->647521559_N6-MTase*->647521563_Thymidylate_synthase-><-763202136_?<-647521566_ABC-ATPase<-647521567_?<-647521568_?<-647521569_?||647521571_?-> 649521449 <-STN+Cna_B_2+Plug<-VirD4-FtsK<-?<-LPD3+N4ART+N4ART+MPTase+MPTase<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC-> N6-MTase MethyltransfD12 M098_0958 286 bacteria>bacteroidetes Bacteroides vulgatus str. 3775 SR(B) 19 D12 class N6 adenine-specific DNA methyltransferase family protein [Bacteroides vulgatus str. 3775 SR(B) 19]. <-649521442_?<-649521443_STN+Cna_B_2+Plug<-649521444_VirD4-FtsK<-649521445_?<-649521446_LPD3+N4ART+N4ART+MPTase+MPTase<-649521447_?<-649521448_?<-649521449_N6-MTase*<-649521450_?||649521451_ParA->649521452_?->649521453_?->649521454_DOC-><-649521455_?||649521456_?-> 736171011 Phage_base_V->GPW_gp25->Baseplate_J->Tail_P2_I->Collar->Caudo_TAP->?->N6-MTase*-> N6-MTase Methyltransf_26 Q338_RS03200 286 bacteria>proteobacteria>betaproteobacteria Alysiella crassa hypothetical protein [Alysiella crassa]. 736170990_Phage_base_V->736170994_GPW_gp25->736170998_Baseplate_J->736171002_Tail_P2_I->736171072_Collar->736171076_Caudo_TAP->736171005_?->736171011_N6-MTase*->736171080_?->736171084_?-><-736171015_?<-736171088_?<-736171019_?<-736171022_?<-736171026_? 
495898225 <-N6-MTase*<-Thymidylate_synthase N6-MTase - HMPREF9441_RS15520 285 bacteria>bacteroidetes Paraprevotella clara DNA methyltransferase [Paraprevotella clara]. <-495898213_?||495898214_?-><-748607312_?||495898218_?->495898219_?->495898220_?->495898221_?-><-495898225_N6-MTase*<-495898226_Thymidylate_synthase<-495898227_?<-495898228_?<-748607316_?<-495898231_?<-748607319_?<-495898233_? 495946269 <-N6-MTase*<-Thymidylate_synthase N6-MTase - BSFG_RS03650 285 bacteria>bacteroidetes Bacteroides sp. 4_3_47FAA DNA methyltransferase [Bacteroides sp. 4_3_47FAA]. <-495946260_?<-495946261_?<-495123637_?<-495123633_?<-495946264_?<-696364638_?<-696364632_?<-495946269_N6-MTase*<-495946270_Thymidylate_synthase<-495946271_?<-495946272_?<-495946273_?<-495123608_?<-495946274_?<-495946275_? 315663782 N6-MTase*->Thymidylate_synthase-> N6-MTase - HMPREF9420_2325 284 bacteria>bacteroidetes Prevotella salivae DSM 15606 hypothetical protein HMPREF9420_2325 [Prevotella salivae DSM 15606]. 315663841_?-><-315663842_?<-315663843_?||315663778_?->315663779_?->315663780_?->315663781_?->315663782_N6-MTase*->315663783_Thymidylate_synthase->315663784_?->315663785_?->315663786_?-><-315663787_?||315663788_?->315663789_?-> 443550816 tail_3->N6-MTase+Phage-tailfib*-> N6-MTase+Phage-tailfib SP A160_0967 284 bacteria>proteobacteria>gammaproteobacteria Aggregatibacter actinomycetemcomitans serotype a str. A160 hypothetical protein A160_0967 [Aggregatibacter actinomycetemcomitans serotype a str. A160]. 443550809_?->443550810_?->443550811_?->443550812_?->443550813_?->443550814_?->443550815_tail_3->443550816_N6-MTase+Phage-tailfib*-><-443550817_?<-443550818_?<-443550819_?<-443550820_?<-443550821_?<-443550822_?<-443550823_? 490455210 Thymidylate_synthase->N6-MTase*-> N6-MTase MethyltransfD12 BA92_RS10770 284 bacteria>bacteroidetes Bacteroides MULTISPECIES: DNA methyltransferase [Bacteroides]. 
492713924_?->753198973_?->490455216_?->490455215_?->490455214_?->490455213_?->494419100_Thymidylate_synthase->490455210_N6-MTase*-><-753198974_?||753196135_?->753196136_?->753196137_?->753196138_?->753198975_?->753199045_?-> 490456001 Thymidylate_synthase->N6-MTase*-> N6-MTase - HMPREF1070_RS05245 284 bacteria>bacteroidetes Bacteroides ovatus DNA methyltransferase [Bacteroides ovatus]. 490455991_?->490455993_?->490455995_?->696272652_?->696272653_?->490455999_?->696272654_Thymidylate_synthase->490456001_N6-MTase*->696272656_?->490456003_?->490456004_?->490425270_?->490425271_?-><-490425272_?<-490425273_? 492444862 <-STN+Cna_B_2+Plug||?-><-?<-?<-?||?-><-N6-MTase*<-Thymidylate_synthase N6-MTase MethyltransfD12 BN535_00547 284 bacteria>bacteroidetes Bacteroides MULTISPECIES: DNA methyltransferase [Bacteroides]. <-491935049_?<-547309720_STN+Cna_B_2+Plug||547309721_?-><-547309722_?<-491933064_?<-547309723_?||547309724_?-><-492444862_N6-MTase*<-492444861_Thymidylate_synthase<-492444860_?<-492444859_?<-492444858_?||547309725_?->547309726_?-> 492741740 STN+Cna_B_2+Plug-><-N6-MTase*<-Thymidylate_synthase N6-MTase MethyltransfD12 M125_RS18320 284 bacteria>bacteroidetes Bacteroides MULTISPECIES: DNA methyltransferase [Bacteroides]. 496045361_?->492280413_?->695547684_STN+Cna_B_2+Plug-><-492741740_N6-MTase*<-695547708_Thymidylate_synthase<-695547689_?<-492741743_?<-695547696_?<-757749882_?<-492741756_?<-757749883_? 495041624 <-N6-MTase*<-Thymidylate_synthase N6-MTase MethyltransfD12 HMPREF1057_RS0113675 284 bacteria>bacteroidetes Bacteroides finegoldii DNA methyltransferase [Bacteroides finegoldii]. <-495041624_N6-MTase*<-696271763_Thymidylate_synthase<-495041629_?<-696271765_?||495033235_?->495034690_?-><-490456521_?<-490456520_? 496037689 Thymidylate_synthase->N6-MTase*-> N6-MTase - HMPREF9007_RS09530 284 bacteria>bacteroidetes Bacteroides sp. 1_1_14 DNA methyltransferase [Bacteroides sp. 1_1_14]. 
496037681_?->496037682_?->696274161_?->496037684_?->496037687_?->490455213_?->696274162_Thymidylate_synthase->496037689_N6-MTase*->496037690_?->496037691_?->496037692_?->496037693_?->496037694_?->496037695_?->496037696_?-> 512184299 MuF->?->?->?->?->Thymidylate_synthase->N6-MTase*-><-?<-?<-Phage_tail_S N6-MTase MethyltransfD12 BN456_01886 284 bacteria>bacteroidetes Prevotella sp. CAG:1031 hypothetical protein [Prevotella sp. CAG:1031]. 512184292_?->512184293_MuF->512184294_?->512184295_?->512184296_?->512184297_?->512184298_Thymidylate_synthase->512184299_N6-MTase*-><-512184300_?<-512184301_?<-512184302_Phage_tail_S<-512184303_?<-512184328_?<-512184329_?<-512184330_? 545407693 <-Thymidylate_synthase - SP HMPREF1981_RS13185 284 bacteria>bacteroidetes Bacteroides pyogenes D12 class N6 adenine-specific DNA methyltransferase [Bacteroides pyogenes]. <-545407686_?<-545407687_?<-545407688_?<-748714848_?<-748714859_?<-545407690_?<-545407691_?<-545407693_?*<-545407694_Thymidylate_synthase<-545407695_?<-545407696_?<-545407697_?<-545407698_?<-545407699_?<-545407700_? 545595274 <-N6-MTase*<-?<-DUF4376 N6-MTase - AJF4211_RS12815 284 bacteria>proteobacteria>gammaproteobacteria Avibacterium paragallinarum Putative uncharacterized protein [Avibacterium paragallinarum]. 516417631_?->516417630_?->516417629_?->737727144_?-><-737727161_?||545595273_?->545595597_?-><-545595274_N6-MTase*<-737726748_?<-737726759_DUF4376<-545596245_? 545595679 Baseplate_J->Tail_P2_I->Collar->?->DUF4376->?->?->N6-MTase*-> N6-MTase - AJF4211_RS06820 284 bacteria>proteobacteria>gammaproteobacteria Avibacterium paragallinarum Putative uncharacterized protein [Avibacterium paragallinarum]. 
737726877_Baseplate_J->737726879_Tail_P2_I->737691486_Collar->648446893_?->737726882_DUF4376->737726885_?->545595678_?->545595679_N6-MTase*-><-545595597_?<-545595273_?||545595680_?-> 545595880 N6-MTase*-> N6-MTase - AJF4211_RS08790 284 bacteria>proteobacteria>gammaproteobacteria Avibacterium paragallinarum Putative uncharacterized protein [Avibacterium paragallinarum]. 545595880_N6-MTase*-><-545595597_?<-545595273_?||545595680_?->737691335_?->516418003_?->516418004_?->545595883_?-> 696270804 Thymidylate_synthase->N6-MTase*-><-?||?->?-><-?<-?<-?<-STN+Cna_B_2+Plug N6-MTase MethyltransfD12 M082_RS01650 284 bacteria>bacteroidetes Bacteroides MULTISPECIES: DNA methyltransferase [Bacteroides]. 757774896_?->496037684_?->757774897_?->696270757_?->696270759_?->490455213_?->696270802_Thymidylate_synthase->696270804_N6-MTase*-><-696270761_?||696270763_?->696270765_?-><-696270768_?<-495295222_?<-696270770_?<-696270806_STN+Cna_B_2+Plug 229455867 <-STN+Cna_B_2+Plug<-?||?-><-N6-MTase*<-Thymidylate_synthase N6-MTase - BSBG_02560 283 bacteria>bacteroidetes Bacteroides sp. 9_1_42FAA D12 class N6 adenine-specific DNA methyltransferase [Bacteroides sp. 9_1_42FAA]. 229455860_?-><-229455861_?<-229455862_?<-229455863_?<-229455864_STN+Cna_B_2+Plug<-229455865_?||229455866_?-><-229455867_N6-MTase*<-229455868_Thymidylate_synthase<-229455869_?<-229455870_?<-229455871_?||229455872_?-><-229455873_?<-229455874_? 490512514 N6-MTase*->Thymidylate_synthase->?-><-?<-?<-STN+Cna_B_2+Plug N6-MTase - HMPREF0665_RS09490 283 bacteria>bacteroidetes Prevotella oris DNA methyltransferase [Prevotella oris]. 
490512507_?->490512508_?->490512509_?->490512510_?->490512511_?->490512512_?->490512513_?->490512514_N6-MTase*->490512515_Thymidylate_synthase->739008727_?-><-490512516_?<-490512517_?<-748616038_STN+Cna_B_2+Plug||490512519_?->490512520_?-> 648594256 N6-MTase*->Thymidylate_synthase->?->STN+Cna_B_2+Plug-> N6-MTase - D468_RS0112575 283 bacteria>bacteroidetes Prevotella oris DNA methyltransferase [Prevotella oris]. 517750944_?->517750945_?->490512510_?->490512511_?->517750946_?->648594255_?->648594256_N6-MTase*->490512515_Thymidylate_synthase->739008727_?->517750949_STN+Cna_B_2+Plug->647603487_?->647603486_?->647603485_?->647603484_?-> 489886467 <-N6-MTase*<-?<-DUF4376<-?<-DUF2313<-Baseplate_J<-GP46<-Phage_Mu_Gp45 N6-MTase - KKB_RS07455 282 bacteria>proteobacteria>betaproteobacteria Kingella kingae hypothetical protein [Kingella kingae]. <-489886453_?<-489886455_?<-489886457_?||489886460_?-><-489886461_?<-489886463_?||489886465_?-><-489886467_N6-MTase*<-696250480_?<-696250481_DUF4376<-696250482_?<-489886475_DUF2313<-489886478_Baseplate_J<-489886480_GP46<-489886482_Phage_Mu_Gp45 491717013 N6-MTase+Phage-tailfib*-> N6-MTase+Phage-tailfib SP SCC393_RS02190 282 bacteria>proteobacteria>gammaproteobacteria Aggregatibacter actinomycetemcomitans hypothetical protein [Aggregatibacter actinomycetemcomitans]. 491716979_?-><-491716981_?||491716992_?->491717001_?->491717003_?->757650929_?->757650930_?->491717013_N6-MTase+Phage-tailfib*-><-757650923_?<-491717022_?<-491700764_?<-491717025_?<-491717030_?<-491700760_?<-757650924_? 491784781 tail_3->N6-MTase*-><-?||?-><-?<-?<-?<-ABC-ATPase N6-MTase - APPSER11_RS04990 282 bacteria>proteobacteria>gammaproteobacteria Actinobacillus pleuropneumoniae hypothetical protein [Actinobacillus pleuropneumoniae]. 491784768_?->491784770_?->491784772_?->491806154_?->491784776_?->491806156_?->491806158_tail_3->491784781_N6-MTase*-><-763113371_?||491784785_?-><-491784787_?<-491784789_?<-491784791_?<-491784793_ABC-ATPase<-491784795_? 
499301742 <-ParA||?->N6-MTase*-><-?||?-><-?<-?<-?<-STN+Cna_B_2+Plug N6-MTase MethyltransfD12 M080_RS26780 282 bacteria>bacteroidetes Bacteroides fragilis DNA methyltransferase [Bacteroides fragilis]. 492279352_?->492279349_?->492279345_?-><-492279342_?<-492279339_?<-492279337_ParA||492279331_?->499301742_N6-MTase*-><-657213291_?||492279325_?-><-492279322_?<-492279320_?<-492291773_?<-492279316_STN+Cna_B_2+Plug<-492291771_? 503933737 tail_3->N6-MTase+Phage-tailfib*-> N6-MTase+Phage-tailfib SP ANH9381_RS06760 282 bacteria>proteobacteria>gammaproteobacteria Aggregatibacter actinomycetemcomitans hypothetical protein [Aggregatibacter actinomycetemcomitans]. <-491690510_?<-491690512_?<-491762168_?||491690521_?->696438157_?->491736689_?->491736696_tail_3->503933737_N6-MTase+Phage-tailfib*-><-491755859_?||754504819_?-><-491731099_?<-491684677_?<-491731104_?<-491684680_?||491731108_?-> 504751701 <-N6-MTase* N6-MTase - B739_RS09680 282 bacteria>bacteroidetes Riemerella anatipestifer DNA methyltransferase [Riemerella anatipestifer]. 504751695_?->491057476_?->504751696_?->504751697_?->504751698_?->504751699_?->504751700_?-><-504751701_N6-MTase*<-504751703_?<-740908888_?<-740907934_?<-504751706_?<-504751707_?<-504751709_?<-504751710_? 517482436 <-N6-MTase* N6-MTase - C228_RS0112985 282 bacteria>proteobacteria>gammaproteobacteria Actinobacillus capsulatus hypothetical protein [Actinobacillus capsulatus]. <-517482436_N6-MTase*<-748200589_? 596095999 <-ParA||?->?->N6-MTase*->?-><-?||?-><-?||STN+Cna_B_2+Plug-> N6-MTase MethyltransfD12 M116_4685 282 bacteria>bacteroidetes Bacteroides fragilis str. 3719 A10 D12 class N6 adenine-specific DNA methyltransferase family protein [Bacteroides fragilis str. 3719 A10]. 
<-596095996_ParA||596095997_?->596095998_?->596095999_N6-MTase*->596096000_?-><-596096001_?||596096002_?-><-596096003_?||596096004_STN+Cna_B_2+Plug->596096005_?->596096006_?-> 640568267 <-N6-MTase* N6-MTase - JCM15124_RS09550 282 bacteria>bacteroidetes Prevotella falsenii DNA methyltransferase [Prevotella falsenii]. <-640568267_N6-MTase*<-640568268_?<-640568269_?<-640568270_?<-640568271_?<-640568272_?<-739003492_?<-640568274_? 696373063 <-VirD4-FtsK<-?<-LPD3+N4ART+N4ART+MPTase+MPTase<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC-> N6-MTase MethyltransfD12 BSEG_RS20295 282 bacteria>bacteroidetes Bacteroides dorei DNA methyltransferase [Bacteroides dorei]. <-696372985_?||495114778_?-><-495114781_VirD4-FtsK<-495114782_?<-495114784_LPD3+N4ART+N4ART+MPTase+MPTase<-495114785_?<-495114786_?<-696373063_N6-MTase*<-495114788_?||495114789_ParA->495114791_?->495114793_?->696372991_DOC-><-696372992_?<-495114800_? 696374681 <-STN+Cna_B_2+Plug<-VirD4-FtsK<-?<-LPD3+N4ART+N4ART+MPTase+MPTase<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC-> N6-MTase MethyltransfD12 M098_RS09095 282 bacteria>bacteroidetes Bacteroides vulgatus DNA methyltransferase [Bacteroides vulgatus]. <-696374562_?<-696374565_STN+Cna_B_2+Plug<-696374568_VirD4-FtsK<-495946037_?<-696374569_LPD3+N4ART+N4ART+MPTase+MPTase<-696374677_?<-696374574_?<-696374681_N6-MTase*<-495946968_?||696374577_ParA->696374581_?->696374583_?->696374585_DOC-><-696374588_?||696374592_?-> 737547081 N6-MTase*-> N6-MTase - HPS41_RS06910 282 bacteria>proteobacteria>gammaproteobacteria Haemophilus parasuis hypothetical protein [Haemophilus parasuis]. 737547081_N6-MTase*-><-737547084_?||737547091_?->737547093_?->737547086_?->491993831_?->737547088_?->491993827_?-> 738993231 <-N6-MTase* N6-MTase - HMPREF1475_RS02705 282 bacteria>bacteroidetes Prevotella oralis DNA methyltransferase [Prevotella oralis]. 
<-490504035_?<-490504034_?||490504033_?->490504032_?->490504030_?-><-490504029_?<-490504028_?<-738993231_N6-MTase*<-514976994_?||490504024_?-><-490504023_?<-490503678_?<-490504022_?<-490504021_?<-490504020_? 746108169 <-N6-MTase*<-?<-?||Phage_integrase-> N6-MTase - JP36_RS09335 282 bacteria>proteobacteria>gammaproteobacteria Gallibacterium genomosp. 1 hypothetical protein [Gallibacterium genomosp. 1]. 746108158_?->746108161_?->746108163_?->746068593_?->746108165_?->746108167_?->746108192_?-><-746108169_N6-MTase*<-746108171_?<-746108173_?||746108176_Phage_integrase->746108178_?-> 753848096 tail_3->N6-MTase*-> N6-MTase - HD_RS06430 282 bacteria>proteobacteria>gammaproteobacteria Haemophilus ducreyi hypothetical protein [Haemophilus ducreyi]. 499246981_?-><-499246982_?||499246983_?->499246984_?->499246990_?->499246991_?->499247853_tail_3->753848096_N6-MTase*-><-499247855_?<-499247856_?<-499247857_?<-499247859_?||499247860_?->499247861_?->499247862_?-> 229317608 <-N6-MTase* N6-MTase MethyltransfD12 POREN0001_0004 281 bacteria>bacteroidetes Porphyromonas endodontalis ATCC 35406 D12 class N6 adenine-specific DNA methyltransferase [Porphyromonas endodontalis ATCC 35406]. 229317606_?->229317610_?->229317612_?-><-229317608_N6-MTase*<-229317617_?<-229317615_?||229317607_?-><-229317609_?<-229317611_?<-229317616_?<-229317614_? 359359006 N6-MTase*-> N6-MTase - hia5 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae Hia5 [Haemophilus influenzae]. 359359005_?->359359006_N6-MTase*-> 491864737 GPW_gp25->Baseplate_J->Tail_P2_I->Collar->?->?->?->N6-MTase*-><-?||?->?->?->?->?->Collar-> N6-MTase - GGE_RS03480 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus haemolyticus hypothetical protein [Haemophilus haemolyticus]. 
491864725_GPW_gp25->491864727_Baseplate_J->491864729_Tail_P2_I->763375215_Collar->491864732_?->491864734_?->491864735_?->491864737_N6-MTase*-><-491864739_?||491864742_?->763375197_?->491864746_?->763375217_?->491864749_?->763375218_Collar-> 491953443 <-N6-MTase*<-?<-?<-?<-Collar<-DUF2313<-Baseplate_J<-GP46 N6-MTase - HMPREF9095_RS06800 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus MULTISPECIES: hypothetical protein [Haemophilus]. 494053311_?->491916408_?->494053312_?-><-494053313_?<-696248082_?||491910609_?->491864739_?-><-491953443_N6-MTase*<-491866182_?<-494053317_?<-494053318_?<-494053319_Collar<-696248122_DUF2313<-494053321_Baseplate_J<-494053322_GP46 492125251 tail_3->?->N6-MTase*-><-?||?->?->ABC-ATPase-> N6-MTase SP PMCN03_RS01910 281 bacteria>proteobacteria>gammaproteobacteria Pasteurella multocida hypothetical protein [Pasteurella multocida]. 492125273_?->504481140_?->504481139_?->492125264_?->504481138_?->643672669_tail_3->492125255_?->492125251_N6-MTase*-><-514165557_?||512748675_?->504091885_?->492023654_ABC-ATPase->492113843_?-><-643672685_?<-492023662_? 492143056 <-N6-MTase*<-?<-?<-?<-Collar<-Baseplate_J<-GPW_gp25<-Phage_base_V N6-MTase - HMPREF1052_RS08385 281 bacteria>proteobacteria>gammaproteobacteria Pasteurella bettyae hypothetical protein [Pasteurella bettyae]. 492143048_?->492143050_?-><-492142921_?||492143029_?-><-492142909_?||492143018_?->750314533_?-><-492143056_N6-MTase*<-750314506_?<-492143138_?<-492142949_?<-750314535_Collar<-492143012_Baseplate_J<-492143123_GPW_gp25<-492143004_Phage_base_V 494053240 <-Phage_sheath_1<-N6-MTase*<-?<-?<-?<-Collar<-?<-Baseplate_J N6-MTase - HMPREF9095_RS07250 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus aegyptius hypothetical protein [Haemophilus aegyptius]. 
696248080_?-><-494053233_?<-494053234_?<-494053237_?<-491890345_?<-494053238_?<-494053239_Phage_sheath_1<-494053240_N6-MTase*<-491866182_?<-494053242_?<-491901935_?<-494053243_Collar<-494053244_?<-494053245_Baseplate_J<-491852313_? 494789040 <-N6-MTase*<-?<-?<-?<-?<-Tail_P2_I<-Baseplate_J<-GPW_gp25 N6-MTase - HMPREF1128_RS04375 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus sputorum hypothetical protein [Haemophilus sputorum]. <-494789045_?<-494789378_?<-494789560_?<-494788858_?<-494788621_?<-494790093_?<-494789843_?<-494789040_N6-MTase*<-491961422_?<-494788844_?<-494788564_?<-494790005_?<-494789847_Tail_P2_I<-494788657_Baseplate_J<-494788891_GPW_gp25 503290984 <-N6-MTase*<-?<-?<-?<-Collar<-Tail_P2_I<-Baseplate_J<-GPW_gp25 N6-MTase - HIBPF_RS02220 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. 491889249_?->494053683_?->494053684_?->503290981_?->503290982_?->752488873_?->752488874_?-><-503290984_N6-MTase*<-752488875_?<-503290985_?<-503290986_?<-503290987_Collar<-491897168_Tail_P2_I<-503290988_Baseplate_J<-503290989_GPW_gp25 503292691 Baseplate_J->?->?->Collar->?->?->?->N6-MTase*->Phage_sheath_1-> N6-MTase - HICON_RS02315 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. 494053245_Baseplate_J->494053244_?->752489194_?->503292689_Collar->503292690_?->503290985_?->752489128_?->503292691_N6-MTase*->494053239_Phage_sheath_1->494053238_?->491890345_?->494053237_?->494053234_?->503292692_?-><-491914347_? 649508868 Thymidylate_synthase->N6-MTase*-> N6-MTase MethyltransfD12 M088_0657 281 bacteria>bacteroidetes Bacteroides ovatus str. 3725 D1 iv D12 class N6 adenine-specific DNA methyltransferase family protein [Bacteroides ovatus str. 3725 D1 iv]. 
649508861_?->649508862_?->649508863_?->649508864_?->649508865_?->649508866_?->649508867_Thymidylate_synthase->649508868_N6-MTase*-><-649508869_?||649508870_?->649508871_?->649508872_?-><-649508873_?<-649508874_?<-649508875_? 662243730 N6-MTase*->?->HNH->Terminase_SS->Terminase_LS-> N6-MTase SP SASC598J21_017980 281 bacteria>proteobacteria>betaproteobacteria Snodgrassella alvi SCGC AB-598-J21 hypothetical protein SASC598J21_017980 [Snodgrassella alvi SCGC AB-598-J21]. 662243723_?->662243724_?->662243725_?->662243726_?->662243727_?->662243728_?->662243729_?->662243730_N6-MTase*->662243731_?->662243732_HNH->662243733_Terminase_SS->662243734_Terminase_LS->662243735_?->662243736_?->662243737_?-> 696244941 GP46->Baseplate_J->DUF2313->Collar->?->?->?->N6-MTase*-> N6-MTase - CK45_RS04150 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. 696244953_GP46->696244951_Baseplate_J->696244949_DUF2313->696245023_Collar->696244947_?->696244945_?->696244943_?->696244941_N6-MTase*-><-696244940_?<-696244939_?||696244937_?-><-491880641_?||491907518_?-><-696244936_?<-696244934_? 696250595 GP46->Baseplate_J->DUF2313->?->?->N6-MTase*-> N6-MTase - HICG_RS06205 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. 696250594_?->491961409_?->491961411_GP46->491961413_Baseplate_J->491961415_DUF2313->491961417_?->491961422_?->696250595_N6-MTase*-><-696250610_?<-491961426_?<-491961428_?||491961431_?->491961436_?->491961439_?->491961442_?-> 746003746 <-Phage_integrase||?->?->N6-MTase*-> N6-MTase - JP35_RS03815 281 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. 
<-746003737_Phage_integrase||746003738_?->746003739_?->746003746_N6-MTase*-><-746003740_?<-746003741_?||746003742_?-><-746003747_?<-746003743_?||746003744_?->746003745_?-> 746007528 DUF4376->?->Phage_sheath_1-><-N6-MTase* N6-MTase - JP32_RS09880 281 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. 746007515_?->746007523_?-><-746007517_?<-746007519_?||746007525_DUF4376->746007521_?->746007527_Phage_sheath_1-><-746007528_N6-MTase* 746088554 <-N6-MTase*<-?<-?||Phage_integrase-> N6-MTase - IO45_RS00140 281 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. 746011065_?->746088543_?->545092680_?-><-746088554_N6-MTase*<-746088546_?<-746088550_?||746088556_Phage_integrase-><-746088553_? 746098344 GPW_gp25->Baseplate_J->Tail_P2_I->?-><-Phage_integrase||?->?->N6-MTase*-> N6-MTase - IO48_RS11150 281 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. 746089736_GPW_gp25->746098324_Baseplate_J->746098326_Tail_P2_I->746098340_?-><-746098342_Phage_integrase||746098328_?->746098330_?->746098344_N6-MTase*-><-503511892_?<-746098332_?<-746098346_?<-746003424_?<-503510518_?<-503510519_?||545092895_?-> 746100920 <-Phage_integrase||?->?->N6-MTase*-><-?<-?<-ABC-ATPase N6-MTase - JL04_RS11025 281 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. 746100909_?-><-746100912_Phage_integrase||545093172_?->545093173_?->746100920_N6-MTase*-><-545092600_?<-665836837_?<-746100915_ABC-ATPase||746100917_?-> 746131177 <-N6-MTase*<-?<-DUF4376<-?<-DUF2313<-Baseplate_J<-GP46<-Phage_Mu_Gp45 N6-MTase - ACEE_RS02875 281 bacteria>proteobacteria>gammaproteobacteria Actinobacillus equuli hypothetical protein [Actinobacillus equuli]. 
<-746131168_?<-746131169_?<-746131170_?<-746133702_?<-746131172_?||746131174_?->746131176_?-><-746131177_N6-MTase*<-746131178_?<-746131180_DUF4376<-746133704_?<-746131183_DUF2313<-746131185_Baseplate_J<-746131194_GP46<-746131195_Phage_Mu_Gp45 748782878 GP46->Baseplate_J->DUF2313->Collar->?->?->?->N6-MTase*-> N6-MTase - W820_RS02320 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. 696246362_GP46->543996823_Baseplate_J->748782883_DUF2313->748782885_Collar->748782874_?->748782875_?->491866182_?->748782878_N6-MTase*-><-491864739_?<-748782880_? 756154060 Phage_base_V->GPW_gp25->Baseplate_J->Tail_P2_I->?->?->?->N6-MTase*-> N6-MTase - SU55_RS07055 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. 491897174_Phage_base_V->491897173_GPW_gp25->491897170_Baseplate_J->491897168_Tail_P2_I->756154059_?->491897340_?->491866182_?->756154060_N6-MTase*-><-491955477_?<-491880641_?||491880643_?-><-696242420_?||756154063_?-> 756154896 <-N6-MTase* N6-MTase - SU30_RS04070 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. <-491907958_?<-491909806_?<-491909802_?<-756151526_?||756151519_?-><-491909047_?||499591788_?-><-756154896_N6-MTase*<-491866182_?<-756154897_? 756163264 ABC-ATPase->ABC-ATPase->?->?-><-N6-MTase*<-?<-?<-?<-Collar<-DUF2313<-Baseplate_J<-GP46 N6-MTase - SU58_RS08535 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. 
756163260_?->527109474_?->491961245_?->756163261_ABC-ATPase->756163262_ABC-ATPase->491906242_?->756163263_?-><-756163264_N6-MTase*<-491866182_?<-756163265_?<-756163266_?<-756163321_Collar<-756163322_DUF2313<-756163267_Baseplate_J<-756163268_GP46 764389671 <-N6-MTase*<-?<-?<-?<-Collar<-DUF2313<-Baseplate_J<-GP46 N6-MTase - NTHI723_RS04270 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. <-740655046_?||491838320_?-><-740655049_?||740655051_?->740655053_?-><-764437855_?||764389667_?-><-764389671_N6-MTase*<-491866182_?<-764436837_?<-764389675_?<-764436842_Collar<-764389994_DUF2313<-764389683_Baseplate_J<-696246362_GP46 764738267 Phage_Mu_Gp45->GP46->Baseplate_J->DUF2313->Collar->DUF4376->?->N6-MTase*-><-?<-METHYLASE N6-MTase - BN1226_RS02290 281 bacteria>proteobacteria>gammaproteobacteria Mannheimia sp. MG13 hypothetical protein [Mannheimia sp. MG13]. 764737997_Phage_Mu_Gp45->764737999_GP46->764738003_Baseplate_J->764738005_DUF2313->764738263_Collar->764738265_DUF4376->764738006_?->764738267_N6-MTase*-><-764738269_?<-764738008_METHYLASE<-764738010_?<-764738270_?<-764738012_?<-764738013_?<-492134135_? 777210024 GP46->Baseplate_J->DUF2313->Collar->?->?->?->N6-MTase*-> N6-MTase - ERS450003_01064 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae D12 class N6 adenine-specific DNA methyltransferase [Haemophilus influenzae]. 777210017_GP46->777210018_Baseplate_J->777210019_DUF2313->777210020_Collar->777210021_?->777210022_?->777210023_?->777210024_N6-MTase*-><-777210025_?<-777210026_?<-777210027_?||777210028_?->777210029_?-><-777210030_?||777210031_?-> 803453319 GPW_gp25->Baseplate_J->Tail_P2_I->Collar->?->?->?->N6-MTase*-> N6-MTase - C645_RS00620 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. 
491897173_GPW_gp25->491897170_Baseplate_J->491897168_Tail_P2_I->803453317_Collar->491901935_?->803453318_?->491866182_?->803453319_N6-MTase*-><-803453320_?||803453656_?->491901278_?->491896093_?->491896090_?-><-491896087_?||491896082_?-> 803453531 GPW_gp25->Baseplate_J->Tail_P2_I->Collar->?->?->?->N6-MTase*-><-?||?->?-><-?<-?||?-><-ABC-ATPase N6-MTase - C645_RS06690 281 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. 803453528_GPW_gp25->491901962_Baseplate_J->491901959_Tail_P2_I->803453529_Collar->803453668_?->803453530_?->491866182_?->803453531_N6-MTase*-><-491864739_?||803453532_?->803453533_?-><-491877319_?<-499591645_?||696245907_?-><-491893936_ABC-ATPase 488719230 <-N6-MTase* N6-MTase - HMPREF9021_RS11285 280 bacteria>proteobacteria>betaproteobacteria Simonsiella muelleri hypothetical protein [Simonsiella muelleri]. <-488719230_N6-MTase*<-488717776_?<-750347722_?<-488717778_?<-488717779_?<-750346942_?<-750347723_? 492352476 <-ParA||?->N6-MTase*->?-><-?<-?||STN+Cna_B_2+Plug-> N6-MTase MethyltransfD12 M116_RS19700 280 bacteria>bacteroidetes Bacteroides fragilis DNA methyltransferase [Bacteroides fragilis]. <-695454433_ParA||492352478_?->492352476_N6-MTase*->492352474_?-><-492352472_?<-695337132_?||492352466_STN+Cna_B_2+Plug->492352464_?->492352462_?-><-695337131_? 493291112 N6-MTase*->METHYLASE-> N6-MTase - VK67_RS05530 280 bacteria>proteobacteria>gammaproteobacteria Mannheimia haemolytica hypothetical protein [Mannheimia haemolytica]. <-505297060_?<-493291101_?||493291104_?->493291105_?->493291108_?->493291109_?->493291110_?->493291112_N6-MTase*->493291113_METHYLASE->493291114_?->493291115_?-><-493291116_?||493291118_?-><-493291119_?<-493291120_? 501020456 <-N6-MTase* N6-MTase - ASUC_RS07260 280 bacteria>proteobacteria>gammaproteobacteria Actinobacillus succinogenes hypothetical protein [Actinobacillus succinogenes]. 
<-501020449_?||501020450_?->501020451_?-><-501020452_?||501020453_?->501020454_?->501020455_?-><-501020456_N6-MTase*<-501020458_?<-501020459_?||501020460_?-><-501020461_?||501020462_?->501020463_?-><-501020464_? 525759492 N6-MTase*-> N6-MTase - J450_RS04260 280 bacteria>proteobacteria>gammaproteobacteria Mannheimia haemolytica hypothetical protein [Mannheimia haemolytica]. 525759486_?->493295450_?->493293312_?->525759488_?->525759489_?->525759490_?->525759491_?->525759492_N6-MTase*->696267597_?->525759493_?->493293694_?-><-493292220_?<-493292218_?<-493294463_?<-493294462_? 544657538 <-N6-MTase*<-DCM N6-MTase MethyltransfD12 ATCC51562_RS05210 280 bacteria>proteobacteria>epsilonproteobacteria Campylobacter concisus hypothetical protein [Campylobacter concisus]. <-544657507_?<-737181490_?<-737181469_?<-544657524_?<-737181491_?<-544657530_?<-737181471_?<-544657538_N6-MTase*<-544657522_DCM<-544657549_?<-737181492_?<-544657545_?||544657510_?-><-544657536_?||737181493_?-> 544865513 <-N6-MTase* N6-MTase - L278_RS124350 280 bacteria>proteobacteria>gammaproteobacteria Mannheimia haemolytica hypothetical protein [Mannheimia haemolytica]. 493293008_?->544865512_?->525759340_?->493290115_?-><-493290116_?<-493290117_?<-696267597_?<-544865513_N6-MTase*<-544865514_?<-493291109_?<-544865515_?<-544865518_?<-544865520_?||493293688_?-><-493292580_? 544865770 GP46->Baseplate_J->DUF2313->?->DUF4376->?->N6-MTase*-> N6-MTase - L278_RS122210 280 bacteria>proteobacteria>gammaproteobacteria Mannheimia haemolytica hypothetical protein [Mannheimia haemolytica]. 544865763_?->544865764_GP46->544865765_Baseplate_J->544865766_DUF2313->544865767_?->544865768_DUF4376->544865769_?->544865770_N6-MTase*-><-544865771_?<-544865772_?||544865773_?->493296028_?-><-493296027_?||493294126_?->544865774_?-> 639782857 <-METHYLASE||?->?-><-?||?-><-N6-MTase*<-?<-?<-?<-Collar N6-MTase - TMA01S_RS05515 280 bacteria>bacteroidetes Tenacibaculum maritimum DNA methyltransferase [Tenacibaculum maritimum]. 
639782849_?->639782851_?-><-639782852_METHYLASE||639782853_?->740182981_?-><-639782855_?||639782856_?-><-639782857_N6-MTase*<-639782858_?<-639782859_?<-639782861_?<-639782863_Collar<-740182974_?<-639782867_?<-639782868_? 695294566 <-ParA||?->N6-MTase*-> N6-MTase MethyltransfD12 M127_RS12840 280 bacteria>bacteroidetes Bacteroides MULTISPECIES: DNA methyltransferase [Bacteroides]. 492291783_?->695339004_?->695339060_?-><-695400500_?<-496602564_?<-499301741_ParA||492279331_?->695294566_N6-MTase*->492279325_?-><-492279322_? 695330037 STN+Cna_B_2+Plug->?->?->?-><-?||?-><-N6-MTase*<-?||ParA-> N6-MTase MethyltransfD12 HMPREF1079_RS0100985 280 bacteria>bacteroidetes Bacteroides fragilis DNA methyltransferase [Bacteroides fragilis]. 492291771_?->492279316_STN+Cna_B_2+Plug->492291773_?->492279320_?->492279322_?-><-492279325_?||657213291_?-><-695330037_N6-MTase*<-492279331_?||492291778_ParA->492279339_?->492291779_?->492291780_?-><-492291781_?<-492291782_? 695344948 MuF->?->?->?->?->?->Thymidylate_synthase->N6-MTase+Phage-tailfib*-><-?<-Phage_tail_S N6-MTase+Phage-tailfib SP BFAG_RS07280 280 bacteria>bacteroidetes Bacteroides fragilis DNA methyltransferase [Bacteroides fragilis]. 695344945_MuF->492223777_?->695344946_?->492223780_?->492223783_?->492223786_?->695344947_Thymidylate_synthase->695344948_N6-MTase+Phage-tailfib*-><-492223794_?<-492223795_Phage_tail_S<-695344949_?<-695344950_?<-492223806_?<-695344951_?<-492223813_? 695540882 <-ParA<-?||N6-MTase*-><-?||?-><-?<-?<-?<-STN+Cna_B_2+Plug N6-MTase MethyltransfD12 M070_RS00960 280 bacteria>bacteroidetes Bacteroides fragilis DNA methyltransferase [Bacteroides fragilis]. 492279352_?->492279349_?->492279345_?-><-492279342_?<-492279339_?<-492279337_ParA<-515708972_?||695540882_N6-MTase*-><-515708970_?||492279325_?-><-492279322_?<-496602556_?<-492279318_?<-695430123_STN+Cna_B_2+Plug<-492291771_? 
696234173 <-VirD4-FtsK<-?<-?<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC-> N6-MTase MethyltransfD12 C799_RS11490 280 bacteria>bacteroidetes Bacteroides thetaiotaomicron DNA methyltransferase [Bacteroides thetaiotaomicron]. 511014028_?->696233857_?-><-511014030_VirD4-FtsK<-696234170_?<-696234172_?<-511014033_?<-511014034_?<-696234173_N6-MTase*<-511014036_?||696234174_ParA->511014038_?->511014039_?->511014040_DOC->696234175_?-><-511014042_? 258520722 <-N6-MTase* N6-MTase SP HMPREF0198_0362 279 bacteria>proteobacteria>gammaproteobacteria Cardiobacterium hominis ATCC 15826 hypothetical protein HMPREF0198_0362 [Cardiobacterium hominis ATCC 15826]. <-258520715_?<-258520716_?<-258520717_?<-258520718_?<-258520719_?||258520720_?->258520721_?-><-258520722_N6-MTase*||258520723_?-><-258520649_?<-258520650_?||258520651_?->258520652_?->258520653_?->258520654_?-> 488141552 <-N6-MTase*<-?<-?<-?<-DUF2313<-Baseplate_J<-Baseplate_J<-GP46 N6-MTase - NMA510612_RS09285 279 bacteria>proteobacteria>betaproteobacteria Neisseria meningitidis hypothetical protein [Neisseria meningitidis]. <-496710478_?||488147750_?-><-504393489_?||488175119_?->488175120_?->488175121_?-><-488147744_?<-488141552_N6-MTase*<-488141551_?<-488141550_?<-488175122_?<-728043483_DUF2313<-488141547_Baseplate_J<-488141546_Baseplate_J<-488141544_GP46 488182095 <-N6-MTase*<-?<-?<-?<-?<-DUF2313<-Baseplate_J<-Baseplate_J N6-MTase - NM70082_RS106455 279 bacteria>proteobacteria>betaproteobacteria Neisseria meningitidis D12 class N6 adenine-specific DNA methyltransferase family protein [Neisseria meningitidis]. 488163872_?-><-488147748_?||488147747_?->728042758_?->488149296_?->488166023_?-><-488147744_?<-488182095_N6-MTase*<-488141551_?<-488141550_?<-488166024_?<-488166025_?<-728043483_DUF2313<-488141547_Baseplate_J<-488141546_Baseplate_J 488717644 <-N6-MTase* N6-MTase - HMPREF9021_RS03005 279 bacteria>proteobacteria>betaproteobacteria Simonsiella muelleri hypothetical protein [Simonsiella muelleri]. 
488717637_?->488717638_?->488717639_?->488717640_?->488717641_?->750346891_?->488717643_?-><-488717644_N6-MTase*<-750346893_?<-488717647_?<-488717648_?<-488717649_?<-488717650_?<-488717651_?<-488717652_? 492281093 Phage_tail_S->?-><-N6-MTase*<-Thymidylate_synthase<-?<-?<-?<-?<-MuF N6-MTase - M137_RS14330 279 bacteria>bacteroidetes Bacteroidales MULTISPECIES: DNA methyltransferase [Bacteroidales]. 495943786_?->492281109_?->492281107_?->492281103_?->695409052_?->492281099_Phage_tail_S->492281096_?-><-492281093_N6-MTase*<-492281092_Thymidylate_synthase<-495896300_?<-492281090_?<-495896298_?<-492281087_?<-492281085_MuF<-492281083_? 495950973 Phage_tail_S->?-><-N6-MTase*<-Thymidylate_synthase<-?<-?<-?<-?<-MuF N6-MTase - BSBG_RS01095 279 bacteria>bacteroidetes Bacteroides sp. 9_1_42FAA DNA methyltransferase [Bacteroides sp. 9_1_42FAA]. 495950962_?->495950963_?->495950965_?->696359225_?->495950968_?->696359215_Phage_tail_S->495950972_?-><-495950973_N6-MTase*<-495950975_Thymidylate_synthase<-496053796_?<-495950980_?<-696359216_?<-495950989_?<-696359226_MuF<-696359218_? 496053795 Phage_tail_S->?-><-N6-MTase*<-Thymidylate_synthase<-?<-?<-?<-?<-MuF N6-MTase - HMPREF9008_RS11875 279 bacteria>bacteroidetes Parabacteroides sp. 20_3 DNA methyltransferase [Parabacteroides sp. 20_3]. 496053793_?->495950963_?->495950965_?->696359225_?->495950968_?->736514586_Phage_tail_S->495950972_?-><-496053795_N6-MTase*<-495950975_Thymidylate_synthase<-496053796_?<-496053797_?<-496053798_?<-495950989_?<-696359226_MuF<-696359218_? 496057734 Phage_tail_S->?-><-N6-MTase*<-Thymidylate_synthase<-?<-?<-?<-?<-MuF N6-MTase - HMPREF9011_RS05730 279 bacteria>bacteroidetes Bacteroides sp. 3_1_40A DNA methyltransferase [Bacteroides sp. 3_1_40A]. 496057730_?->492281109_?->492281107_?->496057731_?->492281101_?->496057732_Phage_tail_S->496057733_?-><-496057734_N6-MTase*<-496057735_Thymidylate_synthase<-696377523_?<-496057737_?<-496057738_?<-496057739_?<-696377528_MuF<-496057741_? 
496464403 Phage_Mu_Gp45->GP46->Baseplate_J->DUF2313->?->?->N6-MTase*-> N6-MTase - HMPREF9016_RS00975 279 bacteria>proteobacteria>betaproteobacteria Neisseria sp. oral taxon 014 hypothetical protein [Neisseria sp. oral taxon 014]. 496464397_?->496464398_Phage_Mu_Gp45->488141544_GP46->748592186_Baseplate_J->496464400_DUF2313->496464401_?->496464402_?->496464403_N6-MTase*-><-496464404_?||496464405_?->496464406_?->748592187_?->748592188_?->496464409_?->496464410_?-> 500646766 <-ParA||?->N6-MTase*->?-><-?||?->STN+Cna_B_2+Plug-> N6-MTase MethyltransfD12 BVU_RS04850 279 bacteria>bacteroidetes Bacteroides vulgatus DNA methyltransferase [Bacteroides vulgatus]. 500646760_?->500646761_?->500646762_?-><-500646763_?<-500646764_?<-500646765_ParA||495946968_?->500646766_N6-MTase*->752488389_?-><-752488390_?||752488391_?->500646770_STN+Cna_B_2+Plug->500646771_?->500646772_?->500646773_?-> 504603820 <-Thymidylate_synthase<-N6-MTase* N6-MTase - ORNRH_RS05640 279 bacteria>bacteroidetes Ornithobacterium rhinotracheale DNA methyltransferase [Ornithobacterium rhinotracheale]. <-504603116_?||504603814_?->504603815_?->504603816_?->738702467_?->754917207_?-><-504603819_Thymidylate_synthase<-504603820_N6-MTase*<-504603821_?<-754917212_?<-504603823_?<-504603824_?<-504603825_?<-504603826_?<-754917215_? 514429566 <-N6-MTase*<-?<-?<-DUF2313<-Baseplate_J<-GP46<-Phage_Mu_Gp45 N6-MTase SP P1062_RS03850 279 bacteria>proteobacteria>gammaproteobacteria Pasteurella multocida hypothetical protein [Pasteurella multocida]. <-492015334_?||504092484_?->512748257_?->492027804_?->504092482_?->492027802_?->757461359_?-><-514429566_N6-MTase*<-514429567_?<-757461361_?<-757461140_DUF2313<-514165916_Baseplate_J<-514165917_GP46<-514165918_Phage_Mu_Gp45<-757461141_? 514429666 N6-MTase*-> N6-MTase SP P1062_RS06555 279 bacteria>proteobacteria>gammaproteobacteria Pasteurella multocida hypothetical protein [Pasteurella multocida]. 
<-492019952_?||492126203_?->514165526_?-><-514165525_?||514165524_?->514429660_?->757461336_?->514429666_N6-MTase*-><-492026848_?<-504092255_?<-492020358_?<-504092254_?||492026860_?->492020596_?->512748065_?-> 518349657 N6-MTase*-> N6-MTase MethyltransfD12 A3G3_RS0107195 279 bacteria>proteobacteria>gammaproteobacteria Moraxella boevrei hypothetical protein [Moraxella boevrei]. 750349540_?->750349543_?->518349655_?->750349546_?->518349657_N6-MTase*-><-518349658_?<-518349659_?<-518349660_?||648521159_?->518349662_?->518349663_?->518349665_?-> 647518976 MuF->?->?->?->?->?->N6-MTase*-><-Phage_tail_S N6-MTase - JCM12083_RS06185 279 bacteria>bacteroidetes Prevotella shahii DNA methyltransferase [Prevotella shahii]. 647518963_?->763201846_MuF->647518967_?->647518969_?->647518972_?->763201833_?->647518974_?->647518976_N6-MTase*-><-647518978_Phage_tail_S<-647518980_?<-647518983_?<-647518986_?<-647518988_?<-763201835_?<-763201837_? 750347144 <-N6-MTase* N6-MTase MethyltransfD12 HMPREF9021_RS06670 279 bacteria>proteobacteria>betaproteobacteria Simonsiella muelleri hypothetical protein [Simonsiella muelleri]. <-750347216_?<-488718372_?<-488718373_?<-488718374_?<-488718375_?<-750347218_?<-750347220_?<-750347144_N6-MTase*<-488718379_?<-488718380_?<-750347146_?<-488718381_?||488718382_?->750347222_?->488718384_?-> 810414634 N6-MTase*-> N6-MTase SP I926_RS02325 279 bacteria>proteobacteria>gammaproteobacteria Pasteurella multocida hypothetical protein [Pasteurella multocida]. <-810414619_?||810414621_?-><-810414623_?||810414625_?->810422398_?->810414628_?->810414631_?->810414634_N6-MTase*->810422401_?-><-810414636_?<-810414639_?||810414642_?->810414644_?->810414646_?->810414648_?-> 343968128 <-N6-MTase*<-?<-?<-?<-Caudo_TAP<-Collar<-DUF2313<-Baseplate_J N6-MTase SP l11_17040 278 bacteria>proteobacteria>betaproteobacteria Neisseria weaveri LMG 5135 hypothetical protein l11_17040 [Neisseria weaveri LMG 5135]. 
<-343968128_N6-MTase*<-343968129_?<-343968130_?<-343968131_?<-343968132_Caudo_TAP<-343968133_Collar<-343968134_DUF2313<-343968135_Baseplate_J 490416379 <-LPD3+N4ART+N4ART+MPTase+MPTase<-?<-?<-N6-MTase*<-?||ParA->?->?->DOC-> N6-MTase MethyltransfD12 HMPREF1181_RS12035 278 bacteria>bacteroidetes Bacteroides MULTISPECIES: DNA methyltransferase [Bacteroides]. 491885528_?->491885531_?->491885534_?-><-514974061_?<-514974062_LPD3+N4ART+N4ART+MPTase+MPTase<-514974063_?<-514974064_?<-490416379_N6-MTase*<-490416378_?||514974065_ParA->514974066_?->514974067_?->514974068_DOC->514974069_?-><-514974070_? 496519123 N6-MTase*->?-><-?<-?||?->?->ABC-ATPase-> N6-MTase - HMPREF0669_RS04845 278 bacteria>bacteroidetes Prevotella sp. oral taxon 299 hypothetical protein [Prevotella sp. oral taxon 299]. 496519116_?->496519117_?->763165970_?->763166059_?->496519120_?->532473657_?->763165972_?->496519123_N6-MTase*->763166060_?-><-496519125_?<-496519126_?||496519127_?->496519128_?->496519129_ABC-ATPase->496519130_?-> 501302336 GP46->Baseplate_J->DUF2313->Collar->DUF4376->?->DCM->N6-MTase*-><-?<-?<-ABC-ATPase N6-MTase - HSM_RS03540 278 bacteria>proteobacteria>gammaproteobacteria Histophilus somni hypothetical protein [Histophilus somni]. 501302328_GP46->501302329_Baseplate_J->501302330_DUF2313->753849627_Collar->501302332_DUF4376->501302333_?->501302334_DCM->501302336_N6-MTase*-><-501302337_?<-753849413_?<-753849629_ABC-ATPase<-501302340_?<-501302341_?<-501302342_?<-501302343_? 503362551 <-McrC<-McrB<-?<-?<-?<-?<-Thymidylate_synthase<-N6-MTase*<-?<-?<-?<-?<-?<-Collar N6-MTase MethyltransfD12 WEEVI_RS00470 278 bacteria>bacteroidetes Weeksella virosa DNA methyltransferase [Weeksella virosa]. <-503362544_McrC<-754544231_McrB<-503362546_?<-503362547_?<-754544048_?<-503362549_?<-503362550_Thymidylate_synthase<-503362551_N6-MTase*<-503362552_?<-503362553_?<-754544049_?<-503362555_?<-503362556_?<-503362557_Collar<-754544232_? 
503512750 Baseplate_J->Tail_P2_I->?-><-?<-?||DUF4376->?->N6-MTase*-> N6-MTase - UMN179_RS12515 278 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. 503512743_Baseplate_J->503512744_Tail_P2_I->762905620_?-><-762905621_?<-503512747_?||762905622_DUF4376->503512749_?->503512750_N6-MTase*->503512751_?->503512752_?->503512753_?-><-503512754_?<-503512755_?<-503512756_?<-503512757_? 517157783 Baseplate_J->Tail_P2_I->?->Collar->Caudo_TAP->DUF4376->?->N6-MTase*-> N6-MTase - IE01_RS08000 278 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. 517157788_Baseplate_J->517157787_Tail_P2_I->746085363_?->648519298_Collar->746085367_Caudo_TAP->746085370_DUF4376->517157784_?->517157783_N6-MTase*->517157782_?->517157781_?->517157780_?-><-517157779_?<-517157778_?<-746085373_?<-517157776_? 547931225 <-N6-MTase* N6-MTase - BN590_01677 278 bacteria>bacteroidetes Alistipes sp. CAG:29 d12 class N6 adenine-specific DNA methyltransferase [Alistipes sp. CAG:29]. <-547931222_?<-547931223_?<-547931224_?<-547931225_N6-MTase*<-547931226_?<-547931227_?<-547931228_? 640570678 N6-MTase*->Thymidylate_synthase-> N6-MTase MethyltransfD12 JCM15754_RS11780 278 bacteria>bacteroidetes Prevotella aurantiaca DNA methyltransferase [Prevotella aurantiaca]. 640570671_?->640570672_?->640570673_?->640570674_?->640570675_?->640570676_?->640570677_?->640570678_N6-MTase*->640570679_Thymidylate_synthase-> 736169879 <-N6-MTase* N6-MTase - Q338_RS01810 278 bacteria>proteobacteria>betaproteobacteria Alysiella crassa hypothetical protein [Alysiella crassa]. <-736169847_?<-736169851_?<-736169855_?<-736169861_?||736169865_?->736169871_?->736169875_?-><-736169879_N6-MTase*<-736169883_?<-736170079_?<-736169888_?<-736169894_?<-736170083_?<-736170090_?<-736169898_? 
737726745 ABC-ATPase-><-?||?->?-><-N6-MTase*<-DCM N6-MTase - AJF4211_RS10020 278 bacteria>proteobacteria>gammaproteobacteria Avibacterium paragallinarum hypothetical protein [Avibacterium paragallinarum]. 516416440_?-><-737727047_?<-648446740_?||516416433_ABC-ATPase-><-737727048_?||545595273_?->545595597_?-><-737726745_N6-MTase*<-737726757_DCM<-516418180_?<-516416426_?<-516416425_?<-516416424_?<-516416423_?<-516416422_? 737726850 <-N6-MTase*<-DCM<-?<-DUF4376<-Collar N6-MTase - AJF4211_RS06190 278 bacteria>proteobacteria>gammaproteobacteria Avibacterium paragallinarum hypothetical protein [Avibacterium paragallinarum]. <-516417945_?<-737726848_?<-545595613_?||516417941_?->516417940_?->737726843_?->545595615_?-><-737726850_N6-MTase*<-737726757_DCM<-516418180_?<-737726746_DUF4376<-545595617_Collar 746010293 Baseplate_J->Tail_P2_I->?-><-?<-?||DUF4376->?->N6-MTase*-> N6-MTase - JP28_RS09245 278 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. 746010285_Baseplate_J->746010287_Tail_P2_I->746010297_?-><-746010289_?<-503512747_?||746010298_DUF4376->746010291_?->746010293_N6-MTase*->746010295_?-> 746011652 <-N6-MTase*<-DCM<-?<-?<-?||Phage_integrase-> N6-MTase - JP34_RS00795 278 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. <-746011641_?<-746011644_?<-746011645_?||746011647_?->746011648_?->746011649_?->746011651_?-><-746011652_N6-MTase*<-746011653_DCM<-746011654_?<-746011655_?<-746011657_?||746011659_Phage_integrase->746011660_?-><-746011834_? 746067969 <-N6-MTase*<-?<-DUF4376<-?<-Tail_P2_I<-Baseplate_J N6-MTase - P375_RS07850 278 bacteria>proteobacteria>gammaproteobacteria Gallibacterium genomosp. 2 hypothetical protein [Gallibacterium genomosp. 2]. 
746067963_?->746067964_?->503512755_?->746067965_?-><-746067966_?<-746067967_?<-746067968_?<-746067969_N6-MTase*<-746067970_?<-746068026_DUF4376<-746068028_?<-746067971_Tail_P2_I<-746067972_Baseplate_J<-746067973_?<-746067974_? 746089913 <-N6-MTase*<-?<-DUF4376 N6-MTase - IO46_RS12295 278 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. 746010848_?->746010845_?->746010843_?->746010841_?-><-746010840_?<-746010838_?<-757676485_?<-746089913_N6-MTase*<-746089910_?<-746089943_DUF4376||757676487_?->757676488_?-> 746094831 Baseplate_J->Tail_P2_I->?-><-?<-?||DUF4376->?->N6-MTase*-> N6-MTase - JP33_RS07160 278 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. 746094824_Baseplate_J->746094826_Tail_P2_I->746094868_?-><-746010289_?<-503512747_?||746094869_DUF4376->746094829_?->746094831_N6-MTase*->746094834_?->746094836_?->746094840_?-><-746094842_?<-746094845_?<-746094848_?<-746094849_? 756151906 <-N6-MTase* N6-MTase - SU30_RS01820 278 bacteria>proteobacteria>gammaproteobacteria Haemophilus influenzae hypothetical protein [Haemophilus influenzae]. <-756151912_?<-491873780_?<-491853492_?<-491891724_?<-491874530_?<-491906619_?<-756151908_?<-756151906_N6-MTase*<-491866182_?||756151904_?-><-756151952_?||491915004_?-><-491849814_?||491906642_?-><-491906646_? 763375484 <-N6-MTase*<-?<-?<-?<-?<-Collar<-DUF2313<-Baseplate_J N6-MTase - GGE_RS05405 278 bacteria>proteobacteria>gammaproteobacteria Haemophilus haemolyticus hypothetical protein [Haemophilus haemolyticus]. 
<-491782491_?<-491782456_?<-696247400_?<-491865825_?<-491865829_?<-763375481_?<-763375528_?<-763375484_N6-MTase*<-763375487_?<-491865842_?<-491865844_?<-491865847_?<-763375490_Collar<-491865850_DUF2313<-491865853_Baseplate_J 805420685 <-N6-MTase*<-DCM N6-MTase - Z012_RS09750 278 bacteria>proteobacteria>gammaproteobacteria Avibacterium paragallinarum hypothetical protein [Avibacterium paragallinarum]. 805420673_?-><-516417306_?||805420675_?->805420677_?->805420679_?->805420681_?->805420683_?-><-805420685_N6-MTase*<-805420687_DCM 385696246 Collar->?->?->?->?->?->?->N6-MTase*-> N6-MTase - HMPREF1054_1309 277 bacteria>proteobacteria>gammaproteobacteria Haemophilus paraphrohaemolyticus HK411 D12 class N6 adenine-specific DNA methyltransferase [Haemophilus paraphrohaemolyticus HK411]. 385696272_Collar->385696276_?->385696219_?->385696212_?->385696243_?->385696245_?->385696259_?->385696246_N6-MTase*->385696214_?->385696239_?->385696213_?->385696235_?->385696229_?->385696230_?->385696251_?-> 489754581 <-Portal<-?<-Phage_capsid<-?<-?<-N6-MTase*<-?<-Terminase_LS<-?<-Terminase_SS<-HNH N6-MTase - E9G_RS00410 277 bacteria>proteobacteria>gammaproteobacteria Moraxella catarrhalis hypothetical protein [Moraxella catarrhalis]. <-489754569_?<-489754571_?<-489754573_Portal<-489754574_?<-489754575_Phage_capsid<-738393062_?<-489754580_?<-489754581_N6-MTase*<-489754582_?<-489754584_Terminase_LS<-489754585_?<-489754589_Terminase_SS<-489754590_HNH<-489754591_?<-489754592_? 489757769 <-Portal<-?<-Phage_capsid<-?<-?<-N6-MTase*<-?<-Terminase_LS<-?<-Terminase_SS<-HNH N6-MTase - E9K_RS05470 277 bacteria>proteobacteria>gammaproteobacteria Moraxella catarrhalis hypothetical protein [Moraxella catarrhalis]. <-489757759_?<-489757760_?<-489757762_Portal<-489754574_?<-489757764_Phage_capsid<-738383218_?<-489757768_?<-489757769_N6-MTase*<-489754582_?<-738431723_Terminase_LS<-489754585_?<-489757777_Terminase_SS<-489757779_HNH<-489757780_?<-489757782_? 
489767871 <-Portal<-?<-Phage_capsid<-?<-?<-N6-MTase*<-?<-Terminase_LS<-?<-Terminase_SS<-HNH N6-MTase - E9U_RS07920 277 bacteria>proteobacteria>gammaproteobacteria Moraxella catarrhalis hypothetical protein [Moraxella catarrhalis]. <-489757759_?<-489757760_?<-489757762_Portal<-489754574_?<-489767867_Phage_capsid<-738383218_?<-489757768_?<-489767871_N6-MTase*<-489767873_?<-738383198_Terminase_LS<-489754585_?<-489757777_Terminase_SS<-489767877_HNH<-489767879_?<-489767881_? 491999424 DAM+DAM->?->?-><-?<-?<-?||?-><-N6-MTase* N6-MTase - SVR5_RS07195 277 bacteria>proteobacteria>gammaproteobacteria Haemophilus parasuis hypothetical protein [Haemophilus parasuis]. 491999129_DAM+DAM->491999132_?->544677724_?-><-491999431_?<-491999429_?<-491999427_?||491999425_?-><-491999424_N6-MTase* 492141771 <-N6-MTase*<-?<-DUF4376 N6-MTase - HMPREF1052_RS06075 277 bacteria>proteobacteria>gammaproteobacteria Pasteurella bettyae hypothetical protein [Pasteurella bettyae]. <-492141771_N6-MTase*<-750314401_?<-492141772_DUF4376 493305395 RadC-><-?<-?<-?<-?<-?<-N6-MTase* N6-MTase SP+MethyltransfD12 HMPREF9715_RS04510 277 bacteria>bacteroidetes Myroides odoratimimus DNA methyltransferase [Myroides odoratimimus]. <-738522835_?||493305389_RadC-><-493305390_?<-493305391_?<-493305392_?<-493305393_?<-493305394_?<-493305395_N6-MTase*<-493305397_?<-493305398_?<-493305399_?<-493305401_?<-493305402_?<-493305403_?<-493305405_? 494312007 <-Thymidylate_synthase<-N6-MTase* N6-MTase - HMPREF0645_RS12560 277 bacteria>bacteroidetes Prevotella bergensis DNA methyltransferase [Prevotella bergensis]. <-494312003_Thymidylate_synthase<-494312007_N6-MTase*<-763258955_?<-494312011_?<-763258950_?<-494312015_?<-763258952_?<-494312020_?<-763258956_? 494610799 <-Thymidylate_synthase<-N6-MTase* N6-MTase - HMPREF9141_RS12020 277 bacteria>bacteroidetes Prevotella multiformis DNA methyltransferase [Prevotella multiformis]. 
<-494610798_Thymidylate_synthase<-494610799_N6-MTase*<-494610800_?||494610801_?-><-494610802_?<-494610803_?<-494610804_?<-494610805_?<-750264791_? 517090945 MuF->?->?->?->?->N6-MTase*-> N6-MTase - CCYN49044_RS09840 277 bacteria>bacteroidetes Capnocytophaga cynodegmi hypothetical protein [Capnocytophaga cynodegmi]. 517090938_?->517090939_?->750049331_MuF->750049334_?->750049335_?->517090942_?->517090944_?->517090945_N6-MTase*-> 517158409 <-N6-MTase*<-DCM<-?<-?||Phage_integrase-> N6-MTase - IE01_RS11495 277 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. <-517158405_?<-648519377_?<-517158406_?||517158407_?-><-517158409_N6-MTase*<-648519378_DCM<-517158413_?<-517158414_?||517158415_Phage_integrase-> 545432296 N6-MTase*-> N6-MTase - JCM6334_RS11905 277 bacteria>bacteroidetes Prevotella disiens hypothetical protein [Prevotella disiens]. 545432296_N6-MTase*->545429855_?->545429854_?-><-640636938_? 644530078 <-N6-MTase*<-?<-DUF4376<-?<-DUF2313<-Baseplate_J<-GP46 N6-MTase - F543_RS02755 277 bacteria>proteobacteria>gammaproteobacteria Bibersteinia trehalosi hypothetical protein [Bibersteinia trehalosi]. <-505246074_?||740660613_?-><-505246072_?<-505246071_?<-505246069_?<-505246068_?<-505246067_?<-644530078_N6-MTase*<-644530083_?<-505246064_DUF4376<-644530088_?<-505246062_DUF2313<-505246061_Baseplate_J<-505246060_GP46<-505246059_? 647557435 <-Thymidylate_synthase<-N6-MTase* N6-MTase - JCM17725_RS06630 277 bacteria>bacteroidetes Prevotella scopos DNA methyltransferase [Prevotella scopos]. 647557418_?->647557420_?->647557422_?-><-647557426_?<-647557428_?||647557431_?-><-647557433_Thymidylate_synthase<-647557435_N6-MTase*<-647557437_?<-647557438_?<-647557440_?<-647557442_?<-647557444_?<-763207927_?<-647557448_? 654481515 N6-MTase*->?-><-?||?-><-?<-STN+Cna_B_2+Plug N6-MTase - L888_RS0101115 277 bacteria>bacteroidetes Hallella seregens DNA methyltransferase [Hallella seregens]. 
654481508_?->654481509_?->654481510_?->654481511_?->654481512_?->654481513_?->654481514_?->654481515_N6-MTase*->654481516_?-><-654481517_?||654481518_?-><-654481519_?<-654481520_STN+Cna_B_2+Plug||654481521_?->763391280_?-> 655519412 N6-MTase*-> N6-MTase - X919_RS0112015 277 bacteria>bacteroidetes Prevotella sp. HJM029 DNA methyltransferase [Prevotella sp. HJM029]. 655519405_?->655519406_?->655519407_?->655519408_?->655519409_?->655519410_?->655519411_?->655519412_N6-MTase*-><-655519413_?||739036906_?->655519415_?-> 736743227 N6-MTase*-> N6-MTase - IW16_RS16985 277 bacteria>bacteroidetes Chryseobacterium vrystaatense DNA methyltransferase [Chryseobacterium vrystaatense]. 736743211_?->736743213_?->736743215_?->736743217_?->736743219_?->736743221_?->736743224_?->736743227_N6-MTase*-><-736743230_?<-736743233_?||736743236_?->736743239_?->736743242_?->736743245_?->736743248_?-> 737547480 <-N6-MTase* N6-MTase - HPS41_RS09445 277 bacteria>proteobacteria>gammaproteobacteria Haemophilus parasuis hypothetical protein [Haemophilus parasuis]. <-492000594_?<-514062143_?<-491999443_?<-491999446_?<-514062144_?<-498485476_?||737547479_?-><-737547480_N6-MTase*<-737547481_? 738545947 <-N6-MTase*<-?<-?<-Caudo_TAP<-Collar<-DUF2313<-Baseplate_J<-GP46 N6-MTase SP L13_RS00005 277 bacteria>proteobacteria>betaproteobacteria Neisseria weaveri hypothetical protein [Neisseria weaveri]. <-738545947_N6-MTase*<-490412638_?<-490412639_?<-490412640_Caudo_TAP<-490412641_Collar<-490411891_DUF2313<-490411892_Baseplate_J<-490412644_GP46 746064999 <-Phage_integrase||?->?->?->DCM->N6-MTase*-> N6-MTase - P375_RS00515 277 bacteria>proteobacteria>gammaproteobacteria Gallibacterium genomosp. 2 hypothetical protein [Gallibacterium genomosp. 2]. <-746065027_?||746064987_?-><-746065030_Phage_integrase||746064990_?->746064993_?->746064996_?->746065033_DCM->746064999_N6-MTase*-><-746065002_?<-746065036_? 
746079094 <-N6-MTase*<-DCM<-?<-?<-?||Phage_integrase-> N6-MTase - JP30_RS08890 277 bacteria>proteobacteria>gammaproteobacteria Gallibacterium anatis hypothetical protein [Gallibacterium anatis]. <-746079074_?<-746079076_?<-746007122_?<-746079080_?<-746079084_?||746079087_?->746079090_?-><-746079094_N6-MTase*<-746079097_DCM<-746079102_?<-746079106_?<-746079109_?||746079113_Phage_integrase->746079116_?-> 750343301 <-Portal<-?<-Phage_capsid<-?<-N6-MTase*<-Terminase_LS<-Terminase_SS<-HNH N6-MTase - MOMA_RS09420 277 bacteria>proteobacteria>gammaproteobacteria Moraxella macacae hypothetical protein [Moraxella macacae]. <-497188035_?<-497188036_?<-497188037_?<-497188038_Portal<-497188039_?<-497188040_Phage_capsid<-750343521_?<-750343301_N6-MTase*<-497188044_Terminase_LS<-497188045_Terminase_SS<-497188046_HNH<-750343523_?<-497188048_?<-750342927_?<-750343524_? 750388073 <-N6-MTase*<-?<-?<-Caudo_TAP<-Collar<-DUF2313<-Baseplate_J<-GP46 N6-MTase - L11_RS07795 277 bacteria>proteobacteria>betaproteobacteria Neisseria weaveri hypothetical protein [Neisseria weaveri]. <-750388073_N6-MTase*<-490411886_?<-490411887_?<-490411889_Caudo_TAP<-490411890_Collar<-490411891_DUF2313<-490411892_Baseplate_J<-490411894_GP46 771514766 N6-MTase*-> N6-MTase - M573_RS10255 277 bacteria>bacteroidetes Prevotella intermedia DNA methyltransferase [Prevotella intermedia]. 771514769_?->771514770_?->771514761_?->771514762_?->771514763_?->771514764_?->771514765_?->771514766_N6-MTase*-> 806965486 <-Thymidylate_synthase<-N6-MTase+Phage-tailfib* N6-MTase+Phage-tailfib SP SU65_11745 277 bacteria>bacteroidetes Flavobacterium psychrophilum DNA methyltransferase [Flavobacterium psychrophilum]. 
<-806965479_?<-806965480_?<-806965481_?||806965482_?->806965483_?-><-806965484_?<-806965485_Thymidylate_synthase<-806965486_N6-MTase+Phage-tailfib*<-806965487_?<-806965488_?<-806965489_?<-806965490_?<-806965491_?<-806965492_?||806965493_?-> 261414145 <-METHYLASE<-?<-?<-?<-?<-?<-?<-N6-MTase*<-tail_3 N6-MTase - D11S_2165 276 bacteria>proteobacteria>gammaproteobacteria Aggregatibacter actinomycetemcomitans D11S-1 hypothetical protein D11S_2165 [Aggregatibacter actinomycetemcomitans D11S-1]. <-261414138_METHYLASE<-261414139_?<-261414140_?<-261414141_?<-261414142_?<-261414143_?<-261414144_?<-261414145_N6-MTase*<-261414146_tail_3<-261414147_?<-261414148_?<-261414149_?<-261414150_?<-261414151_?<-261414152_? 313137261 MuF->?->?->?->?->Thymidylate_synthase->N6-MTase*-><-?<-Phage_tail_S N6-MTase MethyltransfD12 BFAG_03319 276 bacteria>bacteroidetes Bacteroides fragilis 3_1_12 D12 class N6 adenine-specific DNA methyltransferase [Bacteroides fragilis 3_1_12]. 313137254_?->313137255_MuF->313137256_?->313137257_?->313137258_?->313137259_?->313137260_Thymidylate_synthase->313137261_N6-MTase*-><-313137262_?<-313137263_Phage_tail_S||313137264_?-><-313137265_?<-313137266_?<-313137267_?<-313137268_? 490468432 N6-MTase*->Thymidylate_synthase-> N6-MTase - PREBIDRAFT_RS00610 276 bacteria>bacteroidetes Prevotella bivia DNA methyltransferase [Prevotella bivia]. 490468423_?-><-738993203_?||490468429_?->490468432_N6-MTase*->490468435_Thymidylate_synthase-><-490465488_?<-490466789_?<-490466791_?<-490466793_?||490466795_?-><-490466797_? 491746110 <-N6-MTase*<-tail_3 N6-MTase - RHAA1_RS00240 276 bacteria>proteobacteria>gammaproteobacteria Aggregatibacter actinomycetemcomitans hypothetical protein [Aggregatibacter actinomycetemcomitans]. 491746101_?->491746102_?->491746103_?->491746104_?->491746105_?-><-491746107_?<-696425996_?<-491746110_N6-MTase*<-491746114_tail_3<-491746116_?<-491746118_?<-491746120_?<-491746122_?<-491746124_?<-491746126_? 
494223274 <-Thymidylate_synthase<-N6-MTase* N6-MTase - HMPREF9420_RS08510 276 bacteria>bacteroidetes Prevotella salivae DNA methyltransferase [Prevotella salivae]. <-494223259_?<-763205527_?<-494223263_?<-494223265_?<-763205528_?<-763205531_?<-494223272_Thymidylate_synthase<-494223274_N6-MTase*<-494223281_?<-494223283_?<-494223285_?<-494223287_?<-494223289_?<-494223291_?<-763205533_? 494451533 GP46->DUF2313->Collar->Caudo_TAP->?->?->?->N6-MTase*-> N6-MTase SP HMPREF9952_RS06315 276 bacteria>proteobacteria>gammaproteobacteria Haemophilus pittmaniae hypothetical protein [Haemophilus pittmaniae]. 494451755_GP46->494451582_DUF2313->748589684_Collar->494451690_Caudo_TAP->494451609_?->494451730_?->494451628_?->494451533_N6-MTase*->748589685_?->494451618_?->494451551_?->748589669_?->748589686_?->494451693_?->494451748_?-> 497946642 Collar->?->?->?->?->N6-MTase*->Thymidylate_synthase-> N6-MTase - ATHG_RS03835 276 bacteria>bacteroidetes Alistipes timonensis DNA methyltransferase [Alistipes timonensis]. 497946627_?->497946628_?->648239499_Collar->497946630_?->497946631_?->497946634_?->497946636_?->497946642_N6-MTase*->497946644_Thymidylate_synthase-><-497946646_?<-497946647_?<-497946650_?<-522184605_?<-497946655_?||497946658_?-> 499246665 N6-MTase*-> N6-MTase - HD_RS00615 276 bacteria>proteobacteria>gammaproteobacteria Haemophilus ducreyi hypothetical protein [Haemophilus ducreyi]. 499246647_?->499246654_?->499246655_?->499246658_?->499246659_?->499246660_?->499246661_?->499246665_N6-MTase*->499246666_?->753847986_?->753847719_?->499246670_?->499246672_?->499246673_?->499246676_?-> 511019476 <-ABC-ATPase<-?||?->?->?->?-><-N6-MTase*<-Thymidylate_synthase N6-MTase MethyltransfD12 C801_RS14355 276 bacteria>bacteroidetes Bacteroides uniformis hypothetical protein [Bacteroides uniformis]. 
<-511019470_?<-495939893_ABC-ATPase<-511019471_?||511019472_?->511019473_?->511019474_?->737478242_?-><-511019476_N6-MTase*<-511019477_Thymidylate_synthase<-511019478_?<-737478243_?<-511019480_?<-511019481_?<-511019482_?<-511019483_? 545363364 tail_3->N6-MTase*->?->?->?->ABC-ATPase-> N6-MTase - HMPREF9065_RS02985 276 bacteria>proteobacteria>gammaproteobacteria Aggregatibacter sp. oral taxon 458 hypothetical protein [Aggregatibacter sp. oral taxon 458]. 545363330_?->545363335_?->545363340_?->545363345_?->545363346_?->696449597_?->545363354_tail_3->545363364_N6-MTase*->545363367_?->545363374_?->696449550_?->545363384_ABC-ATPase->696449553_?->696449602_?->545363399_?-> 545434898 <-N6-MTase* N6-MTase MethyltransfD12 HMPREF9148_RS11290 276 bacteria>bacteroidetes Prevotella sp. F0091 D12 class N6 adenine-specific DNA methyltransferase [Prevotella sp. F0091]. <-739039278_?<-545434895_?<-545434896_?<-545434898_N6-MTase*<-545434899_?||545434900_?->545434901_?-><-545434902_?<-739039285_?<-545434904_?<-545434905_? 595939381 Thymidylate_synthase->N6-MTase*-><-?<-Phage_tail_S N6-MTase - M118_4484 276 bacteria>bacteroidetes Bacteroides fragilis str. 3783N1-2 D12 class N6 adenine-specific DNA methyltransferase family protein [Bacteroides fragilis str. 3783N1-2]. 595939376_?->595939377_?->595939378_?->595939379_?->595939380_Thymidylate_synthase->595939381_N6-MTase*-><-595939382_?<-595939383_Phage_tail_S<-595939384_?<-595939385_?<-595939386_?<-595939387_?<-595939388_? 640643393 <-N6-MTase* N6-MTase - JCM14966_RS06695 276 bacteria>bacteroidetes Prevotella oulorum DNA methyltransferase [Prevotella oulorum]. <-739020595_?||496530796_?->496530795_?->640643388_?->640643389_?->640643390_?->640643391_?-><-640643393_N6-MTase*<-640643394_?<-640643395_?<-739020600_?<-640643397_?<-640643398_?<-640643399_?<-739020603_? 647603997 N6-MTase*->Thymidylate_synthase-> N6-MTase - K334_RS0105170 276 bacteria>bacteroidetes Prevotella baroniae DNA methyltransferase [Prevotella baroniae]. 
653238928_?->647604005_?->653238929_?->739008434_?->647604001_?->653238930_?->647603998_?->647603997_N6-MTase*->647603996_Thymidylate_synthase-><-647603995_?<-653238931_?<-545310047_?<-545309968_?<-653238932_?<-647603994_? 652666097 <-N6-MTase*<-N6-MTase N6-MTase Methyltransf_26 Q321_RS0105900 276 bacteria>proteobacteria>betaproteobacteria Conchiformibius steedae hypothetical protein [Conchiformibius steedae]. <-652666087_?<-652666088_?<-736299556_?<-652666092_?||652666093_?-><-736299559_?||652666095_?-><-652666097_N6-MTase*<-652666099_N6-MTase<-652666101_?<-652666102_?<-736299566_?<-736299570_?<-652666104_?||652666106_?-> 655515586 N6-MTase*->Thymidylate_synthase-> N6-MTase MethyltransfD12 P150_RS0104410 276 bacteria>bacteroidetes Prevotella sp. HUN102 DNA methyltransferase [Prevotella sp. HUN102]. 739060657_?->739060658_?->655515581_?->739060633_?->655515583_?->655515584_?->655515585_?->655515586_N6-MTase*->655515587_Thymidylate_synthase->739060659_?-><-655515588_?<-655515589_?<-655515590_?<-655515591_?<-655515592_? 655516468 N6-MTase*->Thymidylate_synthase-> N6-MTase MethyltransfD12 P150_RS0109795 276 bacteria>bacteroidetes Prevotella sp. HUN102 DNA methyltransferase [Prevotella sp. HUN102]. 655516461_?->655516462_?->655516463_?->655516464_?->655516465_?->655516466_?->655516467_?->655516468_N6-MTase*->655516469_Thymidylate_synthase->655516470_?-><-655516471_?<-655516472_?||655516473_?->655516474_?->739060943_?-> 655516580 N6-MTase*->Thymidylate_synthase-> N6-MTase - P150_RS0110495 276 bacteria>bacteroidetes Prevotella sp. HUN102 DNA methyltransferase [Prevotella sp. HUN102]. 
655516575_?->655516576_?->739060777_?->655516577_?->655516578_?->655516579_?->655515585_?->655516580_N6-MTase*->655516581_Thymidylate_synthase-><-739060778_?<-655516582_?||739060953_?-><-655516583_?<-655516584_?||655516585_?-> 737095939 Collar->N6-MTase*-><-?||?->?->?-><-?<-ABC-ATPase N6-MTase MethyltransfD12 H526_RS0116665 276 bacteria>bacteroidetes Aquimarina latercula DNA methyltransferase [Aquimarina latercula]. 737096088_?->653142542_?->737095942_?->653142541_?->653142540_?->653142539_?->737096086_Collar->737095939_N6-MTase*-><-653142537_?||653144827_?->653144828_?->737097563_?-><-653144830_?<-653144831_ABC-ATPase<-737097565_? 739003412 N6-MTase*->Thymidylate_synthase-> N6-MTase MethyltransfD12 HMPREF0654_RS11780 276 bacteria>bacteroidetes Prevotella disiens DNA methyltransferase [Prevotella disiens]. 739003411_?->739003412_N6-MTase*->739003418_Thymidylate_synthase-> 739005860 N6-MTase*->Thymidylate_synthase->?->VirD4-FtsK-> N6-MTase - HMPREF1651_RS08825 276 bacteria>bacteroidetes Prevotella bivia DNA methyltransferase [Prevotella bivia]. <-739005859_?||490468429_?->739005860_N6-MTase*->739005861_Thymidylate_synthase->739005865_?->739005862_VirD4-FtsK-><-488624280_?<-695331374_?<-488624282_?<-488624283_? 739058226 <-Thymidylate_synthase<-N6-MTase*<-?<-Phage_tail_S<-MuF N6-MTase MethyltransfD12 HMPREF9304_RS12585 276 bacteria>bacteroidetes Prevotella timonensis DNA methyltransferase [Prevotella timonensis]. <-739058243_?<-739058244_?<-739058222_Thymidylate_synthase<-739058226_N6-MTase*<-739058238_?<-739058230_Phage_tail_S<-739058239_MuF<-739058240_? 763168088 N6-MTase*->Thymidylate_synthase-> N6-MTase MethyltransfD12 PIN17_RS06195 276 bacteria>bacteroidetes Prevotella intermedia DNA methyltransferase [Prevotella intermedia]. 504522352_?->504522353_?->504522354_?->504522355_?->504522356_?->504522357_?->763168086_?->763168088_N6-MTase*->504522361_Thymidylate_synthase->504522362_?->763168089_?-><-504522364_?<-763168091_?<-490507278_?<-490471607_? 
763205581 N6-MTase*->Thymidylate_synthase-> N6-MTase - HMPREF9420_RS10145 276 bacteria>bacteroidetes Prevotella salivae DNA methyltransferase [Prevotella salivae]. 494223933_?-><-494223934_?<-763205919_?||494223936_?->763205580_?->763205920_?->494223939_?->763205581_N6-MTase*->494223941_Thymidylate_synthase->494223942_?->494223943_?->494223944_?-><-763205583_?||494223946_?->494223947_?-> 786219197 <-N6-MTase* N6-MTase - BN1088_RS06390 276 bacteria>bacteroidetes Sphingobacterium sp. PM2-P1-29 DNA methyltransferase [Sphingobacterium sp. PM2-P1-29]. 786219183_?->786219185_?->786219187_?->786219189_?->786219191_?-><-786219193_?<-786219195_?<-786219197_N6-MTase*<-786219199_?<-786219202_?<-786219204_?<-786219206_?<-786219209_?<-786219212_?<-786219214_? 489846667 Phage_Mu_Gp45->GP46->Baseplate_J->DUF2313->Collar->DUF4376->?->N6-MTase*-> N6-MTase - NEIPOLOT_RS06080 275 bacteria>proteobacteria>betaproteobacteria Neisseria polysaccharea hypothetical protein [Neisseria polysaccharea]. 489846657_Phage_Mu_Gp45->489846658_GP46->757453500_Baseplate_J->489846661_DUF2313->489846662_Collar->489846663_DUF4376->489846665_?->489846667_N6-MTase*-><-645192739_?<-489846672_?<-645192740_?||489846675_?-><-489846680_?<-489846682_?<-489846683_? 489871056 <-N6-MTase*<-?<-?<-?<-?<-?<-METHYLASE N6-MTase - NELON_RS04600 275 bacteria>proteobacteria>betaproteobacteria Neisseria elongata hypothetical protein [Neisseria elongata]. 489871039_?->489871042_?->750384455_?->489871047_?->489871049_?->489871051_?->489871052_?-><-489871056_N6-MTase*<-489870829_?<-489871070_?<-489871073_?<-750384457_?<-489871079_?<-489871081_METHYLASE||489871086_?-> 489918676 GP46->Baseplate_J->DUF2313->Collar->DUF4376->?->?->N6-MTase*-> N6-MTase - EIKCOROL_RS01150 275 bacteria>proteobacteria>betaproteobacteria Eikenella corrodens hypothetical protein [Eikenella corrodens]. 
489918666_GP46->489918667_Baseplate_J->489918668_DUF2313->737609230_Collar->489918671_DUF4376->489918673_?->737609113_?->489918676_N6-MTase*-><-489918679_?||737609116_?->489918682_?->489918683_?-><-489918684_?<-489918685_?<-489918688_? 495946224 <-N6-MTase*<-?||?->DOC-> N6-MTase MethyltransfD12 BSFG_RS03795 275 bacteria>bacteroidetes Bacteroides sp. 4_3_47FAA hypothetical protein, partial [Bacteroides sp. 4_3_47FAA]. <-495946217_?<-696364617_?<-495946222_?<-495946224_N6-MTase*<-495946227_?||696364624_?->495946231_DOC-><-495946232_?||696364621_?->696364623_?-><-495946235_? 565956523 MuF->?->N6-MTase*-><-?<-Phage_tail_S N6-MTase - HMPREF1199_RS02420 275 bacteria>bacteroidetes Prevotella oralis hypothetical protein [Prevotella oralis]. 738975553_?->565956498_?->565956503_?->565956506_?->738975744_?->738975745_MuF->565956519_?->565956523_N6-MTase*-><-565956528_?<-565956533_Phage_tail_S<-565956539_?<-565956551_?<-565956557_?<-565956560_?<-490503989_? 633953678 N6-MTase*-> N6-MTase - HPS41_07110 275 bacteria>proteobacteria>gammaproteobacteria Haemophilus parasuis ST4-1 hypothetical protein HPS41_07110 [Haemophilus parasuis ST4-1]. 633953678_N6-MTase*-><-633953679_?||633953680_?->633953681_?->633953682_?->633953683_?->633953684_?->633953685_?-> 640590225 <-N6-MTase*<-Thymidylate_synthase N6-MTase MethyltransfD12 JCM16497_RS23455 275 bacteria>bacteroidetes Bacteroides sartorii DNA methyltransferase [Bacteroides sartorii]. <-640590216_?<-640590217_?||640590218_?->640590219_?->640590222_?->640590223_?->640590224_?-><-640590225_N6-MTase*<-640590226_Thymidylate_synthase<-640590227_?<-727808809_?<-640590229_? 737511689 <-N6-MTase*<-?<-DUF4376<-?<-DUF2313<-Baseplate_J<-GP46<-Phage_Mu_Gp45 N6-MTase - HPS9_RS04300 275 bacteria>proteobacteria>gammaproteobacteria Haemophilus parasuis hypothetical protein [Haemophilus parasuis]. 
<-737511689_N6-MTase*<-737511692_?<-737511694_DUF4376<-652522515_?<-737511697_DUF2313<-737511699_Baseplate_J<-506592116_GP46<-737511701_Phage_Mu_Gp45 750049800 <-N6-MTase* N6-MTase - HMPREF0198_RS01770 275 bacteria>proteobacteria>gammaproteobacteria Cardiobacterium hominis hypothetical protein [Cardiobacterium hominis]. <-490241187_?<-490241188_?<-490241189_?<-490241191_?<-750049796_?||750050068_?->490241194_?-><-750049800_N6-MTase*<-750050071_?<-490241199_?||490241201_?->750050072_?->490241206_?->490241208_?->750050074_?-> 495910797 Collar->?->Thymidylate_synthase->N6-MTase*-> N6-MTase MethyltransfD12 BZARG_RS04055 274 bacteria>bacteroidetes Bizionia argentinensis DNA methyltransferase [Bizionia argentinensis]. 495910929_?->495911099_?->749809173_?->749809174_?->749809175_Collar->495910915_?->495911144_Thymidylate_synthase->495910797_N6-MTase*->495910913_?->495911079_?->749809153_?-><-495910801_?||495911112_?-><-495910949_?<-495911118_? 499512619 <-N6-MTase*<-?<-?<-Caudo_TAP<-Collar<-Tail_P2_I<-Baseplate_J<-GPW_gp25 N6-MTase - MS_RS00345 274 bacteria>proteobacteria>gammaproteobacteria [Mannheimia] succiniciproducens hypothetical protein [[Mannheimia] succiniciproducens]. 499512611_?->753910721_?->499512613_?->499512615_?->499512616_?->499512617_?->499512618_?-><-499512619_N6-MTase*<-499512620_?<-499512621_?<-499512622_Caudo_TAP<-499512623_Collar<-753909406_Tail_P2_I<-499512625_Baseplate_J<-753909410_GPW_gp25 548211070 <-N6-MTase* N6-MTase SP+Methyltransf_26 BN741_01478 274 bacteria>bacteroidetes Prevotella stercorea CAG:629 d12 class N6 adenine-specific DNA methyltransferase family protein [Prevotella stercorea CAG:629]. <-548211068_?<-548211069_?<-548211070_N6-MTase*<-548211071_?<-548211072_?<-548211073_?<-548211074_?<-548211075_?<-548211076_?<-548211077_? 739635808 N6-MTase*->?->?->?->?->Phage_capsid->?->Portal-> N6-MTase Methyltransf_26 SALWKB29_RS09160 274 bacteria>proteobacteria>betaproteobacteria Snodgrassella alvi hypothetical protein [Snodgrassella alvi]. 
739635869_?->739635798_?->739635799_?->739635802_?->739635804_?->739635805_?->739635807_?->739635808_N6-MTase*->739635812_?->739635815_?->739635818_?->739635876_?->739635879_Phage_capsid->739635821_?->739635826_Portal-> 740746518 <-N6-MTase*<-?<-?<-?<-?<-?<-Collar N6-MTase MethyltransfD12 BN863_RS14255 274 bacteria>bacteroidetes Formosa agariphila DNA methyltransferase [Formosa agariphila]. 740746510_?->740746512_?->740748407_?-><-740746513_?||740746514_?-><-740746516_?<-740746517_?<-740746518_N6-MTase*<-740746519_?<-740746521_?<-740746522_?<-740746523_?<-740746524_?<-740746525_Collar<-740746527_? 488758369 Collar->?->?->?->?->?->N6-MTase*->N6-MTase-> N6-MTase - CAPSP0001_RS11720 273 bacteria>bacteroidetes Capnocytophaga sputigena D12 class N6 adenine-specific DNA methyltransferase family protein [Capnocytophaga sputigena]. 488758360_?->488758329_Collar->488758368_?->488766210_?->488758379_?->488758366_?->488758350_?->488758369_N6-MTase*->488758377_N6-MTase-> 739535589 N6-MTase*->?->HNH->Terminase_SS->Terminase_LS-> N6-MTase - SASC598J21_RS08380 273 bacteria>proteobacteria>betaproteobacteria Snodgrassella alvi hypothetical protein [Snodgrassella alvi]. 739535578_?->739535581_?->739535583_?->739535586_?->739535442_?->739535446_?->739535448_?->739535589_N6-MTase*->739535450_?->739535453_HNH->739535456_Terminase_SS->739535458_Terminase_LS->739535461_?->739535592_?->739535595_?-> 763356669 Collar->?->?->?->?->N6-MTase+Phage-tailfib*-> N6-MTase+Phage-tailfib SP JCM21142_RS20860 273 bacteria>bacteroidetes Saccharicrinis fermentans hypothetical protein, partial [Saccharicrinis fermentans]. 763356667_?->653273643_Collar->653273642_?->653273641_?->653273640_?->653273639_?->763356669_N6-MTase+Phage-tailfib*-> 488180979 Baseplate_J->Baseplate_J->DUF2313->?->?->?->?->N6-MTase*-> N6-MTase - NM70021_RS109520 272 bacteria>proteobacteria>betaproteobacteria Neisseria meningitidis D12 class N6 adenine-specific DNA methyltransferase family protein [Neisseria meningitidis]. 
488141546_Baseplate_J->488141547_Baseplate_J->728043483_DUF2313->488166025_?->488166024_?->488141550_?->488141551_?->488180979_N6-MTase*-><-488149547_?||488149549_?-><-488149551_? 497282263 N6-MTase*-> N6-MTase - C506_RS0110745 272 bacteria>bacteroidetes Alistipes MULTISPECIES: DNA methyltransferase [Alistipes]. 517526431_?->703290626_?->517526433_?->648626765_?->497282137_?->517526435_?->497282133_?->497282263_N6-MTase*-><-517526436_?||517526437_?->497282296_?->497282210_?->517526438_?->517526439_?->517526440_?-> 523673311 ABC-ATPase-><-?<-?<-?||?->?-><-N6-MTase*<-DCM N6-MTase - AJF4211_000170 270 bacteria>proteobacteria>gammaproteobacteria Avibacterium paragallinarum JF4211 Putative uncharacterized protein [Avibacterium paragallinarum JF4211]. <-523673304_?||523673305_ABC-ATPase-><-523673306_?<-523673307_?<-523673308_?||523673309_?->523673310_?-><-523673311_N6-MTase*<-523673312_DCM<-523673313_?||523673314_?-><-523673315_?<-523673316_?<-523673317_?<-523673318_? 523674289 <-N6-MTase*<-DCM<-?<-?<-DUF4376<-Collar N6-MTase - AJF4211_000450 270 bacteria>proteobacteria>gammaproteobacteria Avibacterium paragallinarum JF4211 Putative uncharacterized protein [Avibacterium paragallinarum JF4211]. <-523674282_?<-523674283_?<-523674284_?||523674285_?->523674286_?->523674287_?->523674288_?-><-523674289_N6-MTase*<-523674290_DCM<-523674291_?<-523674292_?<-523674293_DUF4376<-523674294_Collar # 2; 490401642 <-N6-MTase* N6-MTase - CUP_RS09075 283 bacteria>proteobacteria>epsilonproteobacteria Campylobacter upsaliensis hypothetical protein [Campylobacter upsaliensis]. <-490401632_?||490401633_?->748624973_?->490401636_?->490401637_?->490401638_?-><-748624639_?<-490401642_N6-MTase*<-748624972_? 736539564 <-N6-MTase*<-DCM N6-MTase SP LS72_RS05710 279 bacteria>proteobacteria>epsilonproteobacteria Helicobacter apodemus hypothetical protein [Helicobacter apodemus]. 
<-736539555_?<-736539557_?<-736539558_?<-736539569_?<-736539559_?<-736539561_?<-736539563_?<-736539564_N6-MTase*<-736539566_DCM<-736539568_?
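The gene-neighborhood strings in the rows above can be read mechanically. The following is a minimal sketch, assuming the notation works as it appears to in these rows: each gene is written as `<GI>_<annotation>`, a gene followed by `->` is on the forward strand, a gene preceded by `<-` is on the reverse strand, `||` separates a reverse-strand gene from a following forward-strand gene, and `*` marks the query gene. The names `Gene` and `parse_neighborhood` are hypothetical and are not part of the original analysis pipeline.

```python
import re
from typing import List, NamedTuple


class Gene(NamedTuple):
    gi: str         # identifier before the first underscore
    label: str      # domain annotation, "?" when unannotated
    strand: str     # "+", "-", or "?" if it cannot be inferred
    is_query: bool  # True when the annotation carried a trailing "*"


# Direction markers and the divergent-boundary marker (assumed notation).
_SEPARATORS = re.compile(r"(->|<-|\|\|)")


def parse_neighborhood(text: str) -> List[Gene]:
    """Split one neighborhood string into genes with inferred strands."""
    parts = [p for p in _SEPARATORS.split(text.strip()) if p]
    genes = []
    for i, part in enumerate(parts):
        if part in ("->", "<-", "||"):
            continue
        prev_tok = parts[i - 1] if i > 0 else ""
        next_tok = parts[i + 1] if i + 1 < len(parts) else ""
        if next_tok == "->":
            strand = "+"
        elif prev_tok == "<-":
            strand = "-"
        else:
            strand = "?"
        # Annotations may themselves contain underscores, so split once.
        gi, _, label = part.partition("_")
        genes.append(Gene(gi, label.rstrip("*"), strand, label.endswith("*")))
    return genes


if __name__ == "__main__":
    row = "739003411_?->739003412_N6-MTase*->739003418_Thymidylate_synthase->"
    for gene in parse_neighborhood(row):
        print(gene)
```

Splitting on the separators first and inferring the strand from the adjacent markers keeps annotations that contain hyphens or underscores (for example `VirD4-FtsK` or `Thymidylate_synthase`) intact.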
GI Archs Archs Pfam archs Gene name Len Taxonomy Species Genbank
# AlkBH1: In kinetoplastids, note fusion to ZNF+RRM+TET-JBP in Aureococcus
86562425 - - 2OG-FeII_Oxy_2 Note fragmented Y51H7C.5 248 eukaryota>metazoa>nematoda Caenorhabditis elegans hypothetical protein Y51H7C.5 [Caenorhabditis elegans]. 326429695 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PTSG_06918 427 eukaryota>choanoflagellida Salpingoeca rosetta hypothetical protein PTSG_06918 [Salpingoeca rosetta]. Crev1000000444 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Crev1000000444 369 eukaryota>fungi>kickxellomycotina Coemansia reversa estExt_Genemark1.C_20086 Mcir1000011152 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Mcir1000011152 420 eukaryota>fungi>basal Mucor circinelloides Genemark1.11549_g Bcir1000005678 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Bcir1000005678 401 eukaryota>fungi>mucoromycotina Backusella circina e_gw1.67.64.1 Pbla1000008754 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Pbla1000008754 395 eukaryota>fungi>basal Phycomyces blakesleeanus fgeneshPB_pg.23__178 Uram1000002459 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Uram1000002459 430 eukaryota>fungi>mucoromycotina Umbelopsis ramanniana fgenesh1_kg.10_#_156_#_combest_scaffold_10_5503 Lhya1000001164 TM+TM+TM+TM - TM+TM+TM+TM+2OG-FeII_Oxy_2 Lhya1000001164 647 eukaryota>fungi>mucoromycotina Lichtheimia hyalospora gm1.1132_g Bcir1000009260 SP - SP+2OG-FeII_Oxy_2 Bcir1000009260 427 eukaryota>fungi>mucoromycotina Backusella circina estExt_Genewise1Plus.C_2670019 384493338 TM+TM - TM+TM+2OG-FeII_Oxy_2 RO3G_08534 557 eukaryota>fungi Rhizopus delemar RA 99-880 hypothetical protein RO3G_08534 [Rhizopus delemar RA 99-880].
Mcir1000002688 - - 2OG-FeII_Oxy_2 Mcir1000002688 428 eukaryota>fungi>basal Mucor circinelloides Genemark1.2761_g Pbla1000012921 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Pbla1000012921 451 eukaryota>fungi>basal Phycomyces blakesleeanus estExt_fgeneshPB_pg.C_50358 Bcir1000010874 SP+TM+TM+TM+TM+TM - SP+TM+TM+TM+TM+TM+2OG-FeII_Oxy_2 Bcir1000010874 644 eukaryota>fungi>mucoromycotina Backusella circina estExt_Genewise1Plus.C_890072 Lhya1000002295 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Lhya1000002295 358 eukaryota>fungi>mucoromycotina Lichtheimia hyalospora estExt_Genemark1.C_180068 Mver1000004016 - - 2OG-FeII_Oxy_2 Mver1000004016 426 eukaryota>fungi>zygomycete Mortierella verticillata Mortierella verticillata NRRL 6337 hypothetical protein (426 aa) Spun1000007058 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Spun1000007058 379 eukaryota>fungi>chytridiomycota Spizellomyces punctatus Spizellomyces punctatus DAOM BR117 alkylated DNA repair protein AlkB (379 aa) 58262090 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 CNM01450 425 eukaryota>fungi>basidiomycota Cryptococcus neoformans var. neoformans JEC21 hypothetical protein CNM01450 [Cryptococcus neoformans var. neoformans JEC21]. Wseb1000002876 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Wseb1000002876 383 eukaryota>fungi>basidiomycota Wallemia sebi estExt_fgenesh1_kg.C_100073 164650570 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LACBIDRAFT_305933 428 eukaryota>fungi>basidiomycota Laccaria bicolor S238N-H82 predicted protein [Laccaria bicolor S238N-H82]. 169853276 - AlkB-2OGFEDO+TBC 2OG-FeII_Oxy_2+Ribosomal_S21e+DUF3548+RabGAP-TBC CC1G_04298 1328 eukaryota>fungi>basidiomycota Coprinopsis cinerea okayama7#130 hypothetical protein CC1G_04298 [Coprinopsis cinerea okayama7#130]. 527305476 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 FOMPIDRAFT_1027065 446 eukaryota>fungi>basidiomycota Fomitopsis pinicola FP-58527 SS1 hypothetical protein FOMPIDRAFT_1027065 [Fomitopsis pinicola FP-58527 SS1]. 
Abis1000001743 - AlkB-2OGFEDO+TBC 2OG-FeII_Oxy_2+Ribosomal_S21e+DUF3548+RabGAP-TBC Abis1000001743 1245 eukaryota>fungi>basidiomycota Agaricus bisporus estExt_fgenesh2_pm.C_20405 Rall1000005813 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Rall1000005813 365 eukaryota>fungi>cryptomycota Rozella allomycis O9G_000964m.01 Ccor1000005557 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Ccor1000005557 411 eukaryota>fungi>entomophthoromycota Conidiobolus coronatus gm1.6199_g Amac1000008309 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Amac1000008309 378 eukaryota>fungi>blastocladiomycota Allomyces macrogynus Allomyces macrogynus ATCC 38327 alkylated DNA repair protein AlkB (378 aa) Amac1000014273 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Amac1000014273 378 eukaryota>fungi>blastocladiomycota Allomyces macrogynus Allomyces macrogynus ATCC 38327 alkylated DNA repair protein AlkB (378 aa) Bden1000008328 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Bden1000008328 371 eukaryota>fungi>chytridiomycota Batrachochytrium dendrobatidis Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (371 aa) Falb1000005587 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Falb1000005587 452 eukaryota>nucleariidae_and_fonticula Fonticula alba Fonticula alba ATCC 38817 (V2) hypothetical protein (452 aa) Falb1000005586 SP - SP+2OG-FeII_Oxy_2 Falb1000005586 411 eukaryota>nucleariidae_and_fonticula Fonticula alba Fonticula alba ATCC 38817 (V2) hypothetical protein (411 aa) 116202811 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 CHGG_09290 296 eukaryota>fungi>ascomycota Chaetomium globosum CBS 148.51 hypothetical protein CHGG_09290 [Chaetomium globosum CBS 148.51]. 85095632 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 NCU09779.1 366 eukaryota>fungi>ascomycota Neurospora crassa OR74A hypothetical protein [Neurospora crassa OR74A]. 
Chet1000008148 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Chet1000008148 386 eukaryota>fungi>ascomycota Cochliobolus heterostrophus estExt_fgenesh1_pg.C_180212 111061162 - - 2OG-FeII_Oxy_2 SNOG_09947 366 eukaryota>fungi>ascomycota Phaeosphaeria nodorum SN15 hypothetical protein SNOG_09947 [Phaeosphaeria nodorum SN15]. 67527001 - - 2OG-FeII_Oxy_2 AN3958.2 361 eukaryota>fungi>ascomycota Aspergillus nidulans FGSC A4 hypothetical protein AN3958.2 [Aspergillus nidulans FGSC A4]. 70991683 - - 2OG-FeII_Oxy_2 AFUA_6G07990 359 eukaryota>fungi>ascomycota Aspergillus fumigatus Af293 oxidoreductase, 2OG-Fe(II) oxygenase family family [Aspergillus fumigatus Af293]. 160703884 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SNOG_13051 298 eukaryota>fungi>ascomycota Phaeosphaeria nodorum SN15 hypothetical protein SNOG_13051 [Phaeosphaeria nodorum SN15]. 50555041 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 YALI0F03003g 330 eukaryota>fungi>ascomycota Yarrowia lipolytica CLIB122 YALI0F03003p [Yarrowia lipolytica]. 19113345 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SPBC13G1.04c 273 eukaryota>fungi>ascomycota Schizosaccharomyces pombe alkB homolog [Schizosaccharomyces pombe]. 569432089 - - 2OG-FeII_Oxy_2 RFI_03532 449 eukaryota Reticulomyxa filosa hypothetical protein RFI_03532 [Reticulomyxa filosa]. 528235142 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AGDE_10531 368 eukaryota>euglenozoa>kinetoplastida Angomonas deanei alkylated DNA repair protein alkB like protein 1 [Angomonas deanei]. 528232486 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AGDE_11003 368 eukaryota>euglenozoa>kinetoplastida Angomonas deanei alkylated DNA repair protein alkB like protein 1 [Angomonas deanei]. 528273468 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AGDE_01394 167 eukaryota>euglenozoa>kinetoplastida Angomonas deanei hypothetical protein AGDE_01394 [Angomonas deanei]. 528225497 - - 2OG-FeII_Oxy_2 STCU_07353 386 eukaryota>euglenozoa>kinetoplastida Strigomonas culicis alkylated DNA repair protein alkB like protein 1 [Strigomonas culicis]. 
594144045 - - 2OG-FeII_Oxy_2 GSHART1_T00002327001 120 eukaryota>euglenozoa>kinetoplastida Phytomonas sp. isolate Hart1 unnamed protein product [Phytomonas sp. isolate Hart1]. 146082486 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LINJ_16_0360 368 eukaryota>euglenozoa>kinetoplastida Leishmania infantum JPCM5 conserved hypothetical protein [Leishmania infantum JPCM5]. 157867117 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LMJF_16_0350 368 eukaryota>euglenozoa>kinetoplastida Leishmania major strain Friedlin conserved hypothetical protein [Leishmania major strain Friedlin]. 72387544 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Tb927.4.460 323 eukaryota>euglenozoa>kinetoplastida Trypanosoma brucei brucei TREU927 Alkylated DNA repair protein (alkB homolog) [Trypanosoma brucei TREU927]. 71419500 - - 2OG-FeII_Oxy_2 Tc00.1047053510687.140 323 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener alkylated DNA repair protein [Trypanosoma cruzi strain CL Brener]. 238660742 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Smp_041490 334 eukaryota>metazoa Schistosoma mansoni expressed protein [Schistosoma mansoni]. 674588814 - AlkB-2OGFEDO 2OG-FeII_Oxy_2+Transposase_mut HmN_000414100 344 eukaryota>metazoa Hymenolepis microstoma alkylated DNA repair protein alkB [Hymenolepis microstoma]. 674564509 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EgrG_000517300 366 eukaryota>metazoa Echinococcus granulosus alkylated DNA repair protein alkB [Echinococcus granulosus]. 576694219 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EGR_07282 347 eukaryota>metazoa Echinococcus granulosus Alkylated DNA repair protein AlkB [Echinococcus granulosus]. 674576182 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EmuJ_000517300 366 eukaryota>metazoa Echinococcus multilocularis conserved hypothetical protein [Echinococcus multilocularis]. Hrob1000010247 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Hrob1000010247 263 eukaryota>metazoa>annelida Helobdella robusta 123161 158593980 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Bm1_35285 339 eukaryota>metazoa>nematoda Brugia malayi ALKBH protein, putative [Brugia malayi]. 
17537375 - - 2OG-FeII_Oxy_2 Y51H7C.4 169 eukaryota>metazoa>nematoda Caenorhabditis elegans hypothetical protein Y51H7C.4 [Caenorhabditis elegans]. Adig1000007449 TM+TM+TM+TM+TM AlkB-2OGFEDO 2OG-FeII_Oxy_2+TM+TM+TM+TM+TM Adig1000007449 653 eukaryota>cnidaria Acropora digitifera adi_v1.04821 221130857 - - 2OG-FeII_Oxy_2 LOC100210027 350 eukaryota>metazoa>cnidaria Hydra magnipapillata PREDICTED: similar to predicted protein [Hydra magnipapillata]. 156222983 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 NEMVEDRAFT_v1g96737 323 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein, partial [Nematostella vectensis]. Bnat1000003625 - - 2OG-FeII_Oxy_2 Bnat1000003625 292 eukaryota>rhizaria>cercozoa Bigelowiella natans fgenesh1_pg.23_#_199 Aque1000008145 - BetaPropeller 2OG-FeII_Oxy_2+WD40 Aque1000008145 303 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.208392 115681547 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC579973 488 eukaryota>metazoa>echinodermata Strongylocentrotus purpuratus PREDICTED: alkylated DNA repair protein alkB homolog 1 [Strongylocentrotus purpuratus]. Aque1000020897 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Aque1000020897 420 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.221165 198417894 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC100186336 312 eukaryota>metazoa>chordata Ciona intestinalis PREDICTED: alkylated DNA repair protein alkB homolog 1 [Ciona intestinalis]. 119115270 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AgaP_AGAP000155 293 eukaryota>metazoa>hexapoda Anopheles gambiae str. PEST AGAP000155-PA [Anopheles gambiae str. PEST]. Lgig1000020166 - - 2OG-FeII_Oxy_2 Lgig1000020166 349 eukaryota>metazoa>mollusca Lottia gigantea estExt_fgenesh2_pg.C_sca_10395 Caps1000009657 - - 2OG-FeII_Oxy_2 Caps1000009657 317 eukaryota>metazoa>annelida Capitella spI fgenesh1_pg.C_scaffold_335000028 190584785 - SbcC+AlkB-2OGFEDO 2OG-FeII_Oxy_2 TRIADDRAFT_24904 271 eukaryota>metazoa>placozoa Trichoplax adhaerens hypothetical protein TRIADDRAFT_24904, partial [Trichoplax adhaerens]. 
91080539 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC661543 297 eukaryota>metazoa>hexapoda Tribolium castaneum PREDICTED: similar to AlkB CG33250-PA [Tribolium castaneum]. 45555401 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Dmel_CG33250 332 eukaryota>metazoa>hexapoda Drosophila melanogaster AlkB [Drosophila melanogaster]. 193641120 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC100160270 300 eukaryota>metazoa>hexapoda Acyrthosiphon pisum PREDICTED: similar to AlkB CG33250-PA [Acyrthosiphon pisum]. 156542602 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC100120538 252 eukaryota>metazoa>hexapoda Nasonia vitripennis PREDICTED: similar to HDC19127 [Nasonia vitripennis]. 66515297 - - 2OG-FeII_Oxy_2 AlkB 310 eukaryota>metazoa>hexapoda Apis mellifera PREDICTED: alkylated DNA repair protein alkB homolog 1 [Apis mellifera]. 307171459 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EAG_03150 306 eukaryota>metazoa>hexapoda Camponotus floridanus Alkylated DNA repair protein alkB-like protein 1 [Camponotus floridanus]. 307202053 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EAI_02538 308 eukaryota>metazoa>hexapoda Harpegnathos saltator Alkylated DNA repair protein alkB-like protein 1 [Harpegnathos saltator]. Smar1000008099 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Smar1000008099 279 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR006255-PA pep:novel scaffold:Smar1:JH431682:46249:48654:-1 gene:SMAR006255 transcript:SMAR006255-RA 321456927 - - 2OG-FeII_Oxy_2 DAPPUDRAFT_218420 288 eukaryota>metazoa>crustacea Daphnia pulex hypothetical protein DAPPUDRAFT_218420 [Daphnia pulex]. 210123411 - HNH-HHH+AlkB-2OGFEDO 2OG-FeII_Oxy_2 BRAFLDRAFT_119815 365 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_119815 [Branchiostoma floridae]. 210123430 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 BRAFLDRAFT_262043 343 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_262043, partial [Branchiostoma floridae]. 
291242901 - - 2OG-FeII_Oxy_2 LOC100369152 416 eukaryota>metazoa>hemichordata Saccoglossus kowalevskii PREDICTED: alkylated DNA repair protein alkB homolog 1-like [Saccoglossus kowalevskii]. 66472360 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 alkbh1 363 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio alkylated DNA repair protein alkB homolog 1 [Danio rerio]. 47206791 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 GSTEN:00005439:G:001 351 eukaryota>metazoa>chordata>vertebrata>actinopterygii Tetraodon nigroviridis unnamed protein product, partial [Tetraodon nigroviridis]. 326920865 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC100546801 365 eukaryota>metazoa>chordata>vertebrata Meleagris gallopavo PREDICTED: alkylated DNA repair protein alkB homolog 1-like [Meleagris gallopavo]. 224051564 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 ALKBH1 367 eukaryota>metazoa>chordata>vertebrata Taeniopygia guttata PREDICTED: alkylated DNA repair protein alkB homolog 1 [Taeniopygia guttata]. 71895969 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 ALKBH1 371 eukaryota>metazoa>chordata>vertebrata Gallus gallus alkylated DNA repair protein alkB homolog 1 [Gallus gallus]. 327259296 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 alkbh1 370 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: alkylated DNA repair protein alkB homolog 1 isoform X1 [Anolis carolinensis]. 87298840 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 ALKBH1 389 eukaryota>metazoa>chordata>vertebrata Homo sapiens alkylated DNA repair protein alkB homolog 1 [Homo sapiens]. 114654177 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 ALKBH1 389 eukaryota>metazoa>chordata>vertebrata Pan troglodytes PREDICTED: alkylated DNA repair protein alkB homolog 1 [Pan troglodytes]. 109478500 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Alkbh_predicted 389 eukaryota>metazoa>chordata>vertebrata Rattus norvegicus PREDICTED: similar to Alkylated DNA repair protein alkB homolog [Rattus norvegicus]. 
Fcyl1000037010 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Fcyl1000037010 279 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.26.280.1 Fcyl1000031479 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Fcyl1000031479 279 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.26_#_236 226523214 METHYLASE AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 MICPUN_60739 684 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. 485641088 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EMIHUDRAFT_227562 256 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_227562 [Emiliania huxleyi CCMP1516]. 303279975 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 MICPUCDRAFT_69712 167 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 226518964 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 MICPUN_108675 334 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. Pram1000014720 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Pram1000014720 217 eukaryota>stramenopiles Phytophthora ramorum 50629 Psoj1000010940 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Psoj1000010940 292 eukaryota>stramenopiles Phytophthora sojae 138234 301110256 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PITG_08013 292 eukaryota>stramenopiles Phytophthora infestans T30-4 conserved hypothetical protein [Phytophthora infestans T30-4]. 302838827 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 VOLCADRAFT_101962 250 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_101962 [Volvox carteri f. nagariensis]. 162676748 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PHYPADRAFT_16622 253 eukaryota>viridiplantae Physcomitrella patens predicted protein, partial [Physcomitrella patens]. 186519239 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 AT5G01780 387 eukaryota>viridiplantae Arabidopsis thaliana oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana]. 
30683015 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AT3G14160 313 eukaryota>viridiplantae Arabidopsis thaliana oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana]. 15231791 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AT3G14140 452 eukaryota>viridiplantae Arabidopsis thaliana oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana]. 284096896 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 NAEGRDRAFT_45239 294 eukaryota>heterolobosea Naegleria gruberi predicted protein [Naegleria gruberi]. 323449197 SP+TET-JBP ZNF+TRUA+TET-JBP+AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 AURANDRAFT_66742 1793 eukaryota>stramenopiles Aureococcus anophagefferens hypothetical protein AURANDRAFT_66742 [Aureococcus anophagefferens]. # SAD fused group (Note ZnR after SAD reported in original paper) Spun1000007354 - SAD+AlkB-2OGFEDO 2OG-FeII_Oxy_2 Spun1000007354 742 eukaryota>fungi>chytridiomycota Spizellomyces punctatus Spizellomyces punctatus DAOM BR117 hypothetical protein (742 aa) Spun1000003257 - SWC3+AlkB-2OGFEDO 2OG-FeII_Oxy_2 Spun1000003257 341 eukaryota>fungi>chytridiomycota Spizellomyces punctatus Spizellomyces punctatus DAOM BR117 hypothetical protein (341 aa) Ccor1000006818 - SAD+AlkB-2OGFEDO 2OG-FeII_Oxy_2 Ccor1000006818 546 eukaryota>fungi>entomophthoromycota Conidiobolus coronatus fgenesh1_pg.165_#_2 88182299 - CDC27+SAD+AlkB-2OGFEDO 2OG-FeII_Oxy_2 CHGG_06386 724 eukaryota>fungi>ascomycota Chaetomium globosum CBS 148.51 hypothetical protein CHGG_06386 [Chaetomium globosum CBS 148.51]. 169600785 - SAD+AlkB-2OGFEDO 2OG-FeII_Oxy_2 SNOG_03244 978 eukaryota>fungi>ascomycota Phaeosphaeria nodorum SN15 hypothetical protein SNOG_03244 [Phaeosphaeria nodorum SN15]. 67526987 SP SAD+AlkB-2OGFEDO SP+2OG-FeII_Oxy_2+2OG-FeII_Oxy_2 AN3951.2 686 eukaryota>fungi>ascomycota Aspergillus nidulans FGSC A4 hypothetical protein AN3951.2 [Aspergillus nidulans FGSC A4]. 
70998340 - SAD+DDRP-ZNR+AlkB-2OGFEDO 2OG-FeII_Oxy_2 AFUA_5G07420 465 eukaryota>fungi>ascomycota Aspergillus fumigatus Af293 hypothetical protein AFUA_5G07420 [Aspergillus fumigatus Af293]. 527303756 - SAD+AlkB-2OGFEDO 2OG-FeII_Oxy_2 FOMPIDRAFT_1156757 567 eukaryota>fungi>basidiomycota Fomitopsis pinicola FP-58527 SS1 hypothetical protein FOMPIDRAFT_1156757 [Fomitopsis pinicola FP-58527 SS1]. 164647611 SP SAD+AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 LACBIDRAFT_313729 1047 eukaryota>fungi>basidiomycota Laccaria bicolor S238N-H82 predicted protein [Laccaria bicolor S238N-H82]. Abis1000004299 - PAF1+SAD+AlkB-2OGFEDO 2OG-FeII_Oxy_2 Abis1000004299 1014 eukaryota>fungi>basidiomycota Agaricus bisporus Genemark.4150_g 116508555 - SAD+AlkB-2OGFEDO+AlkB-2OGFEDO - CC1G_01939 500 eukaryota>fungi>basidiomycota Coprinopsis cinerea okayama7#130 predicted protein [Coprinopsis cinerea okayama7#130]. # Although grouped with the above, searches put them closer to AlkBH3 485611490 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EMIHUDRAFT_57409 94 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_57409, partial [Emiliania huxleyi CCMP1516]. 551605040 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EMIHUDRAFT_58725 85 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_58725, partial [Emiliania huxleyi CCMP1516]. Fcyl1000122023 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Fcyl1000122023 121 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.28.330.1 Fcyl1000032066 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Fcyl1000032066 429 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.28_#_266 Fcyl1000030685 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Fcyl1000030685 495 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.24_#_31 Fcyl1000120875 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Fcyl1000120875 150 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.24.318.1 #; AlkBH2 569436514 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 RFI_01908 150 eukaryota Reticulomyxa filosa hypothetical protein RFI_01908 [Reticulomyxa filosa].
156221236 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 NEMVEDRAFT_v1g101914 212 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein [Nematostella vectensis]. Lgig1000002422 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Lgig1000002422 242 eukaryota>metazoa>mollusca Lottia gigantea e_gw1.2.1002.1 210111343 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 BRAFLDRAFT_83601 267 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_83601 [Branchiostoma floridae]. 219447363 - MYB+MYB+POTRA+POTRA+AlkB-2OGFEDO Myb_DNA-bind_6+Myb_DNA-binding+Myb_DNA-binding+YppG+Cmyb_C+2OG-FeII_Oxy_2 BRAFLDRAFT_123541 844 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_123541 [Branchiostoma floridae]. 115953792 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC593121 265 eukaryota>metazoa>echinodermata Strongylocentrotus purpuratus PREDICTED: similar to AlkB, alkylation repair homolog 2 (E. coli) [Strongylocentrotus purpuratus]. Adig1000021749 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Adig1000021749 408 eukaryota>cnidaria Acropora digitifera adi_v1.02480 Aque1000029667 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Aque1000029667 264 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.229935 114646812 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 ALKBH2 261 eukaryota>metazoa>chordata>vertebrata Pan troglodytes PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 isoform 1 [Pan troglodytes]. 48717226 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 ALKBH2 261 eukaryota>metazoa>chordata>vertebrata Homo sapiens alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 isoform 1 [Homo sapiens]. 61098162 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Alkbh2 239 eukaryota>metazoa>chordata>vertebrata Mus musculus alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 [Mus musculus]. 109497502 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Alkbh2_predicted 239 eukaryota>metazoa>chordata>vertebrata Rattus norvegicus PREDICTED: similar to alkB, alkylation repair homolog 2 [Rattus norvegicus]. 
118098574 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 ALKBH2 241 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: hypothetical protein [Gallus gallus]. 326929758 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 ALKBH2 243 eukaryota>metazoa>chordata>vertebrata Meleagris gallopavo PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 isoform X1 [Meleagris gallopavo]. 224071680 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 ALKBH2 256 eukaryota>metazoa>chordata>vertebrata Taeniopygia guttata PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 [Taeniopygia guttata]. 68383159 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 si:dkey-65b12.2 258 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 2 [Danio rerio]. 47226495 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 GSTEN:00029512:G:001 257 eukaryota>metazoa>chordata>vertebrata>actinopterygii Tetraodon nigroviridis unnamed protein product, partial [Tetraodon nigroviridis]. 193606231 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC100162992 218 eukaryota>metazoa>hexapoda Acyrthosiphon pisum PREDICTED: similar to alkB, alkylation repair homolog 2 [Acyrthosiphon pisum]. 189241463 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 LOC662784 197 eukaryota>metazoa>hexapoda Tribolium castaneum PREDICTED: similar to alkB, alkylation repair homolog 2 [Tribolium castaneum]. 163776713 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 MONBRDRAFT_16363 180 eukaryota>choanoflagellida Monosiga brevicollis MX1 predicted protein, partial [Monosiga brevicollis MX1]. 514688869 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PTSG_06623 325 eukaryota>choanoflagellida Salpingoeca rosetta Alkbh2 protein [Salpingoeca rosetta]. 221132913 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 LOC100215008 235 eukaryota>metazoa>cnidaria Hydra vulgaris PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 2-like [Hydra vulgaris]. 
Bnat1000020289 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Bnat1000020289 95 eukaryota>rhizaria>cercozoa Bigelowiella natans gw1.85.31.1
#; Note: this group shows fusions to amidohydrolase and GST (a new removal pathway?); possibly related to AlkBH2
Bnat1000011351 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Bnat1000011351 425 eukaryota>rhizaria>cercozoa Bigelowiella natans estExt_fgenesh1_pg.C_180193 220976009 TM AlkB-2OGFEDO UPF0029+2OG-FeII_Oxy_2+TM THAPSDRAFT_21832 568 eukaryota>stramenopiles Thalassiosira pseudonana CCMP1335 predicted protein [Thalassiosira pseudonana CCMP1335]. Fcyl1000027737 - AlkB-2OGFEDO UPF0029+2OG-FeII_Oxy_2 Fcyl1000027737 482 eukaryota>stramenopiles Fragilariopsis cylindrus e_gw1.15.516.1 46136713 - AlkB-2OGFEDO Isochorismatase+2OG-FeII_Oxy_2+GST_C_2 FG09872.1 927 eukaryota>fungi>ascomycota Fusarium graminearum PH-1 hypothetical protein FG09872.1 [Fusarium graminearum PH-1]. 116201083 - MIP-T3+AlkB-2OGFEDO Isochorismatase+Herpes_BLLF1+2OG-FeII_Oxy_2 CHGG_08426 990 eukaryota>fungi>ascomycota Chaetomium globosum CBS 148.51 hypothetical protein CHGG_08426 [Chaetomium globosum CBS 148.51]. 85095350 SP AlkB-2OGFEDO SP+Isochorismatase+2OG-FeII_Oxy_2+DUF4358 NCU05807 1166 eukaryota>fungi>ascomycota Neurospora crassa OR74A isochorismatase family protein [Neurospora crassa OR74A]. 67524139 - SWC3+AlkB-2OGFEDO Isochorismatase+2OG-FeII_Oxy_2+GST_C_2 AN2527.2 817 eukaryota>fungi>ascomycota Aspergillus nidulans FGSC A4 hypothetical protein AN2527.2 [Aspergillus nidulans FGSC A4]. 70998909 SP AlkB-2OGFEDO SP+Isochorismatase+2OG-FeII_Oxy_2+GST_C_2 AFUA_3G14500 828 eukaryota>fungi>ascomycota Aspergillus fumigatus Af293 isochorismatase family protein family [Aspergillus fumigatus Af293].
Chet1000005801 - AlkB-2OGFEDO Isochorismatase+2OG-FeII_Oxy_2 Chet1000005801 879 eukaryota>fungi>ascomycota Cochliobolus heterostrophus estExt_Genewise1Plus.C_130166 169622270 - AlkB-2OGFEDO Isochorismatase+Isochorismatase+2OG-FeII_Oxy_2+GST_C_2 SNOG_14354 1122 eukaryota>fungi>ascomycota Phaeosphaeria nodorum SN15 hypothetical protein SNOG_14354 [Phaeosphaeria nodorum SN15]. Abis1000009270 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Abis1000009270 348 eukaryota>fungi>basidiomycota Agaricus bisporus estExt_fgenesh2_pg.C_130160 527304243 - AlkB-2OGFEDO+SbcC 2OG-FeII_Oxy_2 FOMPIDRAFT_1112477 295 eukaryota>fungi>basidiomycota Fomitopsis pinicola FP-58527 SS1 hypothetical protein FOMPIDRAFT_1112477 [Fomitopsis pinicola FP-58527 SS1]. 170104308 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LACBIDRAFT_329183 333 eukaryota>fungi>basidiomycota Laccaria bicolor S238N-H82 predicted protein [Laccaria bicolor S238N-H82]. 116500656 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 CC1G_04807 347 eukaryota>fungi>basidiomycota Coprinopsis cinerea okayama7#130 isochorismatase [Coprinopsis cinerea okayama7#130]. Sarc1000013250 - - FTO_CTD Sarc1000013250 182 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (182 aa)
#; Kinetoplastid-only
528262236 - - 2OG-FeII_Oxy_2 AGDE_05127 309 eukaryota>euglenozoa>kinetoplastida Angomonas deanei hypothetical protein AGDE_05127 [Angomonas deanei]. 528229076 SP - SP+2OG-FeII_Oxy_2 STCU_06661 309 eukaryota>euglenozoa>kinetoplastida Strigomonas culicis hypothetical protein STCU_06661 [Strigomonas culicis]. 594143793 - - 2OG-FeII_Oxy_2 GSHART1_T00002652001 311 eukaryota>euglenozoa>kinetoplastida Phytomonas sp. isolate Hart1 unnamed protein product [Phytomonas sp. isolate Hart1]. 146100771 - - 2OG-FeII_Oxy_2 LINJ_35_1290 318 eukaryota>euglenozoa>kinetoplastida Leishmania infantum JPCM5 conserved hypothetical protein [Leishmania infantum JPCM5].
70905714 - - 2OG-FeII_Oxy_2 LMJ_1059 318 eukaryota>euglenozoa>kinetoplastida Leishmania major strain Friedlin hypothetical protein, conserved [Leishmania major strain Friedlin]. 71423866 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Tc00.1047053508303.40 305 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 71425222 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Tc00.1047053511179.120 305 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 72388954 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Tb927.5.980 305 eukaryota>euglenozoa>kinetoplastida Trypanosoma brucei brucei TREU927 hypothetical protein [Trypanosoma brucei brucei TREU927].
#;
Fcyl1000018322 - - 2OG-FeII_Oxy_2 Fcyl1000018322 418 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.3.1101.1 300259504 SP RAD18+SWC3+AlkB-2OGFEDO+POTRA SP+2OG-FeII_Oxy_2 VOLCADRAFT_119009 753 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_119009 [Volvox carteri f. nagariensis]. 551590411 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EMIHUDRAFT_450175 375 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_450175 [Emiliania huxleyi CCMP1516]. 323449058 SP+TM+TM+TM+TM+TM+TM+TM+TM+TM+TM VgrG+AlkB-2OGFEDO SP+Beta_helix+Beta_helix+TM+TM+TM+TM+TM+TM+TM+TM+Drf_FH1+DUF488+2OG-FeII_Oxy_2+TM+TM AURANDRAFT_66792 2180 eukaryota>stramenopiles Aureococcus anophagefferens hypothetical protein AURANDRAFT_66792 [Aureococcus anophagefferens]. 226520209 SP - SP+2OG-FeII_Oxy_2 MICPUN_62550 353 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. 226458145 - - 2OG-FeII_Oxy_2 MICPUCDRAFT_60114 377 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545].
149262140 - - - LOC100045202 127 eukaryota>metazoa>chordata>vertebrata Mus musculus PREDICTED: hypothetical protein [Mus musculus]. 284091784 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 NAEGRDRAFT_66602 226 eukaryota>heterolobosea Naegleria gruberi predicted protein [Naegleria gruberi]. 485643787 - - 2OG-FeII_Oxy_2 EMIHUDRAFT_251931 163 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_251931 [Emiliania huxleyi CCMP1516].
#; ALKBH7 found in kinetoplastids
326428939 SP POTRA+POTRA SP PTSG_05873 295 eukaryota>choanoflagellida Salpingoeca rosetta hypothetical protein PTSG_05873 [Salpingoeca rosetta]. 163776258 - - - MONBRDRAFT_24773 212 eukaryota>choanoflagellida Monosiga brevicollis MX1 predicted protein [Monosiga brevicollis MX1]. 551558472 - - 2OG-FeII_Oxy_2 EMIHUDRAFT_212276 291 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 alkB, alkylation repair 7 [Emiliania huxleyi CCMP1516]. 551626318 - - 2OG-FeII_Oxy_2 EMIHUDRAFT_251025 188 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_251025 [Emiliania huxleyi CCMP1516].
Uram1000003343 SP - SP+2OG-FeII_Oxy_2 Uram1000003343 258 eukaryota>fungi>mucoromycotina Umbelopsis ramanniana fgenesh1_kg.14_#_839_#_combest_scaffold_14_22505 Crev1000000227 - - - Crev1000000227 323 eukaryota>fungi>kickxellomycotina Coemansia reversa fgenesh1_kg.1_#_172_#_isotig04968 Mver1000004713 SP - SP Mver1000004713 324 eukaryota>fungi>zygomycete Mortierella verticillata Mortierella verticillata NRRL 6337 hypothetical protein (324 aa) Spun1000001767 SP - SP+2OG-FeII_Oxy_2 Spun1000001767 259 eukaryota>fungi>chytridiomycota Spizellomyces punctatus Spizellomyces punctatus DAOM BR117 hypothetical protein (259 aa) Ccor1000005156 - - - Ccor1000005156 247 eukaryota>fungi>entomophthoromycota Conidiobolus coronatus estExt_Genemark1.C_990013 Wseb1000002511 - - 2OG-FeII_Oxy_2 Wseb1000002511 272 eukaryota>fungi>basidiomycota Wallemia sebi gm1.2421_g 527292773 SP - SP+2OG-FeII_Oxy_2 FOMPIDRAFT_130573 247 eukaryota>fungi>basidiomycota Fomitopsis pinicola FP-58527 SS1 hypothetical protein FOMPIDRAFT_130573 [Fomitopsis pinicola FP-58527 SS1]. 164647071 - - 2OG-FeII_Oxy_2 LACBIDRAFT_316029 234 eukaryota>fungi>basidiomycota Laccaria bicolor S238N-H82 predicted protein [Laccaria bicolor S238N-H82]. Abis1000001809 SP - SP+2OG-FeII_Oxy_2 Abis1000001809 248 eukaryota>fungi>basidiomycota Agaricus bisporus estExt_Genewise1.C_21079 220730306 - - 2OG-FeII_Oxy_2 POSPLDRAFT_94548 259 eukaryota>fungi>basidiomycota Postia placenta Mad-698-R predicted protein [Postia placenta Mad-698-R]. 58265654 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 CNC05880 287 eukaryota>fungi>basidiomycota Cryptococcus neoformans var. neoformans JEC21 hypothetical protein CNC05880 [Cryptococcus neoformans var. neoformans JEC21]. 
Falb1000000067 SP - SP Falb1000000067 377 eukaryota>nucleariidae_and_fonticula Fonticula alba Fonticula alba ATCC 38817 (V2) hypothetical protein (377 aa) Amac1000008462 SP - SP+2OG-FeII_Oxy_2 Amac1000008462 281 eukaryota>fungi>blastocladiomycota Allomyces macrogynus Allomyces macrogynus ATCC 38327 hypothetical protein (281 aa) 320163657 SP - SP+2OG-FeII_Oxy_2 CAOG_01081 278 eukaryota Capsaspora owczarzaki ATCC 30864 hypothetical protein CAOG_01081 [Capsaspora owczarzaki ATCC 30864]. 71661420 - - - Tc00.1047053511211.100 280 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 74025276 SP - SP Tb11.01.3200 272 eukaryota>euglenozoa>kinetoplastida Trypanosoma brucei brucei TREU927 hypothetical protein [Trypanosoma brucei brucei TREU927]. 528238560 SP - SP STCU_04463 271 eukaryota>euglenozoa>kinetoplastida Strigomonas culicis alkylated DNA repair protein alkB like protein 7 [Strigomonas culicis]. 528226932 SP - SP AGDE_12223 300 eukaryota>euglenozoa>kinetoplastida Angomonas deanei alkylated DNA repair protein alkB like protein 7 [Angomonas deanei]. 528261529 SP - SP AGDE_05364 266 eukaryota>euglenozoa>kinetoplastida Angomonas deanei alkylated DNA repair protein alkB like protein 7 [Angomonas deanei]. 528264616 SP - SP AGDE_04331 266 eukaryota>euglenozoa>kinetoplastida Angomonas deanei alkylated DNA repair protein alkB like protein 7 [Angomonas deanei]. 157872016 SP - SP LMJF_28_2710 286 eukaryota>euglenozoa>kinetoplastida Leishmania major strain Friedlin conserved hypothetical protein [Leishmania major strain Friedlin]. 146092519 SP - SP LINJ_28_2910 286 eukaryota>euglenozoa>kinetoplastida Leishmania infantum JPCM5 conserved hypothetical protein [Leishmania infantum JPCM5]. 594147457 - - - GSHART1_T00005476001 300 eukaryota>euglenozoa>kinetoplastida Phytomonas sp. isolate Hart1 unnamed protein product [Phytomonas sp. isolate Hart1]. 
110758828 - - - LOC411448 185 eukaryota>metazoa>hexapoda Apis mellifera PREDICTED: similar to Y46G5A.35 [Apis mellifera]. 307186823 - - 2OG-FeII_Oxy_2 EAG_04133 241 eukaryota>metazoa>hexapoda Camponotus floridanus Alkylated DNA repair protein alkB-like protein 7 [Camponotus floridanus]. 307191857 SP - SP+2OG-FeII_Oxy_2 EAI_03104 229 eukaryota>metazoa>hexapoda Harpegnathos saltator Alkylated DNA repair protein alkB-like protein 7 [Harpegnathos saltator]. 238661944 SP SIN18+SIN3A SP+SAP18 Smp_150930 600 eukaryota>metazoa Schistosoma mansoni conserved hypothetical protein [Schistosoma mansoni]. 238652610 - - - Smp_194350 160 eukaryota>metazoa Schistosoma mansoni conserved hypothetical protein, partial [Schistosoma mansoni]. 674575674 SP - SP EMUJ_000641600 283 eukaryota>metazoa Echinococcus multilocularis alpha ketoglutarate dependent [Echinococcus multilocularis]. 674567274 SP IF2-HTH SP EgrG_000641600 283 eukaryota>metazoa Echinococcus granulosus alpha ketoglutarate dependent [Echinococcus granulosus]. 576699638 - IF2-HTH Pkinase_Tyr EGR_01968 605 eukaryota>metazoa Echinococcus granulosus Alkylated DNA repair protein AlkB [Echinococcus granulosus]. 674595975 SP - SP HmN_000005200 264 eukaryota>metazoa Hymenolepis microstoma alpha ketoglutarate dependent [Hymenolepis microstoma]. 193681067 - - - LOC100161950 373 eukaryota>metazoa>hexapoda Acyrthosiphon pisum PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum]. 291224759 SP - SP+2OG-FeII_Oxy_2 LOC100372060 245 eukaryota>metazoa>hemichordata Saccoglossus kowalevskii PREDICTED: spermatogenesis associated 11-like [Saccoglossus kowalevskii]. 
Smar1000006659 - - 2OG-FeII_Oxy_2 Smar1000006659 178 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR007432-PA pep:novel scaffold:Smar1:JH431789:114970:119470:-1 gene:SMAR007432 transcript:SMAR007432-RA Lgig1000004215 - - 2OG-FeII_Oxy_2 Lgig1000004215 176 eukaryota>metazoa>mollusca Lottia gigantea e_gw1.28.427.1 189239805 SP - SP+2OG-FeII_Oxy_2 LOC100141950 230 eukaryota>metazoa>hexapoda Tribolium castaneum PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial isoform X1 [Tribolium castaneum]. Caps1000000823 SP - SP+2OG-FeII_Oxy_2 Caps1000000823 234 eukaryota>metazoa>annelida Capitella spI estExt_Genewise1.C_120131 71998257 - - 2OG-FeII_Oxy_2 CELE_Y46G5A.35 227 eukaryota>metazoa>nematoda Caenorhabditis elegans Y46G5A.35 [Caenorhabditis elegans]. 170571892 - - 2OG-FeII_Oxy_2 Bm1_02010 212 eukaryota>metazoa>nematoda Brugia malayi spermatogenesis associated 11 [Brugia malayi]. Aque1000017310 SP - SP+2OG-FeII_Oxy_2 Aque1000017310 265 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.217578 196010015 - - 2OG-FeII_Oxy_2 TRIADDRAFT_28544 211 eukaryota>metazoa>placozoa Trichoplax adhaerens hypothetical protein TRIADDRAFT_28544 [Trichoplax adhaerens]. 156217303 - - 2OG-FeII_Oxy_2 NEMVEDRAFT_v1g114209 185 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein, partial [Nematostella vectensis]. 198434437 SP - SP+2OG-FeII_Oxy_2 LOC100180366 264 eukaryota>metazoa>chordata Ciona intestinalis PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial-like [Ciona intestinalis]. 327264013 SP - SP+2OG-FeII_Oxy_2 alkbh7 222 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial [Anolis carolinensis]. 109513091 SP - SP+2OG-FeII_Oxy_2 LOC679944 221 eukaryota>metazoa>chordata>vertebrata Rattus norvegicus PREDICTED: similar to spermatogenesis associated 11 isoform 2 [Rattus norvegicus]. 
109486536 SP SHS2 SP+2OG-FeII_Oxy_2 LOC681562 163 eukaryota>metazoa>chordata>vertebrata Rattus norvegicus PREDICTED: similar to spermatogenesis associated 11 isoform 1 [Rattus norvegicus]. 21313470 SP - SP+2OG-FeII_Oxy_2 Alkbh7 221 eukaryota>metazoa>chordata>vertebrata Mus musculus alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial isoform 1 precursor [Mus musculus]. 114674893 - - 2OG-FeII_Oxy_2 ALKBH7 221 eukaryota>metazoa>chordata>vertebrata Pan troglodytes PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial isoform X2 [Pan troglodytes]. 14150066 - - 2OG-FeII_Oxy_2 ALKBH7 221 eukaryota>metazoa>chordata>vertebrata Homo sapiens alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial precursor [Homo sapiens]. 47214908 SP - SP GSTEN:00023729:G:001 228 eukaryota>metazoa>chordata>vertebrata>actinopterygii Tetraodon nigroviridis unnamed protein product [Tetraodon nigroviridis]. 62955187 - - 2OG-FeII_Oxy_2 alkbh7 233 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio alpha-ketoglutarate-dependent dioxygenase alkB homolog 7, mitochondrial [Danio rerio]. 321472218 - - - DAPPUDRAFT_100794 182 eukaryota>metazoa>crustacea Daphnia pulex hypothetical protein DAPPUDRAFT_100794 [Daphnia pulex]. 115894368 SP - SP+2OG-FeII_Oxy_2 LOC757284 203 eukaryota>metazoa>echinodermata Strongylocentrotus purpuratus PREDICTED: similar to AlkB, alkylation repair homolog 7 (E. coli), partial [Strongylocentrotus purpuratus]. 210122389 - - 2OG-FeII_Oxy_2 BRAFLDRAFT_262579 181 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_262579 [Branchiostoma floridae]. 219443919 - - 2OG-FeII_Oxy_2 BRAFLDRAFT_266169 181 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_266169 [Branchiostoma floridae]. 
Hrob1000010803 - - - Hrob1000010803 205 eukaryota>metazoa>annelida Helobdella robusta 76748 85726474 - - - Dmel_CG14130 255 eukaryota>metazoa>hexapoda Drosophila melanogaster CG14130 [Drosophila melanogaster]. 118781137 - - - AgaP_AGAP000760 258 eukaryota>metazoa>hexapoda Anopheles gambiae str. PEST AGAP000760-PA [Anopheles gambiae str. PEST]. Psoj1000016465 SP - SP+2OG-FeII_Oxy_2 Psoj1000016465 247 eukaryota>stramenopiles Phytophthora sojae 144652 Psoj1000016470 METHYLASE METHYLASE Methyltransf_16+2OG-FeII_Oxy_2 Psoj1000016470 361 eukaryota>stramenopiles Phytophthora sojae 144657 Pram1000000142 SP - SP+2OG-FeII_Oxy_2 Pram1000000142 245 eukaryota>stramenopiles Phytophthora ramorum 84939 262110203 SP - SP PITG_04676 246 eukaryota>stramenopiles Phytophthora infestans T30-4 conserved hypothetical protein [Phytophthora infestans T30-4]. 220978082 - - Urocanase+2OG-FeII_Oxy_2 THAPSDRAFT_31380 177 eukaryota>stramenopiles Thalassiosira pseudonana CCMP1335 predicted protein, partial [Thalassiosira pseudonana CCMP1335]. Fcyl1000050013 - - - Fcyl1000050013 179 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.2.1691.1 Fcyl1000106316 - - - Fcyl1000106316 280 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.2.998.1 Fcyl1000016334 TM - TM Fcyl1000016334 354 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.2_#_308 217403334 SP - SP PHATRDRAFT_50302 249 eukaryota>stramenopiles Phaeodactylum tricornutum CCAP 1055/1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1].
#; ALKBH4 found in kinetoplastids
403343479 - - 2OG-FeII_Oxy OXYTRI_08063 342 eukaryota>alveolata>ciliophora Oxytricha trifallax hypothetical protein OXYTRI_08063 (macronuclear) [Oxytricha trifallax]. 403359506 - - - OXYTRI_23312 335 eukaryota>alveolata>ciliophora Oxytricha trifallax hypothetical protein OXYTRI_23312 (macronuclear) [Oxytricha trifallax].
Caps1000011463 - - - Caps1000011463 319 eukaryota>metazoa>annelida Capitella spI estExt_Genewise1.C_2310033 219504515 - - - BRAFLDRAFT_256230 304 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_256230 [Branchiostoma floridae]. 210116659 - AlkB-2OGFEDO - BRAFLDRAFT_219277 303 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_219277 [Branchiostoma floridae]. 156226754 - PLC 2OG-FeII_Oxy_2 NEMVEDRAFT_v1g34785 267 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein, partial [Nematostella vectensis]. Lgig1000007221 - - 2OG-FeII_Oxy Lgig1000007221 273 eukaryota>metazoa>mollusca Lottia gigantea e_gw1.180.14.1 Adig1000021750 - - 2OG-FeII_Oxy_2 Adig1000021750 294 eukaryota>cnidaria Acropora digitifera adi_v1.02479 291242331 - Nimm60 2OG-FeII_Oxy_2 LOC100367945 270 eukaryota>metazoa>hemichordata Saccoglossus kowalevskii PREDICTED: alkB, alkylation repair homolog 4-like [Saccoglossus kowalevskii]. 221106579 - - - LOC100204004 272 eukaryota>metazoa>cnidaria Hydra vulgaris PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4-like [Hydra vulgaris]. 326433577 - - 2OG-FeII_Oxy_2 PTSG_09879 270 eukaryota>choanoflagellida Salpingoeca rosetta hypothetical protein PTSG_09879 [Salpingoeca rosetta]. 167524358 - - - MONBRDRAFT_26078 207 eukaryota>choanoflagellida Monosiga brevicollis MX1 hypothetical protein [Monosiga brevicollis MX1]. 557636210 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 HmN_000938900 328 eukaryota>metazoa Hymenolepis microstoma alpha ketoglutarate dependent [Hymenolepis microstoma]. Aque1000014704 - - 2OG-FeII_Oxy Aque1000014704 284 eukaryota>metazoa>porifera Amphimedon queenslandica Aqu1.214972 198434600 - AlkB-2OGFEDO - LOC100186431 303 eukaryota>metazoa>chordata Ciona intestinalis PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4-like [Ciona intestinalis]. 
91087937 - AlkB-2OGFEDO - LOC660615 306 eukaryota>metazoa>hexapoda Tribolium castaneum PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Tribolium castaneum]. 158298439 SP - SP AgaP_AGAP009588 315 eukaryota>metazoa>hexapoda Anopheles gambiae str. PEST AGAP009588-PA [Anopheles gambiae str. PEST]. 156547970 - AlkB-2OGFEDO - LOC100121480 291 eukaryota>metazoa>hexapoda Nasonia vitripennis PREDICTED: similar to ENSANGP00000020936 [Nasonia vitripennis]. 66554392 - - - LOC551104 296 eukaryota>metazoa>hexapoda Apis mellifera PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4-like [Apis mellifera]. 307200597 - - - EAI_15196 240 eukaryota>metazoa>hexapoda Harpegnathos saltator Alkylated DNA repair protein alkB-like protein 4 [Harpegnathos saltator]. 307183142 - - - EAG_14740 298 eukaryota>metazoa>hexapoda Camponotus floridanus Alkylated DNA repair protein alkB-like protein 4 [Camponotus floridanus]. 321472550 SP - SP+2OG-FeII_Oxy DAPPUDRAFT_48015 293 eukaryota>metazoa>crustacea Daphnia pulex hypothetical protein DAPPUDRAFT_48015 [Daphnia pulex]. 24583140 - - - Dmel_CG4036 304 eukaryota>metazoa>hexapoda Drosophila melanogaster CG4036, isoform A [Drosophila melanogaster]. 193599006 - - 2OG-FeII_Oxy LOC100167916 299 eukaryota>metazoa>hexapoda Acyrthosiphon pisum PREDICTED: similar to conserved hypothetical protein [Acyrthosiphon pisum]. 224076156 - - 2OG-FeII_Oxy LOC100229056 389 eukaryota>metazoa>chordata>vertebrata Taeniopygia guttata PREDICTED: alkB, alkylation repair homolog 4 (E. coli) [Taeniopygia guttata]. 326931240 - - - LOC100543581 289 eukaryota>metazoa>chordata>vertebrata Meleagris gallopavo PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH4-like [Meleagris gallopavo]. 118100089 - - HpaP ALKBH4 486 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: hypothetical protein [Gallus gallus]. 
8923019 - - - ALKBH4 302 eukaryota>metazoa>chordata>vertebrata Homo sapiens alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Homo sapiens]. 110625894 - - - Alkbh4 215 eukaryota>metazoa>chordata>vertebrata Mus musculus alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Mus musculus]. 109497175 - - - Alkbh4_predicted 301 eukaryota>metazoa>chordata>vertebrata Rattus norvegicus PREDICTED: similar to CG4036-PA [Rattus norvegicus]. 68372246 SP - SP+2OG-FeII_Oxy_2 alkbh4 315 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 isoform X2 [Danio rerio]. 25148697 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 CELE_F09F7.7 291 eukaryota>metazoa>nematoda Caenorhabditis elegans F09F7.7, isoform a [Caenorhabditis elegans]. 170579486 - - - Bm1_16965 297 eukaryota>metazoa>nematoda Brugia malayi LD42289p [Brugia malayi]. 239897326 - - 2OG-FeII_Oxy_2 Pmar_PMAR003551 252 eukaryota>alveolata Perkinsus marinus ATCC 50983 conserved hypothetical protein [Perkinsus marinus ATCC 50983]. 71747664 - - 2OG-FeII_Oxy_2 Tb10.70.0360 304 eukaryota>euglenozoa>kinetoplastida Trypanosoma brucei brucei TREU927 hypothetical protein [Trypanosoma brucei brucei TREU927]. 71659461 - - 2OG-FeII_Oxy Tc00.1047053510187.490 304 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 146104297 - - 2OG-FeII_Oxy_2 LINJ_36_2080 297 eukaryota>euglenozoa>kinetoplastida Leishmania infantum JPCM5 conserved hypothetical protein [Leishmania infantum JPCM5]. 157876860 - - 2OG-FeII_Oxy_2 LMJF_36_1970 297 eukaryota>euglenozoa>kinetoplastida Leishmania major strain Friedlin conserved hypothetical protein [Leishmania major strain Friedlin]. 528254392 - - 2OG-FeII_Oxy_2 STCU_00887 304 eukaryota>euglenozoa>kinetoplastida Strigomonas culicis alkylated DNA repair protein alkB like protein 4 [Strigomonas culicis]. 
528266291 - - 2OG-FeII_Oxy_2 AGDE_03849 297 eukaryota>euglenozoa>kinetoplastida Angomonas deanei alkylated DNA repair protein alkB like protein 4 [Angomonas deanei]. 528238215 - - 2OG-FeII_Oxy_2 AGDE_09971 297 eukaryota>euglenozoa>kinetoplastida Angomonas deanei alkylated DNA repair protein alkB like protein 4 [Angomonas deanei]. 528261759 SP - SP+2OG-FeII_Oxy_2 AGDE_05286 210 eukaryota>euglenozoa>kinetoplastida Angomonas deanei alkylated DNA repair protein alkB like protein 4 [Angomonas deanei]. 672578582 TGMAS_246140 1030 eukaryota>alveolata>apicomplexa Toxoplasma gondii MAS hypothetical protein TGMAS_246140 [Toxoplasma gondii MAS]. 523576915 TGGT1_246140 1030 eukaryota>alveolata>apicomplexa Toxoplasma gondii GT1 hypothetical protein TGGT1_246140 [Toxoplasma gondii GT1]. 672285053 TGFOU_246140 1030 eukaryota>alveolata>apicomplexa Toxoplasma gondii FOU hypothetical protein TGFOU_246140 [Toxoplasma gondii FOU]. 672573839 TGVAND_246140 1030 eukaryota>alveolata>apicomplexa Toxoplasma gondii VAND hypothetical protein TGVAND_246140 [Toxoplasma gondii VAND]. 672301308 TGRUB_246140 1030 eukaryota>alveolata>apicomplexa Toxoplasma gondii RUB hypothetical protein TGRUB_246140 [Toxoplasma gondii RUB]. 675123610 HHA_246140 803 eukaryota>alveolata>apicomplexa Hammondia hammondi hypothetical protein HHA_246140 [Hammondia hammondi]. 557738285 TGVEG_246140 1033 eukaryota>alveolata>apicomplexa Toxoplasma gondii VEG hypothetical protein TGVEG_246140 [Toxoplasma gondii VEG]. 237835449 TGME49_046140 1033 eukaryota>alveolata>apicomplexa Toxoplasma gondii ME49 hypothetical protein TGME49_046140 [Toxoplasma gondii ME49]. 672276401 TGP89_246140 1030 eukaryota>alveolata>apicomplexa Toxoplasma gondii p89 hypothetical protein TGP89_246140 [Toxoplasma gondii p89]. 
#; Basal Fungal-only
Bden1000004472 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Bden1000004472 226 eukaryota>fungi>chytridiomycota Batrachochytrium dendrobatidis Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (226 aa) Spun1000008764 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Spun1000008764 195 eukaryota>fungi>chytridiomycota Spizellomyces punctatus Spizellomyces punctatus DAOM BR117 hypothetical protein (195 aa) Uram1000007702 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Uram1000007702 198 eukaryota>fungi>mucoromycotina Umbelopsis ramanniana fgenesh1_kg.54_#_164_#_combest_scaffold_54_109393 Mcir1000007727 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Mcir1000007727 211 eukaryota>fungi>basal Mucor circinelloides Mucci1.fgeneshMC_pg.8_#_475 Bcir1000007546 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Bcir1000007546 206 eukaryota>fungi>mucoromycotina Backusella circina fgenesh1_kg.13_#_14_#_Locus8775v1rpkm8.87 Lhya1000001028 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Lhya1000001028 218 eukaryota>fungi>mucoromycotina Lichtheimia hyalospora estExt_fgenesh1_pg.C_70030 Mver1000006163 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Mver1000006163 222 eukaryota>fungi>zygomycete Mortierella verticillata Mortierella verticillata NRRL 6337 hypothetical protein (222 aa) Ccor1000007017 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Ccor1000007017 226 eukaryota>fungi>entomophthoromycota Conidiobolus coronatus fgenesh1_pg.176_#_7
#; RNA modifying?
284087788 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 NAEGRDRAFT_58773 259 eukaryota>heterolobosea Naegleria gruberi hypothetical protein NAEGRDRAFT_58773 [Naegleria gruberi]. Ttra1000007412 - AlkB-2OGFEDO+KH 2OG-FeII_Oxy_2 Ttra1000007412 330 eukaryota>apusozoa Thecamonas trahens Thecamonas trahens ATCC 50062 2OG-Fe(II) oxygenase (330 aa)
#; AlkBH6-like?
325078238 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 DICPUDRAFT_99071 256 eukaryota>amoebozoa>mycetozoa>dictyosteliida Dictyostelium purpureum hypothetical protein DICPUDRAFT_99071 [Dictyostelium purpureum].
66800191 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 DDB_G0293582 247 eukaryota>amoebozoa>mycetozoa>dictyosteliida Dictyostelium discoideum AX4 2-oxoglutarate and Fe-dependent oxygenase family protein [Dictyostelium discoideum AX4]. 284085583 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 NAEGRDRAFT_72926 288 eukaryota>heterolobosea Naegleria gruberi predicted protein [Naegleria gruberi]. Rall1000004359 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Rall1000004359 279 eukaryota>fungi>cryptomycota Rozella allomycis O9G_000840m.01 284081917 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 NAEGRDRAFT_54771 279 eukaryota>heterolobosea Naegleria gruberi predicted protein [Naegleria gruberi]. 284094106 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 NAEGRDRAFT_64270 279 eukaryota>heterolobosea Naegleria gruberi predicted protein [Naegleria gruberi]. 89302339 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 TTHERM_00219000 254 eukaryota>alveolata>ciliophora Tetrahymena thermophila SB210 2OG-Fe(II) oxygenase family oxidoreductase (macronuclear) [Tetrahymena thermophila SB210]. Ttra1000001477 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Ttra1000001477 253 eukaryota>apusozoa Thecamonas trahens Thecamonas trahens ATCC 50062 2OG-Fe(II) oxygenase family Oxidoreductase (253 aa)
#; ALKBH6 in kinetoplastids
Sarc1000005302 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Sarc1000005302 256 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (256 aa) 109148544 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 ALKBH6 266 eukaryota>metazoa>chordata>vertebrata Homo sapiens alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 isoform 2 [Homo sapiens]. 34855673 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC292780 238 eukaryota>metazoa>chordata>vertebrata Rattus norvegicus PREDICTED: similar to calpain, small subunit 1 [Rattus norvegicus]. 38569508 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Alkbh6 238 eukaryota>metazoa>chordata>vertebrata Mus musculus alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 [Mus musculus].
320169428 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 CAOG_04295 256 eukaryota Capsaspora owczarzaki ATCC 30864 calcium-dependent cysteine protease [Capsaspora owczarzaki ATCC 30864]. 674588781 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 HmN_000423000 268 eukaryota>metazoa Hymenolepis microstoma nucleic acid binding [Hymenolepis microstoma]. 321463592 - AlkB-2OGFEDO+SWC3 2OG-FeII_Oxy_2 DAPPUDRAFT_307189 211 eukaryota>metazoa>crustacea Daphnia pulex hypothetical protein DAPPUDRAFT_307189 [Daphnia pulex]. Caps1000008447 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Caps1000008447 215 eukaryota>metazoa>annelida Capitella spI e_gw1.16.157.1 47196062 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 GSTEN:00003217:G:001 234 eukaryota>metazoa>chordata>vertebrata>actinopterygii Tetraodon nigroviridis unnamed protein product, partial [Tetraodon nigroviridis]. 53292605 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 alkbh6 234 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 [Danio rerio]. 158288561 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AgaP_AGAP003866 227 eukaryota>metazoa>hexapoda Anopheles gambiae str. PEST AGAP003866-PA [Anopheles gambiae str. PEST]. 193718445 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC100164195 230 eukaryota>metazoa>hexapoda Acyrthosiphon pisum PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 [Acyrthosiphon pisum]. 91076692 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC660463 215 eukaryota>metazoa>hexapoda Tribolium castaneum PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 [Tribolium castaneum]. 85726418 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Dmel_CG6144 228 eukaryota>metazoa>hexapoda Drosophila melanogaster CG6144, isoform C [Drosophila melanogaster]. 115960280 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC585652 245 eukaryota>metazoa>echinodermata Strongylocentrotus purpuratus PREDICTED: hypothetical protein [Strongylocentrotus purpuratus]. 
156544714 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC100122093 231 eukaryota>metazoa>hexapoda Nasonia vitripennis PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 6 [Nasonia vitripennis]. 110758548 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC552732 221 eukaryota>metazoa>hexapoda Apis mellifera PREDICTED: similar to calpain, small subunit 1 [Apis mellifera]. 196006752 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 TRIADDRAFT_57191 232 eukaryota>metazoa>placozoa Trichoplax adhaerens hypothetical protein TRIADDRAFT_57191 [Trichoplax adhaerens]. 156215604 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 NEMVEDRAFT_v1g119331 234 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein [Nematostella vectensis]. 238652512 - - 2OG-FeII_Oxy_2 Smp_120440.1 257 eukaryota>metazoa Schistosoma mansoni nucleic acid binding, putative [Schistosoma mansoni]. 291241873 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC100369792 243 eukaryota>metazoa>hemichordata Saccoglossus kowalevskii PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 6-like [Saccoglossus kowalevskii]. 210129621 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 BRAFLDRAFT_202284 231 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_202284 [Branchiostoma floridae]. 219489319 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 BRAFLDRAFT_244728 231 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_244728 [Branchiostoma floridae]. Lgig1000010006 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Lgig1000010006 228 eukaryota>metazoa>mollusca Lottia gigantea fgenesh2_pg.C_sca_12000135 168057031 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PHYPADRAFT_148636 258 eukaryota>viridiplantae Physcomitrella patens predicted protein, partial [Physcomitrella patens]. 302823387 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SELMODRAFT_136984 229 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_136984 [Selaginella moellendorffii]. 
302781915 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SELMODRAFT_98137 231 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_98137 [Selaginella moellendorffii]. Hrob1000017985 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Hrob1000017985 223 eukaryota>metazoa>annelida Helobdella robusta 86209 325075616 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 DICPUDRAFT_21733 192 eukaryota>amoebozoa>mycetozoa>dictyosteliida Dictyostelium purpureum hypothetical protein DICPUDRAFT_21733, partial [Dictyostelium purpureum]. 735994710 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SAMD00019534_035740 266 eukaryota>amoebozoa>mycetozoa>dictyosteliida Acytostelium subglobosum LB1 hypothetical protein SAMD00019534_035740 [Acytostelium subglobosum LB1]. 281203011 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PPL_12421 251 eukaryota>amoebozoa>mycetozoa>dictyosteliida Polysphondylium pallidum PN500 hypothetical protein PPL_12421 [Polysphondylium pallidum PN500]. 545710109 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Gasu_18590 268 eukaryota>rhodophyta Galdieria sulphuraria hypothetical protein Gasu_18590 [Galdieria sulphuraria]. Mver1000002798 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Mver1000002798 245 eukaryota>fungi>zygomycete Mortierella verticillata Mortierella verticillata NRRL 6337 hypothetical protein (245 aa) Uram1000003236 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Uram1000003236 230 eukaryota>fungi>mucoromycotina Umbelopsis ramanniana fgenesh1_kg.14_#_327_#_combest_scaffold_14_20870 Spun1000005556 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Spun1000005556 222 eukaryota>fungi>chytridiomycota Spizellomyces punctatus Spizellomyces punctatus DAOM BR117 hypothetical protein (222 aa) 111056864 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SNOG_14792 238 eukaryota>fungi>ascomycota Phaeosphaeria nodorum SN15 hypothetical protein SNOG_14792 [Phaeosphaeria nodorum SN15]. 
Pram1000009643 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Pram1000009643 198 eukaryota>stramenopiles Phytophthora ramorum 73389 262101095 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PITG_11617 231 eukaryota>stramenopiles Phytophthora infestans T30-4 alkylated DNA repair protein alkB [Phytophthora infestans T30-4]. Psoj1000004970 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Psoj1000004970 281 eukaryota>stramenopiles Phytophthora sojae 131422 71409378 SP AlkB-2OGFEDO+DNAA-HTH SP Tc00.1047053503971.10 638 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 528222890 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2+Myosin_tail_1 STCU_07865 661 eukaryota>euglenozoa>kinetoplastida Strigomonas culicis alkylated DNA repair protein alkB like protein 6 [Strigomonas culicis]. 146102795 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LINJ_36_5180 715 eukaryota>euglenozoa>kinetoplastida Leishmania infantum JPCM5 conserved hypothetical protein [Leishmania infantum JPCM5]. 157877528 SP AlkB-2OGFEDO SP LMJF_36_4950 716 eukaryota>euglenozoa>kinetoplastida Leishmania major strain Friedlin conserved hypothetical protein [Leishmania major strain Friedlin]. 
Ttra1000003432 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Ttra1000003432 269 eukaryota>apusozoa Thecamonas trahens Thecamonas trahens ATCC 50062 alkylated DNA repair protein alkB (269 aa) #;AT4G02485-like Uram1000001443 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Uram1000001443 252 eukaryota>fungi>mucoromycotina Umbelopsis ramanniana fgenesh1_kg.5_#_553_#_combest_scaffold_5_102347 Pbla1000008773 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Pbla1000008773 238 eukaryota>fungi>basal Phycomyces blakesleeanus fgeneshPB_pg.23__224 Lhya1000000544 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Lhya1000000544 211 eukaryota>fungi>mucoromycotina Lichtheimia hyalospora estExt_Genemark1.C_30112 Bcir1000016642 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Bcir1000016642 204 eukaryota>fungi>mucoromycotina Backusella circina estExt_Genewise1Plus.C_7650003 Mcir1000005378 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Mcir1000005378 234 eukaryota>fungi>basal Mucor circinelloides Mucci1.e_gw1.4.913.1 302757747 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SELMODRAFT_165238 225 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_165238 [Selaginella moellendorffii]. 302763591 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SELMODRAFT_230539 207 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_230539 [Selaginella moellendorffii]. 162692790 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PHYPADRAFT_26843 198 eukaryota>viridiplantae Physcomitrella patens predicted protein, partial [Physcomitrella patens]. 18411957 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AT4G02485 226 eukaryota>viridiplantae Arabidopsis thaliana oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana]. 320166009 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 CAOG_08040 304 eukaryota Capsaspora owczarzaki ATCC 30864 hypothetical protein CAOG_08040 [Capsaspora owczarzaki ATCC 30864]. 119358859 - AlkB-2OGFEDO HEAT_EZ+2OG-FeII_Oxy_2 Ot02g04110 494 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri SelMay undefined product (IC) [Ostreococcus tauri]. 
226521158 - - 2OG-FeII_Oxy_2 MICPUN_63915 319 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. 303286859 SP - SP+2OG-FeII_Oxy_2 MICPUCDRAFT_42311 244 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. #; Related to above? 167521620 SP - SP+2OG-FeII_Oxy_2 MONBRDRAFT_24779 474 eukaryota>choanoflagellida Monosiga brevicollis MX1 hypothetical protein [Monosiga brevicollis MX1]. #; ALKBH8 189530001 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 LOC556362 657 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: similar to alkB, alkylation repair homolog 8 [Danio rerio]. Ttra1000009841 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Ttra1000009841 232 eukaryota>apusozoa Thecamonas trahens Thecamonas trahens ATCC 50062 hypothetical protein (232 aa) 89299232 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 TTHERM_00483520 199 eukaryota>alveolata>ciliophora Tetrahymena thermophila SB210 2OG-Fe(II) oxygenase family oxidoreductase (macronuclear) [Tetrahymena thermophila SB210]. 198432246 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 LOC100183670 593 eukaryota>metazoa>chordata Ciona intestinalis PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Ciona intestinalis]. 20270315 - RRM DUF1891+RRM_1 ALKBH8 238 eukaryota>metazoa>chordata>vertebrata Homo sapiens alkB, alkylation repair homolog 8 [Homo sapiens]. 114565310 - - - LOC736490 220 eukaryota>metazoa>chordata>vertebrata Pan troglodytes PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Pan troglodytes]. 169162451 - - - LOC646804 220 eukaryota>metazoa>chordata>vertebrata Homo sapiens PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Homo sapiens]. 569418308 - IG+AlkB-2OGFEDO PCEMA1+2OG-FeII_Oxy_2 RFI_09511 706 eukaryota Reticulomyxa filosa hypothetical protein RFI_09511 [Reticulomyxa filosa]. 
209556865 - RRM+AlkB-2OGFEDO RRM_5+2OG-FeII_Oxy_2 CMU_032950 332 eukaryota>alveolata>apicomplexa Cryptosporidium muris RN66 oxidoreductase, 2og-Fe(II) oxygenase family protein [Cryptosporidium muris RN66]. 46229757 - RRM+AlkB-2OGFEDO RRM_5+2OG-FeII_Oxy_2 cgd7_1000 350 eukaryota>alveolata>apicomplexa Cryptosporidium parvum Iowa II F27M3_19 plant like RRM plus AlkB domain containing protein [Cryptosporidium parvum Iowa II]. 255086679 - RRM+AlkB-2OGFEDO FSH1+RRM_5+2OG-FeII_Oxy_2 MICPUN_86885 418 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. 116060339 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Ot11g00700 232 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri 2-Oxoglutarate-and iron-dependent dioxygenase-related proteins (ISS) [Ostreococcus tauri]. 302797440 - RRM+AlkB-2OGFEDO 2OG-FeII_Oxy_2 SELMODRAFT_112315 315 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_112315, partial [Selaginella moellendorffii]. 302758364 - RRM+AlkB-2OGFEDO 2OG-FeII_Oxy_2 SELMODRAFT_78643 315 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_78643, partial [Selaginella moellendorffii]. 42571711 - RRM+AlkB-2OGFEDO RRM_5+2OG-FeII_Oxy_2 AT1G31600 431 eukaryota>viridiplantae Arabidopsis thaliana tRNA methyltransferase 9 [Arabidopsis thaliana]. 42571709 - RRM+AlkB-2OGFEDO RRM_5+2OG-FeII_Oxy_2 AT1G31600 344 eukaryota>viridiplantae Arabidopsis thaliana tRNA methyltransferase 9 [Arabidopsis thaliana]. 168033740 - RRM+AlkB-2OGFEDO RRM_1+2OG-FeII_Oxy_2 PHYPADRAFT_15669 334 eukaryota>viridiplantae Physcomitrella patens predicted protein, partial [Physcomitrella patens]. 303284329 - RRM+AlkB-2OGFEDO RRM_5+2OG-FeII_Oxy_2 MICPUCDRAFT_41620 408 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 
239884096 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Pmar_PMAR017769 325 eukaryota>alveolata Perkinsus marinus ATCC 50983 conserved hypothetical protein [Perkinsus marinus ATCC 50983]. 551578395 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EMIHUDRAFT_240478 226 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_240478 [Emiliania huxleyi CCMP1516]. 72390966 - - 2OG-FeII_Oxy_2 Tb927.7.1530 454 eukaryota>euglenozoa>kinetoplastida Trypanosoma brucei brucei TREU927 hypothetical protein [Trypanosoma brucei brucei TREU927]. 71423311 - - 2OG-FeII_Oxy_2 Tc00.1047053503579.130 426 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 71418255 - - 2OG-FeII_Oxy_2 Tc00.1047053507517.110 426 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 157870995 - - 2OG-FeII_Oxy_2 LMJF_26_0400 562 eukaryota>euglenozoa>kinetoplastida Leishmania major strain Friedlin conserved hypothetical protein [Leishmania major strain Friedlin]. 146089452 - Nimm73 2OG-FeII_Oxy_2 LINJ_26_0390 563 eukaryota>euglenozoa>kinetoplastida Leishmania infantum JPCM5 conserved hypothetical protein [Leishmania infantum JPCM5]. 528216286 SP - SP+2OG-FeII_Oxy_2 STCU_09264 424 eukaryota>euglenozoa>kinetoplastida Strigomonas culicis hypothetical protein STCU_09264 [Strigomonas culicis]. Ccor1000000123 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Ccor1000000123 336 eukaryota>fungi>entomophthoromycota Conidiobolus coronatus fgenesh1_pg.3_#_43 85090541 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 NCU09959 290 eukaryota>fungi>ascomycota Neurospora crassa OR74A hypothetical protein NCU09959 [Neurospora crassa OR74A]. 
Amac1000013082 SP RRM+AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Amac1000013082 375 eukaryota>fungi>blastocladiomycota Allomyces macrogynus Allomyces macrogynus ATCC 38327 hypothetical protein (375 aa) Spun1000006493 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Spun1000006493 383 eukaryota>fungi>chytridiomycota Spizellomyces punctatus Spizellomyces punctatus DAOM BR117 hypothetical protein (383 aa) Bden1000007191 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Bden1000007191 367 eukaryota>fungi>chytridiomycota Batrachochytrium dendrobatidis Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (367 aa) 284094307 - AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 NAEGRDRAFT_958 460 eukaryota>heterolobosea Naegleria gruberi predicted protein, partial [Naegleria gruberi]. 403335499 - AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 OXYTRI_12781 710 eukaryota>alveolata>ciliophora Oxytricha trifallax hypothetical protein OXYTRI_12781 (macronuclear) [Oxytricha trifallax]. 145491391 METHYLASE AlkB-2OGFEDO+SAM-methylase+SbcC 2OG-FeII_Oxy_2+Methyltransf_11 GSPATT00006150001 634 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2]. 145522424 METHYLASE AlkB-2OGFEDO+SAM-methylase+SbcC 2OG-FeII_Oxy_2+Methyltransf_11 GSPATT00014589001 636 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2]. 299115673 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Esi_0186_0021 350 eukaryota>stramenopiles Ectocarpus siliculosus conserved unknown protein [Ectocarpus siliculosus]. Sarc1000000358 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Sarc1000000358 335 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (335 aa) 58394263 METHYLASE AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 AgaP_AGAP011900 621 eukaryota>metazoa>hexapoda Anopheles gambiae str. PEST AGAP011900-PA, partial [Anopheles gambiae str. PEST]. 
170579523 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 Bm1_17050 576 eukaryota>metazoa>nematoda Brugia malayi hypothetical protein [Brugia malayi]. 17552176 METHYLASE AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 CELE_C14B1.10 591 eukaryota>metazoa>nematoda Caenorhabditis elegans ALKB-8 [Caenorhabditis elegans]. 193643465 DnaJ DNAJ 2OG-FeII_Oxy_2+DnaJ+zf-CSL LOC100163323 382 eukaryota>metazoa>hexapoda Acyrthosiphon pisum PREDICTED: similar to alkB, alkylation repair homolog 8 (E. coli) (alkbh8) [Acyrthosiphon pisum]. Fcyl1000110820 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Fcyl1000110820 269 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.6.1356.1 Fcyl1000110682 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Fcyl1000110682 280 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.6.1329.1 Fcyl1000021378 METHYLASE AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 Fcyl1000021378 947 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.6_#_646 91080367 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 LOC663807 582 eukaryota>metazoa>hexapoda Tribolium castaneum PREDICTED: alkylated DNA repair protein alkB homolog 8 [Tribolium castaneum]. 24658267 METHYLASE SAM-methylase+Tox-HetC 2OG-FeII_Oxy_2+Methyltransf_11 Dmel_CG17807 615 eukaryota>metazoa>hexapoda Drosophila melanogaster CG17807 [Drosophila melanogaster]. 307180204 METHYLASE TRPR-HTH+AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 EAG_13148 604 eukaryota>metazoa>hexapoda Camponotus floridanus Alkylated DNA repair protein alkB-like protein 8 [Camponotus floridanus]. 156552181 - AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 LOC100122369 589 eukaryota>metazoa>hexapoda Nasonia vitripennis PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform X1 [Nasonia vitripennis]. 
110756990 METHYLASE GHH+AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 LOC411649 558 eukaryota>metazoa>hexapoda Apis mellifera PREDICTED: similar to CG17807-PA [Apis mellifera]. 307214872 METHYLASE AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 EAI_01988 558 eukaryota>metazoa>hexapoda Harpegnathos saltator Alkylated DNA repair protein alkB-like protein 8 [Harpegnathos saltator]. 115738137 - RRM+AlkB-2OGFEDO RRM_1+2OG-FeII_Oxy_2 LOC592985 424 eukaryota>metazoa>echinodermata Strongylocentrotus purpuratus PREDICTED: alkylated DNA repair protein alkB homolog 8 [Strongylocentrotus purpuratus]. 219505941 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase+GAF RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 BRAFLDRAFT_111176 650 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_111176 [Branchiostoma floridae]. 219431450 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase RRM_1+2OG-FeII_Oxy_2 BRAFLDRAFT_215107 641 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_215107 [Branchiostoma floridae]. Lgig1000007292 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 Lgig1000007292 624 eukaryota>metazoa>mollusca Lottia gigantea e_gw1.192.7.1 321463990 - RRM+AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 DAPPUDRAFT_306906 574 eukaryota>metazoa>crustacea Daphnia pulex hypothetical protein DAPPUDRAFT_306906 [Daphnia pulex]. Caps1000010719 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 Caps1000010719 611 eukaryota>metazoa>annelida Capitella spI estExt_Genewise1.C_3640053 291238544 SP RRM+AlkB-2OGFEDO+SAM-methylase SP+RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 LOC100369536 742 eukaryota>metazoa>hemichordata Saccoglossus kowalevskii PREDICTED: hypothetical protein [Saccoglossus kowalevskii]. 
327269144 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 alkbh8 666 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: alkylated DNA repair protein alkB homolog 8 [Anolis carolinensis]. 114640181 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase DUF1891+RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 ALKBH8 664 eukaryota>metazoa>chordata>vertebrata Pan troglodytes PREDICTED: alkylated DNA repair protein alkB homolog 8 isoform X2 [Pan troglodytes]. 61675696 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase+GAF DUF1891+RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 Alkbh8 664 eukaryota>metazoa>chordata>vertebrata Mus musculus alkylated DNA repair protein alkB homolog 8 [Mus musculus]. 109478839 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase DUF1891+RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 RGD1304687_predicted 671 eukaryota>metazoa>chordata>vertebrata Rattus norvegicus PREDICTED: similar to CG17807-PA [Rattus norvegicus]. 224043547 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 LOC100232062 847 eukaryota>metazoa>chordata>vertebrata Taeniopygia guttata PREDICTED: similar to Alkylated DNA repair protein alkB homolog 8 [Taeniopygia guttata]. 118085116 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase DUF1891+RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 LOC418972 679 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: hypothetical protein [Gallus gallus]. 326914414 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 LOC100541852 846 eukaryota>metazoa>chordata>vertebrata Meleagris gallopavo PREDICTED: alkylated DNA repair protein alkB homolog 8-like [Meleagris gallopavo]. Hrob1000005052 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase RRM_1+2OG-FeII_Oxy_2+Methyltransf_11 Hrob1000005052 559 eukaryota>metazoa>annelida Helobdella robusta 69129 569439499 TM AlkB-2OGFEDO 2OG-FeII_Oxy_2+TM RFI_00879 195 eukaryota Reticulomyxa filosa hypothetical protein RFI_00879 [Reticulomyxa filosa]. 
156212272 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase RRM_6+2OG-FeII_Oxy_2+Methyltransf_11 NEMVEDRAFT_v1g235894 648 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein [Nematostella vectensis]. Psoj1000018446 METHYLASE AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 Psoj1000018446 430 eukaryota>stramenopiles Phytophthora sojae 125662 301121774 - AlkB-2OGFEDO+SAM-methylase+S2P 2OG-FeII_Oxy_2+Methyltransf_11 PITG_02033 640 eukaryota>stramenopiles Phytophthora infestans T30-4 alkylated DNA repair protein alkB 8 [Phytophthora infestans T30-4]. Pram1000006453 METHYLASE AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 Pram1000006453 643 eukaryota>stramenopiles Phytophthora ramorum 77246 196005257 METHYLASE RRM+AlkB-2OGFEDO+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 TRIADDRAFT_26003 653 eukaryota>metazoa>placozoa Trichoplax adhaerens hypothetical protein TRIADDRAFT_26003 [Trichoplax adhaerens]. #; 735850808 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SAMD00019534_100830 200 eukaryota>amoebozoa>mycetozoa>dictyosteliida Acytostelium subglobosum LB1 hypothetical protein SAMD00019534_100830 [Acytostelium subglobosum LB1]. 281212158 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PPL_00108 168 eukaryota>amoebozoa>mycetozoa>dictyosteliida Polysphondylium pallidum PN500 2-oxoglutarate and Fe(II)-dependent oxygenase family protein [Polysphondylium pallidum PN500]. #; ALKBH5 mRNA demethylase, crown-SAR 313240619 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 GSOID_T00020806001 462 eukaryota>metazoa>chordata Oikopleura dioica unnamed protein product [Oikopleura dioica]. 115965125 - - 2OG-FeII_Oxy_2 LOC579335 266 eukaryota>metazoa>echinodermata Strongylocentrotus purpuratus PREDICTED: similar to MGC79570 protein [Strongylocentrotus purpuratus]. Adig1000002503 - - 2OG-FeII_Oxy_2 Adig1000002503 322 eukaryota>cnidaria Acropora digitifera adi_v1.20302 156219200 - - 2OG-FeII_Oxy_2 NEMVEDRAFT_v1g108628 256 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein [Nematostella vectensis]. 
291224533 - - 2OG-FeII_Oxy_2 LOC100378951 362 eukaryota>metazoa>hemichordata Saccoglossus kowalevskii PREDICTED: RNA demethylase ALKBH5-like [Saccoglossus kowalevskii]. Smar1000006760 - - 2OG-FeII_Oxy_2 Smar1000006760 334 eukaryota>metazoa>arthropoda>myriapoda Strigamia maritima SMAR013901-PA pep:novel scaffold:Smar1:JH431789:635068:636630:1 gene:SMAR013901 transcript:SMAR013901-RA 210101960 - - 2OG-FeII_Oxy_2 BRAFLDRAFT_126925 314 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_126925 [Branchiostoma floridae]. Lgig1000000949 - - 2OG-FeII_Oxy_2 Lgig1000000949 256 eukaryota>metazoa>mollusca Lottia gigantea gw1.8.349.1 198418993 - MIP-T3 2OG-FeII_Oxy_2+MIP-T3 LOC100176098 308 eukaryota>metazoa>chordata Ciona intestinalis PREDICTED: RNA demethylase ALKBH5, partial [Ciona intestinalis]. 221119302 - - 2OG-FeII_Oxy_2 LOC100212340 329 eukaryota>metazoa>cnidaria Hydra vulgaris PREDICTED: probable alpha-ketoglutarate-dependent dioxygenase ABH5-like [Hydra vulgaris]. Caps1000015030 - - 2OG-FeII_Oxy_2 Caps1000015030 231 eukaryota>metazoa>annelida Capitella spI gw1.256.16.1 116517268 - - 2OG-FeII_Oxy_2 alkbh5 352 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio RNA demethylase ALKBH5 [Danio rerio]. 47218956 - - 2OG-FeII_Oxy_2 GSTEN:00015751:G:001 355 eukaryota>metazoa>chordata>vertebrata>actinopterygii Tetraodon nigroviridis unnamed protein product, partial [Tetraodon nigroviridis]. 327287260 - - 2OG-FeII_Oxy_2 alkbh5 379 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: RNA demethylase ALKBH5 [Anolis carolinensis]. 118097886 - - 2OG-FeII_Oxy_2 ALKBH5 374 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: RNA demethylase ALKBH5 isoform X1 [Gallus gallus]. 224070277 - - 2OG-FeII_Oxy_2 ALKBH5 383 eukaryota>metazoa>chordata>vertebrata Taeniopygia guttata PREDICTED: RNA demethylase ALKBH5 [Taeniopygia guttata]. 
148539642 - - 2OG-FeII_Oxy_2 ALKBH5 394 eukaryota>metazoa>chordata>vertebrata Homo sapiens RNA demethylase ALKBH5 [Homo sapiens]. 114668860 SP - SP+2OG-FeII_Oxy_2 LOC743473 378 eukaryota>metazoa>chordata>vertebrata Pan troglodytes PREDICTED: hypothetical protein LOC743473 [Pan troglodytes]. 109490878 - - 2OG-FeII_Oxy_2 Alkbh5 395 eukaryota>metazoa>chordata>vertebrata Rattus norvegicus PREDICTED: alkB, alkylation repair homolog 5-like [Rattus norvegicus]. 31044423 - - 2OG-FeII_Oxy_2 Alkbh5 395 eukaryota>metazoa>chordata>vertebrata Mus musculus RNA demethylase ALKBH5 [Mus musculus]. 403336135 SP ** ATHOOK+AlkB SP OXYTRI_12242 943 eukaryota>alveolata>ciliophora Oxytricha trifallax hypothetical protein OXYTRI_12242 (macronuclear) [Oxytricha trifallax]. 116061067 SP ** PHD SP Ot13g01270 544 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri unnamed protein product [Ostreococcus tauri]. 303274614 - ** C1+PHD 2OG-FeII_Oxy_2 MICPUCDRAFT_38490 897 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 300262451 SP POTRA+POTRA+Asp-B-Hydro+UBC SP+MSP1_C+SDA1+SDA1+PAT1+PAT1 VOLCADRAFT_92841 2654 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_92841 [Volvox carteri f. nagariensis]. 403365523 - CDC27 - OXYTRI_19840 1315 eukaryota>alveolata>ciliophora Oxytricha trifallax hypothetical protein OXYTRI_19840 (macronuclear) [Oxytricha trifallax]. 118352562 - SFII-RAD3+Nimm67+Classical-AAA TFIIA+SMC_N+2OG-FeII_Oxy_2 TTHERM_00371220 1999 eukaryota>alveolata>ciliophora Tetrahymena thermophila hypothetical protein TTHERM_00371220 (macronuclear) [Tetrahymena thermophila]. 145483981 SP+Tox-ABHYDROLASE3 SFII-RAD3+Metallopeptidase SP+Lipase_2 GSPATT00005366001 1283 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2]. 
145520431 SP 35exo+NTox3 SP GSPATT00001715001 941 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2]. 307105467 - - CHLNCDRAFT_136563 521 eukaryota>viridiplantae>chlorophyta Chlorella variabilis hypothetical protein CHLNCDRAFT_136563 [Chlorella variabilis]. Sarc1000012919 - ZNR FTO_NTD Sarc1000012919 292 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (292 aa) Sarc1000002122 - PLUS3 - Sarc1000002122 178 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (178 aa) 30690892 - AlkB-2OGFEDO 2OG-FeII_Oxy AT2G48080 438 eukaryota>viridiplantae Arabidopsis thaliana oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana]. 15236223 Tox-MCF Asp-B-Hydro 2OG-FeII_Oxy AT4G02940 569 eukaryota>viridiplantae Arabidopsis thaliana oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana]. 302803799 - - 2OG-FeII_Oxy_2+Totivirus_coat SELMODRAFT_422939 556 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_422939 [Selaginella moellendorffii]. 186489643 - - 2OG-FeII_Oxy_2 AT1G48980 325 eukaryota>viridiplantae Arabidopsis thaliana 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein [Arabidopsis thaliana]. 79319564 - TET-JBP 2OG-FeII_Oxy_2 AT1G48980 327 eukaryota>viridiplantae Arabidopsis thaliana 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein [Arabidopsis thaliana]. 79361742 - TET-JBP 2OG-FeII_Oxy_2 AT1G48980 331 eukaryota>viridiplantae Arabidopsis thaliana 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein [Arabidopsis thaliana]. 79326344 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AT4G36090 520 eukaryota>viridiplantae Arabidopsis thaliana oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana]. 
302775126 - AlkB-2OGFEDO DUF2052+2OG-FeII_Oxy_2 SELMODRAFT_94921 307 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_94921 [Selaginella moellendorffii]. 302757365 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SELMODRAFT_64626 289 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_64626, partial [Selaginella moellendorffii]. 15227938 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AT2G17970 507 eukaryota>viridiplantae Arabidopsis thaliana 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase-like protein [Arabidopsis thaliana]. 116061183 - AlkB-2OGFEDO+EP1+SAM-methylase 2OG-FeII_Oxy_2+Methyltransf_11 OT_ostta13g02300 597 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri Alpha-ketoglutarate-dependent dioxygenase AlkB-like [Ostreococcus tauri]. 281211827 - - 2OG-FeII_Oxy_2 PPL_01222 280 eukaryota>amoebozoa>mycetozoa>dictyosteliida Polysphondylium pallidum PN500 hypothetical protein PPL_01222 [Polysphondylium pallidum PN500]. 323447352 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AURANDRAFT_68154 461 eukaryota>stramenopiles Aureococcus anophagefferens hypothetical protein AURANDRAFT_68154 [Aureococcus anophagefferens]. 485613446 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 EMIHUDRAFT_120329 373 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_120329, partial [Emiliania huxleyi CCMP1516]. #; FTO, note fusion to ASCH in stramenopiles and kinase in Chlamy 291236823 - - FTO_NTD+FTO_CTD LOC100366525 455 eukaryota>metazoa>hemichordata Saccoglossus kowalevskii PREDICTED: alpha-ketoglutarate-dependent dioxygenase FTO-like, partial [Saccoglossus kowalevskii]. 326927229 - - FTO_NTD+FTO_CTD LOC100546308 509 eukaryota>metazoa>chordata>vertebrata Meleagris gallopavo PREDICTED: alpha-ketoglutarate-dependent dioxygenase FTO-like [Meleagris gallopavo]. 50753676 - - FTO_NTD+FTO_CTD LOC415718 120 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: hypothetical protein [Gallus gallus]. 
224064302 - - FTO_NTD+FTO_CTD LOC100223734 509 eukaryota>metazoa>chordata>vertebrata Taeniopygia guttata PREDICTED: fat mass and obesity associated [Taeniopygia guttata]. 122937263 - - FTO_NTD+FTO_CTD FTO 505 eukaryota>metazoa>chordata>vertebrata Homo sapiens alpha-ketoglutarate-dependent dioxygenase FTO [Homo sapiens]. 114662524 - - FTO_NTD+FTO_CTD FTO 505 eukaryota>metazoa>chordata>vertebrata Pan troglodytes PREDICTED: alpha-ketoglutarate-dependent dioxygenase FTO isoform X3 [Pan troglodytes]. 89337260 - - FTO_NTD+FTO_CTD Fto 502 eukaryota>metazoa>chordata>vertebrata Rattus norvegicus alpha-ketoglutarate-dependent dioxygenase FTO [Rattus norvegicus]. 6753916 - - FTO_NTD+FTO_CTD Fto 502 eukaryota>metazoa>chordata>vertebrata Mus musculus similar to FTO [Mus musculus]. 327276419 - - FTO_NTD+FTO_CTD LOC100556980 489 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: alpha-ketoglutarate-dependent dioxygenase FTO-like [Anolis carolinensis]. 189521378 SP - SP+FTO_NTD+FTO_CTD+FTO_CTD fto 556 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: fto protein [Danio rerio]. 47216623 TM - FTO_NTD+FTO_CTD+TM GSTEN:00025432:G:001 488 eukaryota>metazoa>chordata>vertebrata>actinopterygii Tetraodon nigroviridis unnamed protein product, partial [Tetraodon nigroviridis]. Fcyl1000025701 SP - SP+FTO_NTD+FTO_CTD Fcyl1000025701 545 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.11.582.1 320169135 - - FTO_NTD+FTO_NTD+FTO_CTD CAOG_04002 638 eukaryota Capsaspora owczarzaki ATCC 30864 hypothetical protein CAOG_04002 [Capsaspora owczarzaki ATCC 30864]. Sarc1000012918 - - FTO_NTD Sarc1000012918 212 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (212 aa) 298709920 SP SPX SP+FTO_NTD+FTO_CTD Esi_0270_0028 715 eukaryota>stramenopiles Ectocarpus siliculosus conserved unknown protein [Ectocarpus siliculosus]. 
Ttra1000008560 - - SRP-alpha_N+FTO_NTD+FTO_CTD Ttra1000008560 561 eukaryota>apusozoa Thecamonas trahens Thecamonas trahens ATCC 50062 FATSO protein (561 aa) 220977073 - - FTO_NTD+FTO_CTD THAPSDRAFT_261481 613 eukaryota>stramenopiles Thalassiosira pseudonana CCMP1335 hypothetical protein THAPSDRAFT_261481, partial [Thalassiosira pseudonana CCMP1335]. 298709934 ASCH ASCH FTO_NTD+FTO_CTD Esi_0272_0024 1065 eukaryota>stramenopiles Ectocarpus siliculosus conserved unknown protein [Ectocarpus siliculosus]. Fcyl1000039992 ASCH ASCH FTO_NTD+FTO_CTD+ASCH Fcyl1000039992 886 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.102_#_45 Fcyl1000029925 ASCH ASCH FTO_NTD+FTO_CTD+ASCH Fcyl1000029925 870 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.21_#_237 116060758 TM+TM+TM TFIIE-HTH TM+TM+TM+FTO_NTD+FTO_CTD Ot12g01460 689 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri unnamed protein product [Ostreococcus tauri]. 226457226 - - FTO_NTD+FTO_CTD MICPUCDRAFT_60556 541 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 255078368 - - FTO_NTD+FTO_CTD MICPUN_112682 520 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 FATSO protein [Micromonas sp. RCC299]. 220977539 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 THAPSDRAFT_20825 573 eukaryota>stramenopiles Thalassiosira pseudonana CCMP1335 predicted protein [Thalassiosira pseudonana CCMP1335]. 219113643 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 PHATR_44026 384 eukaryota>stramenopiles Phaeodactylum tricornutum CCAP 1055/1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1]. 
Fcyl1000122509 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Fcyl1000122509 409 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.31.61.1 Fcyl1000129851 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Fcyl1000129851 477 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.113.10.1 Fcyl1000032662 Tox-ABHYDROLASE3 Tox-ABHYDROLASE3+AlkB-2OGFEDO Lipase_3+2OG-FeII_Oxy_2 Fcyl1000032662 827 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.31_#_127 Fcyl1000040299 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Fcyl1000040299 406 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.113_#_6 299473601 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Esi_0081_0103 484 eukaryota>stramenopiles Ectocarpus siliculosus conserved unknown protein [Ectocarpus siliculosus]. 303291017 - - 2OG-FeII_Oxy_2+API5 MICPUCDRAFT_67764 233 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 255082812 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2+API5 MICPUN_61562 577 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. Bnat1000005628 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Bnat1000005628 508 eukaryota>rhizaria>cercozoa Bigelowiella natans fgenesh1_pg.46_#_69 302836187 - POTRA+AlkB-2OGFEDO PAT1+DUF4175+2OG-FeII_Oxy_2 VOLCADRAFT_104423 765 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_104423 [Volvox carteri f. nagariensis]. 158275615 - STYKIN 2OG-FeII_Oxy_2+Pkinase CHLREDRAFT_175223 1503 eukaryota>viridiplantae>chlorophyta Chlamydomonas reinhardtii predicted protein [Chlamydomonas reinhardtii]. 320162612 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 CAOG_00036 372 eukaryota Capsaspora owczarzaki ATCC 30864 alkylated DNA repair protein [Capsaspora owczarzaki ATCC 30864]. 
497649437 490863615 502454342 521074729 521061610 499713678 522048777 771842312 657897720 515934545 752713757 53758311 766766902 #; ALKBH3, note fusion to HOMEO domain, methylase not DNA methylase 116054890 SP METHYLASE+AlkB-2OGFEDO SP+DUF633+2OG-FeII_Oxy_2 Ot14g02210 516 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri alkylated DNA repair protein (ISS), partial [Ostreococcus tauri]. Caps1000024925 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Caps1000024925 159 eukaryota>metazoa>annelida Capitella spI e_gw1.37312.2.1 125855293 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 LOC792266 282 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: hypothetical protein [Danio rerio]. 47214690 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 GSTEN:00019672:G:001 302 eukaryota>metazoa>chordata>vertebrata>actinopterygii Tetraodon nigroviridis unnamed protein product, partial [Tetraodon nigroviridis]. 156218294 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 NEMVEDRAFT_v1g168448 269 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein [Nematostella vectensis]. 198419633 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 LOC100187190 288 eukaryota>metazoa>chordata Ciona intestinalis PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 3 [Ciona intestinalis]. Hrob1000003787 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Hrob1000003787 293 eukaryota>metazoa>annelida Helobdella robusta 185055 Caps1000018790 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Caps1000018790 288 eukaryota>metazoa>annelida Capitella spI e_gw1.669.6.1 Lgig1000006095 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Lgig1000006095 278 eukaryota>metazoa>mollusca Lottia gigantea e_gw1.85.31.1 114637163 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 ALKBH3 286 eukaryota>metazoa>chordata>vertebrata Pan troglodytes PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 3 isoform X1 [Pan troglodytes]. 21040275 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 ALKBH3 286 eukaryota>metazoa>chordata>vertebrata Homo sapiens alpha-ketoglutarate-dependent dioxygenase alkB homolog 3 [Homo sapiens]. 
114637165 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC741336 145 eukaryota>metazoa>chordata>vertebrata Pan troglodytes PREDICTED: similar to ALKBH3 protein isoform 1 [Pan troglodytes]. 110625726 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Alkbh3 286 eukaryota>metazoa>chordata>vertebrata Mus musculus alpha-ketoglutarate-dependent dioxygenase alkB homolog 3 [Mus musculus]. 62079085 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Alkbh3 295 eukaryota>metazoa>chordata>vertebrata Rattus norvegicus alpha-ketoglutarate-dependent dioxygenase alkB homolog 3 [Rattus norvegicus]. 118118318 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC776379 70 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: similar to ALKBH3 protein, partial [Gallus gallus]. 224051018 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC100220917 338 eukaryota>metazoa>chordata>vertebrata Taeniopygia guttata PREDICTED: similar to alkB, alkylation repair homolog 3 [Taeniopygia guttata]. 326920364 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 LOC100545576 228 eukaryota>metazoa>chordata>vertebrata Meleagris gallopavo PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 3-like [Meleagris gallopavo]. 118091513 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 ALKBH3 333 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: similar to prostate cancer antigen-1 [Gallus gallus]. 327259719 - AlkB-2OGFEDO 2OG-FeII_Oxy_2+2OG-FeII_Oxy_2 LOC100557858 220 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 3-like [Anolis carolinensis]. 210103637 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 BRAFLDRAFT_126257 287 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_126257 [Branchiostoma floridae]. 219407733 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 BRAFLDRAFT_117316 316 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_117316 [Branchiostoma floridae]. 
485640232 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EMIHUDRAFT_111319 283 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_111319 [Emiliania huxleyi CCMP1516]. 551577156 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EMIHUDRAFT_207885 280 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_207885 [Emiliania huxleyi CCMP1516]. Fcyl1000026428 - HOMEO+AlkB-2OGFEDO 2OG-FeII_Oxy_2 Fcyl1000026428 396 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.13_#_88 743520408 735848816 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SAMD00019534_121570 167 eukaryota>amoebozoa>mycetozoa>dictyosteliida Acytostelium subglobosum LB1 hypothetical protein SAMD00019534_121570 [Acytostelium subglobosum LB1]. 281207196 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PPL_05363 121 eukaryota>amoebozoa>mycetozoa>dictyosteliida Polysphondylium pallidum PN500 hypothetical protein PPL_05363 [Polysphondylium pallidum PN500]. 735849527 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SAMD00019534_115570 173 eukaryota>amoebozoa>mycetozoa>dictyosteliida Acytostelium subglobosum LB1 hypothetical protein SAMD00019534_115570 [Acytostelium subglobosum LB1]. 735994552 - Actinomycete-peptide+AlkB-2OGFEDO+STYKIN 2OG-FeII_Oxy_2+DUF605 SAMD00019534_034160 511 eukaryota>amoebozoa>mycetozoa>dictyosteliida Acytostelium subglobosum LB1 hypothetical protein SAMD00019534_034160, partial [Acytostelium subglobosum LB1]. 281207383 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PPL_05555 506 eukaryota>amoebozoa>mycetozoa>dictyosteliida Polysphondylium pallidum PN500 putative alkylated DNA repair protein [Polysphondylium pallidum PN500]. 569381717 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 RFI_21980 275 eukaryota Reticulomyxa filosa alkylated DNA repair protein [Reticulomyxa filosa]. 284094864 Nimm55 AlkB-2OGFEDO 2OG-FeII_Oxy_2 NAEGRDRAFT_63689 292 eukaryota>heterolobosea Naegleria gruberi predicted protein [Naegleria gruberi]. #; AlkBH3- subgroup Likely to bind DNA, note SAD and PHD in one instance and ATHOOK in another. 
568037062 299115604 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Esi_0181_0046 349 eukaryota>stramenopiles Ectocarpus siliculosus 2OG-Fe(II) oxygenase [Ectocarpus siliculosus]. Bnat1000014488 - AlkB-2OGFEDO+TET-JBP 2OG-FeII_Oxy_2 Bnat1000014488 166 eukaryota>rhizaria>cercozoa Bigelowiella natans gw1.3.140.1 217403598 SP AlkB-2OGFEDO+SAD SP+2OG-FeII_Oxy_2+YDG_SRA PHATRDRAFT_49981 544 eukaryota>stramenopiles Phaeodactylum tricornutum CCAP 1055/1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1]. 323453703 PHD AlkB-2OGFEDO+SAD+PHD+AN1+PHD+FUNDEAMN+EP1+SH3 2OG-FeII_Oxy_2+YDG_SRA+PAT1+zf-HC5HC2H+MAP65_ASE1+Rib_recp_KP_reg AURANDRAFT_63241 2643 eukaryota>stramenopiles Aureococcus anophagefferens hypothetical protein AURANDRAFT_63241 [Aureococcus anophagefferens]. Spun1000006275 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Spun1000006275 232 eukaryota>fungi>chytridiomycota Spizellomyces punctatus Spizellomyces punctatus DAOM BR117 hypothetical protein (232 aa) 313239117 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 GSOID_T00007062001 262 eukaryota>metazoa>chordata Oikopleura dioica unnamed protein product [Oikopleura dioica]. 485619453 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EMIHUDRAFT_210584 259 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_210584 [Emiliania huxleyi CCMP1516]. 220973386 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 THAPSDRAFT_6486 318 eukaryota>stramenopiles Thalassiosira pseudonana CCMP1335 predicted protein [Thalassiosira pseudonana CCMP1335]. 239886228 - ATHOOK+AlkB-2OGFEDO 2OG-FeII_Oxy_2 Pmar_PMAR018546 305 eukaryota>alveolata Perkinsus marinus ATCC 50983 conserved hypothetical protein [Perkinsus marinus ATCC 50983]. 239895917 - AlkB-2OGFEDO AF-4+2OG-FeII_Oxy_2 Pmar_PMAR006990 477 eukaryota>alveolata Perkinsus marinus ATCC 50983 conserved hypothetical protein [Perkinsus marinus ATCC 50983]. 
Wseb1000002244 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Wseb1000002244 200 eukaryota>fungi>basidiomycota Wallemia sebi estExt_Genemark1.C_70130 71003245 - SWC3+AlkB-2OGFEDO 2OG-FeII_Oxy_2 UM00156.1 421 eukaryota>fungi>basidiomycota Ustilago maydis 521 hypothetical protein UM00156.1 [Ustilago maydis 521]. 18399917 SP AlkB-2OGFEDO SP+DUF4057+2OG-FeII_Oxy_2 AT2G22260 314 eukaryota>viridiplantae Arabidopsis thaliana DNA repair protein ALKBH2 [Arabidopsis thaliana]. 307102474 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 CHLNCDRAFT_13108 199 eukaryota>viridiplantae>chlorophyta Chlorella variabilis hypothetical protein CHLNCDRAFT_13108, partial [Chlorella variabilis]. Spun1000007268 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Spun1000007268 281 eukaryota>fungi>chytridiomycota Spizellomyces punctatus Spizellomyces punctatus DAOM BR117 hypothetical protein (281 aa) 70996955 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AFUA_5G14250 317 eukaryota>fungi>ascomycota Aspergillus fumigatus Af293 DNA repair family protein [Aspergillus fumigatus Af293]. 67901590 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AN7782.2 335 eukaryota>fungi>ascomycota Aspergillus nidulans FGSC A4 hypothetical protein AN7782.2 [Aspergillus nidulans FGSC A4]. Sarc1000007962 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Sarc1000007962 360 eukaryota>ichthyosporea Sphaeroforma arctica Sphaeroforma arctica JP610 hypothetical protein (360 aa) 46108746 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 FG01255.1 326 eukaryota>fungi>ascomycota Fusarium graminearum PH-1 hypothetical protein FG01255.1 [Fusarium graminearum PH-1]. 160705040 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SNOG_15119 325 eukaryota>fungi>ascomycota Phaeosphaeria nodorum SN15 hypothetical protein SNOG_15119 [Phaeosphaeria nodorum SN15]. Chet1000005718 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Chet1000005718 317 eukaryota>fungi>ascomycota Cochliobolus heterostrophus estExt_fgenesh1_pg.C_410002 162673787 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PHYPADRAFT_141856 320 eukaryota>viridiplantae Physcomitrella patens predicted protein [Physcomitrella patens]. 
Ttra1000002481 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Ttra1000002481 292 eukaryota>apusozoa Thecamonas trahens Thecamonas trahens ATCC 50062 2OG-Fe(II) oxygenase (292 aa) #; AlkBH3-related Interesting fusion to CUE and TOPC- likely to bind DNA Lhya1000011955 - CUE+AlkB-2OGFEDO+TOPC 2OG-FeII_Oxy_2+zf-GRF Lhya1000011955 397 eukaryota>fungi>mucoromycotina Lichtheimia hyalospora e_gw1.1115.1.1 50423311 - CUE+AlkB-2OGFEDO+TOPC 2OG-FeII_Oxy_2 DEHA0E22759g 404 eukaryota>fungi>ascomycota Debaryomyces hansenii CBS767 hypothetical protein DEHA0E22759g [Debaryomyces hansenii CBS767]. 45199260 - CUE+AlkB-2OGFEDO 2OG-FeII_Oxy_2 AGOS_AFR741W 407 eukaryota>fungi>ascomycota Ashbya gossypii ATCC 10895 AFR741Wp [Ashbya gossypii ATCC 10895]. 50550127 - CUE+AlkB-2OGFEDO+TOPC 2OG-FeII_Oxy_2 YALI0D07546g 372 eukaryota>fungi>ascomycota Yarrowia lipolytica CLIB122 YALI0D07546p [Yarrowia lipolytica]. 116505443 TM AlkB-2OGFEDO+JAB 2OG-FeII_Oxy_2+TM+Peptidase_M13_N+Peptidase_M13 CC1G_05104 1263 eukaryota>fungi>basidiomycota Coprinopsis cinerea okayama7#130 hypothetical protein CC1G_05104 [Coprinopsis cinerea okayama7#130]. 527305314 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 FOMPIDRAFT_41770 430 eukaryota>fungi>basidiomycota Fomitopsis pinicola FP-58527 SS1 hypothetical protein FOMPIDRAFT_41770 [Fomitopsis pinicola FP-58527 SS1]. Abis1000001814 - AlkB-2OGFEDO+TOPC 2OG-FeII_Oxy_2 Abis1000001814 379 eukaryota>fungi>basidiomycota Agaricus bisporus estExt_Genewise1Plus.C_21078 164647078 SP AlkB-2OGFEDO+TOPC SP+2OG-FeII_Oxy_2 LACBIDRAFT_232887 357 eukaryota>fungi>basidiomycota Laccaria bicolor S238N-H82 predicted protein [Laccaria bicolor S238N-H82]. 
Mver1000002291 - CUE+AlkB-2OGFEDO 2OG-FeII_Oxy_2 Mver1000002291 480 eukaryota>fungi>zygomycete Mortierella verticillata Mortierella verticillata NRRL 6337 hypothetical protein (480 aa) Chet1000002884 - UBA+AlkB-2OGFEDO+TOPC 2OG-FeII_Oxy_2+zf-GRF Chet1000002884 453 eukaryota>fungi>ascomycota Cochliobolus heterostrophus estExt_Genewise1Plus.C_320154 169625210 - CUE+AlkB-2OGFEDO 2OG-FeII_Oxy_2 SNOG_15872 420 eukaryota>fungi>ascomycota Phaeosphaeria nodorum SN15 hypothetical protein SNOG_15872 [Phaeosphaeria nodorum SN15]. 67517207 - CUE+RelE-ParE+AlkB-2OGFEDO+TOPC CUE+2OG-FeII_Oxy_2+zf-GRF AN0881.2 448 eukaryota>fungi>ascomycota Aspergillus nidulans FGSC A4 hypothetical protein AN0881.2 [Aspergillus nidulans FGSC A4]. 70996308 - CUE+AlkB-2OGFEDO+TOPC CUE+2OG-FeII_Oxy_2+zf-GRF AFUA_1G15410 493 eukaryota>fungi>ascomycota Aspergillus fumigatus Af293 CUE domain protein [Aspergillus fumigatus Af293]. 46121463 - AlkB-2OGFEDO+TOPC 2OG-FeII_Oxy_2+zf-GRF FG05110.1 357 eukaryota>fungi>ascomycota Fusarium graminearum PH-1 hypothetical protein FG05110.1 [Fusarium graminearum PH-1]. 85107094 - CUE+AlkB-2OGFEDO+TOPC CUE+2OG-FeII_Oxy_2+zf-GRF NCU07663.1 584 eukaryota>fungi>ascomycota Neurospora crassa OR74A hypothetical protein [Neurospora crassa OR74A]. 116196186 SP CUE+AlkB-2OGFEDO+TOPC SP+2OG-FeII_Oxy_2+zf-GRF CHGG_04691 473 eukaryota>fungi>ascomycota Chaetomium globosum CBS 148.51 hypothetical protein CHGG_04691 [Chaetomium globosum CBS 148.51]. Spun1000003263 - CUE+AlkB-2OGFEDO+TOPC 2OG-FeII_Oxy_2+zf-GRF Spun1000003263 427 eukaryota>fungi>chytridiomycota Spizellomyces punctatus Spizellomyces punctatus DAOM BR117 hypothetical protein (427 aa) 255085460 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2+zf-GRF MICPUN_62379 453 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. 
485619574 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EMIHUDRAFT_210764 448 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_210764 [Emiliania huxleyi CCMP1516]. 551544968 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EMIHUDRAFT_249216 366 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_249216 [Emiliania huxleyi CCMP1516]. #; THAPSDRAFT_42543-like RNA modifying 220970302 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 THAPSDRAFT_42543 222 eukaryota>stramenopiles Thalassiosira pseudonana CCMP1335 predicted protein, partial [Thalassiosira pseudonana CCMP1335]. Fcyl1000019629 METHYLASE AlkB-2OGFEDO+THUMP+METHYLASE+DUF2431 2OG-FeII_Oxy_2+UPF0020 Fcyl1000019629 822 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.4_#_1043 Fcyl1000039366 METHYLASE AlkB-2OGFEDO+THUMP+METHYLASE+DUF2431 2OG-FeII_Oxy_2+UPF0020 Fcyl1000039366 813 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.88.28.1 Fcyl1000078853 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Fcyl1000078853 234 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.4.648.1 551550668 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EMIHUDRAFT_215572 548 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_215572 [Emiliania huxleyi CCMP1516]. 551618356 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EMIHUDRAFT_225460 465 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_225460 [Emiliania huxleyi CCMP1516]. 551576966 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 EMIHUDRAFT_207723 307 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_207723 [Emiliania huxleyi CCMP1516]. #; Naegleria LSE 284090982 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 NAEGRDRAFT_79711 271 eukaryota>heterolobosea Naegleria gruberi predicted protein [Naegleria gruberi]. 284094920 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 NAEGRDRAFT_63194 236 eukaryota>heterolobosea Naegleria gruberi predicted protein [Naegleria gruberi]. 
284086042 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 NAEGRDRAFT_52210 251 eukaryota>heterolobosea Naegleria gruberi predicted protein [Naegleria gruberi]. #; Note fusion to Little finger and HECT Psoj1000002243 SP LittleFinger+HECT+AlkB-2OGFEDO SP+HECT+2OG-FeII_Oxy_2 Psoj1000002243 1001 eukaryota>stramenopiles Phytophthora sojae 128295 Pram1000006143 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Pram1000006143 292 eukaryota>stramenopiles Phytophthora ramorum 77618 Fcyl1000015510 POTRA CYSTM+AlkB-2OGFEDO 2OG-FeII_Oxy_2 Fcyl1000015510 379 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.1_#_1574 Fcyl1000029493 SP+POTRA+POTRA AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 Fcyl1000029493 394 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.20_#_150 262105909 PG_binding_1 AlkB-2OGFEDO 2OG-FeII_Oxy_2 PITG_02472 261 eukaryota>stramenopiles Phytophthora infestans T30-4 conserved hypothetical protein [Phytophthora infestans T30-4]. #; Ttra1000010051 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2+TFIIA+IMCp Ttra1000010051 935 eukaryota>apusozoa Thecamonas trahens Thecamonas trahens ATCC 50062 hypothetical protein (935 aa) #; 303276360 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 MICPUCDRAFT_56679 318 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 226517324 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 MICPUN_58441 335 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. #; 118346635 - SbcC+AlkB-2OGFEDO 2OG-FeII_Oxy_2 TTHERM_00035460 403 eukaryota>alveolata>ciliophora Tetrahymena thermophila hypothetical protein TTHERM_00035460 (macronuclear) [Tetrahymena thermophila]. 145491776 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 GSPATT00033965001 312 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2]. 
145488027 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 GSPATT00032472001 304 eukaryota>alveolata>ciliophora Paramecium tetraurelia strain d4-2 hypothetical protein (macronuclear) [Paramecium tetraurelia strain d4-2]. #; Pram1000006748 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Pram1000006748 308 eukaryota>stramenopiles Phytophthora ramorum 76882 Psoj1000014320 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Psoj1000014320 310 eukaryota>stramenopiles Phytophthora sojae 142118 301093209 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PITG_19142 309 eukaryota>stramenopiles Phytophthora infestans T30-4 alkylated DNA repair protein alkB-like protein [Phytophthora infestans T30-4]. 209582552 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PHATR_46782 352 eukaryota>stramenopiles Phaeodactylum tricornutum CCAP 1055/1 predicted protein [Phaeodactylum tricornutum CCAP 1055/1]. Fcyl1000110306 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 Fcyl1000110306 534 eukaryota>stramenopiles Fragilariopsis cylindrus gw1.5.156.1 Fcyl1000020581 - CheRN-Alpha+AlkB-2OGFEDO Pap_E4+2OG-FeII_Oxy_2 Fcyl1000020581 588 eukaryota>stramenopiles Fragilariopsis cylindrus fgenesh2_pg.5_#_951 551549245 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 EMIHUDRAFT_358442 316 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_358442 [Emiliania huxleyi CCMP1516]. 551567129 SP - SP+2OG-FeII_Oxy_2 EMIHUDRAFT_355877 280 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_355877, partial [Emiliania huxleyi CCMP1516]. 284095290 SP AlkB-2OGFEDO SP+2OG-FeII_Oxy_2 NAEGRDRAFT_30537 314 eukaryota>heterolobosea Naegleria gruberi predicted protein [Naegleria gruberi]. 735859727 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SAMD00019534_002080 394 eukaryota>amoebozoa>mycetozoa>dictyosteliida Acytostelium subglobosum LB1 hypothetical protein SAMD00019534_002080 [Acytostelium subglobosum LB1]. 66808825 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 DDB_G0285575 393 eukaryota>amoebozoa>mycetozoa>dictyosteliida Dictyostelium discoideum AX4 alkylated DNA repair protein [Dictyostelium discoideum AX4]. 
325082087 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 DICPUDRAFT_55106 328 eukaryota>amoebozoa>mycetozoa>dictyosteliida Dictyostelium purpureum hypothetical protein DICPUDRAFT_55106 [Dictyostelium purpureum]. 281211828 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PPL_01223 116 eukaryota>amoebozoa>mycetozoa>dictyosteliida Polysphondylium pallidum PN500 alkylated DNA repair protein [Polysphondylium pallidum PN500]. 281209952 POTRA+TM+TM+TM+TM POTRA+POTRA+AlkB-2OGFEDO+sGTP 2OG-FeII_Oxy_2+Ras+TM+TM+TM+TM PPL_03193 780 eukaryota>amoebozoa>mycetozoa>dictyosteliida Polysphondylium pallidum PN500 alkylated DNA repair protein [Polysphondylium pallidum PN500]. 307109513 - SHELIX+AlkB-2OGFEDO 2OG-FeII_Oxy_2 CHLNCDRAFT_143028 800 eukaryota>viridiplantae>chlorophyta Chlorella variabilis hypothetical protein CHLNCDRAFT_143028 [Chlorella variabilis]. 239900379 - - 2OG-FeII_Oxy_2 Pmar_PMAR019901 332 eukaryota>alveolata Perkinsus marinus ATCC 50983 conserved hypothetical protein [Perkinsus marinus ATCC 50983]. 221487023 - AlkB-2OGFEDO Macoilin+2OG-FeII_Oxy_2 TGGT1_010770 927 eukaryota>alveolata>apicomplexa Toxoplasma gondii GT1 conserved hypothetical protein [Toxoplasma gondii GT1]. 302829380 - SWC3+AlkB-2OGFEDO 2OG-FeII_Oxy_2 VOLCADRAFT_115825 365 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_115825 [Volvox carteri f. nagariensis]. 159479846 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 CHLREDRAFT_151320 398 eukaryota>viridiplantae>chlorophyta Chlamydomonas reinhardtii hypothetical protein CHLREDRAFT_151320, partial [Chlamydomonas reinhardtii]. 116000537 - AlkB-2OGFEDO 2OG-FeII_Oxy_2+Aha1_N+AHSA1 Ot01g03050 836 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri DNA alkylation damage repair protein (ISS) [Ostreococcus tauri]. 255081849 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 MICPUN_74198 126 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein, partial [Micromonas sp. RCC299]. 
303285390 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 MICPUCDRAFT_35646 144 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein, partial [Micromonas pusilla CCMP1545]. 323447764 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AURANDRAFT_8404 133 eukaryota>stramenopiles Aureococcus anophagefferens hypothetical protein AURANDRAFT_8404, partial [Aureococcus anophagefferens]. 162670995 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 PHYPADRAFT_169907 401 eukaryota>viridiplantae Physcomitrella patens predicted protein [Physcomitrella patens]. 302801802 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SELMODRAFT_116512 342 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_116512 [Selaginella moellendorffii]. 302798841 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 SELMODRAFT_113969 342 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_113969 [Selaginella moellendorffii]. 15221095 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 AT1G11780 345 eukaryota>viridiplantae Arabidopsis thaliana oxidoreductase, 2OG-Fe(II) oxygenase family protein [Arabidopsis thaliana]. 156082850 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 BBOV_I002590 336 eukaryota>alveolata>apicomplexa Babesia bovis T2Bo hypothetical protein [Babesia bovis T2Bo]. 84999908 - AlkB-2OGFEDO+CytC 2OG-FeII_Oxy_2 TA19740 350 eukaryota>alveolata>apicomplexa Theileria annulata strain Ankara alkylated DNA repair protein [Theileria annulata]. 71031831 - AlkB-2OGFEDO 2OG-FeII_Oxy_2 TP01_0030 259 eukaryota>alveolata>apicomplexa Theileria parva strain Muguga hypothetical protein [Theileria parva strain Muguga].
Str (-3) Str (-2) Str (-1) Str-1 Str-2 Str-3 Str-4 Str-5 Str-6 Str-7 FINAL -----HHHH-HHH---------------------------------------EEEE-EE-----------------------------------------------------------------------------------------------------------------------------EEEEEEE--HHHHHHHHHHH----------------------------------------------------------------------------E----------------------------------HHHHH-H-HH-----------------------------------------------------------------------------HHHH--------------------------EE-------------EEE--------------------EEE---EEEEEEE---EEEEEE----------------------------------------------------------------------------------------------------------------------------------EEEE-------EE----EEEEE---HHHHHHHHHHHH----------------------------------EEEEEEE-----------------------------------------HHHHHH------HHHHHHHHH------ ALIGN ------------------------------------------------------HH-H--------HHHH---------------------E--------------------------------------------------------------------------------------------EEEEE----HHHHHHHHHHH-------------------------------------------------------------------------------HHH-----------------------------HHHHH-H-HH-----------------------------------------------------------------------------HH--------------HHH---------H-HH-------------HHHHH---H--------------------EEEEEE-----EEEEE----------------------------------------------------------------------------------------------------------------------------------EEEE-------EE-----EEEE----HEEEHEHHH------------------------------------EEEEHHHHH---------------------------------------HH-HHH------HHHHHHHH------- HMM 
-----HEEH-H------------------------------------------EEE-EE----------EE--H-H-H--HHHHHHHH-HHHH--HHH--------H----------------------------------------------------------------------------EEEEEEEEE-HHHHHHHHHHHH---------------------------------------------E--EEE----------EEEE------EEEEHHHHHEE----------EE---------------HHHHHH-H-HH-----------------------------------------------------------------------------HHHH-----------HHHHH-------EE-EE-------------EEEE-----H------EE--EEEEEE--HEEEEEEE--EEEEEEEE---------------------------------------------------------------------------------------------------------------------------------EEEE-------EE----EEEEE---HHHHHHHHHHHH----------------------------------EEEEEEE----HH----------------------H-HH----H---HHHHHHH------HHHHHHHHHH----- FREQ ------EEE-EEE--------------------------------------EEEEE-E------------------------------------------------------------------------------------------------------------------------------EEEEEE---HHHHHHHHHH--------------------------------------------------------------------------------HHHH----------HHH----------------HHHH-H-HH-----------------------------------------------------------------------------HH-----------------------------E-------------E---------------------HHHHHHHHHHEE-----EEEEE-----------------------------------------------------------------------------------------------------------------------------------EEEE-------E------EEEE---HHHHHHHHHHHHH----------------H-------------HHHHEEHHHHH------------------------------------------HHHH------HHHHHHHHHH----- PSSM 
----HHHHH-HHH---------------------------------------HEEE-E------------------------------------------------------------------------------------------------------------------------------EEEE-----HHHHHHHHHHH----------------------------------------------------------------------------------------------------------------HHHH-H-HH-----------------------------------------------------------------------------HHHH------------------------------------------E----------------------------EEEEE-----EEEE------------------------------------------------------------------------------------------------------------------------------------EEE-------EE-----EEEE------EEE-----------------------------------------EEEEE------------------------------------------HHHHHH------HHHHHHH-------- CELE_F09F7.7_Caenorhabditis_elegans_25148697 CGCKGARFC-ALC-ET-TE-------RVKK-------LRVVE---DKHVNYKVFIY-DH------IRQIAIPTTNL-N--SQSSLEDI-IDES--TSC-------QSV---------------------------------------------S----TDGS---I-----------E-I--DGLTLIHNFLSESEESKILNMID-------------------------------T--------V--EWA--QSQSG--------RRKQ------DYGPKVNFKHK----------KVK-TD--T--FV--GMPEYADM-L-LN-----------------------------------------------------------------------------KMSE-----------YDVKKLG--NY-QP-FE-------------MCNLE---YEEVKKSAIEMHQDDMWIWGNRLISINLINGSVMTLSNDNK----------------------------------------------------S-------------------------------------------------------------------------FLCY-------VHMPHRSLLCMADECRYDWKHGVLAHH----------------I-------------RGRRIALTMREA--AK----------------------D-FA----E---GGELYEK------YGAELIRLGNIRVPL Dmel_CG4036_Drosophila_melanogaster_24583140 
CGCKGVRTC-LSC-EQ-DF-------HIAK-------TSLRE---QFQ-------------------------------------QLEAWSYC--IQC-------DLL-----Q-R-GWDT--NHVQKDHE--------------------NHK----KDEG---L-----------P-L--PGILVQEEFLSVDEGAQLIADLD-------------------------------D--------L--PWD--ISQSG--------RRKQ------NFGPKTNFKKR----------KLR-LG--S--FA--GFPRTTEY-V-QR-----------------------------------------------------------------------------RFED-VP------L-LR-------GF-QT-IE-------------QCSLE---YEPSKGASIDPHVDDCWIWGERVVTVNCLGDSVLTLTPY-EVQQSGKYNLD------LVASYEDELLA-PLLT------------DDQLATFEG-------------------------------------------------------------------------KVLR-------IPMPNLSLIVLYGPARYQFEHSVLRED----------------V-------------QERRVCVAYREF--TP----------------------M-YI----NGV-DIQKGDP--VRE-KSQIFW-----QIN- LOC100367945_Saccoglossus_kowalevskii_585709738 CGCKGIRTC-LVCEKS-KI-------DLSC-------RSGRF---EKP-------------------------------------DAS-YSFC--WLC-------NIA-----W-L-ESNT--EQHPSHQG--------------------KF------------I-----------R-F--PGVTLIENVVSEEEEEAIIQAVD-------------------------------A--------T--PWK--VSQSG--------RRKQ------DYGPKVNFKKR----------KVN-SK--C--FS--GLPAFIRP-L-TE-----------------------------------------------------------------------------RLVQ-MD------G-LA-------DF-QV-VE-------------QCNLE---YVPDRGSSIDPHFDDVWLWGERLVTLNLNSETTLTMTQK-E---------------------------------------------------KD-------------------------------------------------------------------------ICVS-------IPLPKRSVIVLYGPARYEWMHAIHRED----------------I-------------INRRIAVTFREL--SA----------------------E-FL----DGGVNEDTGSR--LLE-IARTYE-----GKAV NEMVEDRAFT_v1g34785_Nematostella_vectensis_156402493 
CGCTGIRSC-LFC-KD-NT-------KTSQ-------STTSV---DEA-------------------------------------KTK-YLFC--HLC-------SQT-----LPL-GGTC--SHELGDCG--------------------RYG-----------P-----------S-L--DGITLIEDFVSQREEARIVQVID-------------------------------E--------T--VWK--PSQSG--------RRKQ------DYGPQVNFKKK----------KVK-MS--H--FN--GLPAFSEF-L-VR-----------------------------------------------------------------------------RMNDDVP------G-LK-------DF-VP-VE-------------LCNLE---YDEARGSSIDAHFDDFWLWGERLVTLNLLSATRLTMTKD-T---------------------------------------------------YE-------------------------------------------------------------------------I--S-------VPMPRRSLIIVSGAARHLWQHAVKRED----------------I-------------SGRRIAITLREL--SE----------------------E-FC----KGGRNENVGFQ--AIK-TALTFN-----GTSV alkbh4_Danio_rerio_688557483 CGCKGIRTC-LRC-ET-DE-------TKHL-------L-QKN---DLI-------------------------------------HYD-FIYD--PV------------------L-KSAV--REEEGSTP--------------------Q-C-----------F-----------E-F--PGVLLWENFVSEDEERELVSRMD-------------------------------Q--------D--VWR--ESQSG--------RRKQTSVYPKDFGPKVNFKKR----------RVH-VG--S--FS--GLPAISRR-L-LV-----------------------------------------------------------------------------RMSD-LP------Q-LS-------SF-KP-VE-------------QCNLD---YDSLRGSAIDPHLDDSWLWGENLVTVNLLSDTVLTLSLD-Q--------------------------------------------GWGDMEQGE-------------------------------------------------------------------------VRVA-------VRLPRRSLVVLYGDARHRWKHAIHRKD----------------I-------------HGRRVCSTFREL--SA----------------------E-FL----PGGQQEKLGSE--LLD-IALSFQ-----GAPL ALKBH4_Taeniopygia_guttata_823470605 
CGCKGIRSC-LLC-EG-PA-------AAAP-------P-------PQG-------------------------------------EDN-FTYC--PA------------------T-GLAK--GNEHSEFA--------------------GWA-----------F-----------P-F--PGVFLVEEFISEDEECEIVELMD-------------------------------R--------D--DWK--PSQSG--------RKKQ------DYGPKVNFKKQ----------RLK-AG--S--FT--GLPSFSRK-I-VA-----------------------------------------------------------------------------QMKA-CA------V-LS-------GF-LP-VE-------------QCNLD---YSPERGSAIDPHFDDWWLWGERLVSLNLLSKTVLSMSCD-SEDTIQLFPISSK----EELSPPSPFMQ-TSACRNSGEEGTQCFLSPRLVPGKE-------------------------------------------------------------------------VSVA-------ILLPQRSLVVLQGDARYKWKHGIHRRH----------------I-------------EHRRVCITFREL--SA----------------------E-FS----AGGRHEELGKE--LLQ-IALSFQ-----GRPV LOC100520070_Sus_scrofa_311251081 CGCKGIRTC-LIC-ER-QR-------GGDP-------PWQHS---PQK-------------------------------------THR-FIYY--TD------------------T-GWAV--GAEESDFE--------------------GWA-----------F-----------P-F--PGVTLIEDFVTREEEAEMVQLMD-------------------------------R--------D--PWK--LSQSG--------RRKQ------DYGPKVNFRKQ----------KLK-TA--S--FR--GLPSFSRE-V-VR-----------------------------------------------------------------------------RMGL-YP------V-LE-------DF-RP-VE-------------QCNLD---YCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMSRE-APGSLLLCL--------APSGFPEALVE-GAVA------------PSRSVLCQE-------------------------------------------------------------------------VEVA-------VPLPRRSLLVLTGAARHQWKHAIHRRH----------------I-------------EARRVSATFREL--SA----------------------D-FG----PGGRQQDLGRE--LLQ-ISLSFQ-----GRPT alkbh4_Danio_rerio_68372246 
CGCKGIRTC-LRC-ET-DE-------TKHL-------L-QKN---DLI-------------------------------------HYD-FIYD--PV------------------L-KSAV--REEEGSTP--------------------Q-C-----------F-----------E-F--PGVLLWENFVSEDEERELVSRMD-------------------------------Q--------D--VWR--ESQSG--------RRKQ------DFGPKVNFKKR----------RVH-VG--S--FS--GLPAISRR-L-LV-----------------------------------------------------------------------------RMSD-LP------Q-LS-------SF-KP-VE-------------QCNLD---YDSLRGSAIDPHLDDSWLWGENLVTVNLLSDTVLTLSLD-Q--------------------------------------------GWGDMEQGE-------------------------------------------------------------------------VRVA-------VRLPRRSLVVLYGDARHRWKHAIHRKD----------------I-------------HGRRVCSTFREL--SA----------------------E-FL----PGGQQEKLGSE--LLD-IALSFQ-----GAPL ALKBH4_Monodelphis_domestica_612002272 CGCKGVRTC-LLC-EG-ER-------DGGT-------AGTLY---PKK-------------------------------------TAH-FIYC--LE------------------T-GLAL--GTEKSGFA--------------------GWA-----------F-----------P-F--PGVAMIKDFVSADEETELVRLMD-------------------------------Q--------D--DWK--LSQSG--------RRKQ------DYGPKVNFRKQ----------KLK-TG--G--FD--GLPSFSRE-I-VH-----------------------------------------------------------------------------RMGQ-HP------V-LE-------RF-LP-VE-------------QCNLD---YHPERGSAIDPHLDDSWLWGERLVSLNLLSPTVLSMSRD-SNERLQLLSVAQQGTRNSPPNDPVPRDP-EDTG------------SRRSVPCDQ-------------------------------------------------------------------------VEVA-------IHLPARSLLVLFGAARYQWKHAIHRQH----------------I-------------ESHRICATFREL--SA----------------------E-FC----PGGKQGELGQE--LLE-IALSFQ-----GKPV ALKBH4_Homo_sapiens_8923019 
CGCKGIRTC-LIC-ER-QR-------GSDP-------PWELP---PAK-------------------------------------TYR-FIYC--SD------------------T-GWAV--GTEESDFE--------------------GWA-----------F-----------P-F--PGVMLIEDFVTREEEAELVRLMD-------------------------------R--------D--PWK--LSQSG--------RRKQ------DYGPKVNFRKQ----------KLK-TE--G--FC--GLPSFSRE-V-VR-----------------------------------------------------------------------------RMGL-YP------G-LE-------GF-RP-VE-------------QCNLD---YCPERGSAIDPHLDDAWLWGERLVSLNLLSPTVLSMCRE-APGSLLLCS--------APSAAPEALVD-SVIA------------PSRSVLCQE-------------------------------------------------------------------------VEVA-------IPLPARSLLVLTGAARHQWKHAIHRRH----------------I-------------EARRVCVTFREL--SA----------------------E-FG----PGGRQQELGQE--LLR-IALSFQ-----GRPV _Schmidtea_mediterranea_386783769 CTCKGIRTC-SSC-NP-NK-------I------------KIE---NQN-------------------------------------CIV-CYFC--PK----------------I-S-KIVK--ENCISDLK--------------------C---EEFHKVN---I-----------E-L--NGIILIENFLTEDDKNYLLGGIC-------------------------------S--------N--SWV--DSQSG--------RRKQ------DFGPKVNFKKR----------KIN-LT--K--FQ--GLPEYIER-F-VN-----------------------------------------------------------------------------RFSD-IP------E-LK-------DF-NP-VE-------------LCNLE---YNPTRGASIDPHFDDFWLWGERLVTINVQSSTYLTFTPG-FPDMFD--------------ESSQAFFS-SVCS----EN------HENMGNNCA-------------------------------------------------------------------------VSIK-------VPLPEGSLVIVSGDARHKWMHAVSAAD----------------V-------------QSTRIASTLREL--SM----------------------E-FT----EN--DRELSRK--LID-LSLTFN-----GQPV MOQ_003722_Trypanosoma_cruzi_marinkellei_407409697 
CCCSGIRYC-GRC-IE-SE-------RAQG-------IIHQK---FLLVKSSDVIS-RQ------YGAGRTSSCSF-T--CVD--SSH-YGYC--WQC-------NRI-----F-LMHHGA--FKSCADHE--------------------GAT----PNLD---I-----------R-I--EGLFVIPDFLSLLDEEKLVSFLD-------------------------------E-P-SS-F-S--GWK--HSQSG--------RRKQ------DFGPRANFKKR----------KLN-TS--G--MR--GMPKQLES-V-ME-----------------------------------------------------------------------------KVKS-----------FVREITSK-EY-HI-VE-------------VSALE---YTSENSSSIDPHIDDTWVWGDRVGGLNLLEDTVMTFVNN-E----------------------------------------------------G-------------------------------------------------------------------------TAVD-------VFLPRGAFFLLSQGSRYDWLHGIRLEN----------------I-------------KHRRISFTFREF--SS----------------------D-LD----I---DREIIQN--VRK-ITSTFV--------- TCSYLVIO_004974_Trypanosoma_cruzi_407849132 CCCSGIRFC-GRC-IE-SE-------RAQG-------IIHQN---VLLVKSSDVIS-QQ------YSAGRTSSCSF-T--CVE--LSH-YGLC--WQC-------NRI-----F-LMHHGA--FKSCADHE--------------------GAT----PNLD---I-----------R-I--EGLFVIPDFLSLLDEEKLVSFLD-------------------------------E-P-SS-F-S--GWK--HSQSG--------RRKQ------DFGPRANFKKR----------KLN-TS--G--IR--GMPKQLES-V-VK-----------------------------------------------------------------------------KVKS-----------FVRDITSK-EY-HI-VE-------------VSALE---YTSENSSSIDPHIDDTWVWGNRIGGLNLLEDTVMTFVNN-E----------------------------------------------------G-------------------------------------------------------------------------TAVD-------VFLPRGAFFLLSNGSRYDWLHGIRLEN----------------I-------------KHRRISFTFREF--SN----------------------D-LD----I---DREIIQN--VIK-ITSTFV--------- LPMP_352050_Leishmania_panamensis_731709183 
CVCSGIRFC-AKC-RD-TL-------RVQQ-------LFSGS---VFLSSASSVIE-KQ------WHNDRLSSCSF-A--IIG--KST-LSYC--IEC-------MTI-----F-K-SEAP--IKSCVDHQ--------------------G-A----ISTS---V-----------V-I--SGLVVFQDVLTEEEETALIYYLD-------------------------------N-S-HP-F-P--PWK--ESQSG--------RRKQ------DYGPKRNFKKK----------KVR-PA--E--IP--AMPLALEP-V-CA-----------------------------------------------------------------------------TISS-----------TTENFTGR-AY-RI-AE-------------VSALE---YVEGKMSNFDPHIDDTWLWGDRIAGLNLNEPCVVTFVEP-D----------------------------------------------------G-------------------------------------------------------------------------VCCD-------VYLPRRSFFLMSGNCRYKWMHGIRPEH----------------V-------------RGRRISLTFREL--SD----------------------E-IL----A---DAGTSEV--VLS-AASTFV--------- LBRM_35_2190_Leishmania_braziliensis_MHOM/BR/75/M2904_154345804 CVCSGIRFC-AKC-RD-TL-------RVQQ-------LFSGS---VFLSSASSVIE-KQ------WHNDRLSSCSF-A--IIG--KST-LSYC--IEC-------MTI-----F-K-SEAP--IKSCVDHQ--------------------G-A----MSTS---I-----------V-I--SGLVVFQDVLTEEEETALIYYLD-------------------------------N-S-HP-F-P--PWK--ESQSG--------RRKQ------DYGPKRNFKKK----------KVR-PA--E--IP--AMPLALEP-V-CA-----------------------------------------------------------------------------TISS-----------TTENFTGR-AY-RI-AE-------------VSALE---YVEGKMSNFDPHIDDTWLWGDRIAGLNLNEPCVVTFVEP-D----------------------------------------------------G-------------------------------------------------------------------------VCCD-------VYLPRRSFFLMSGNCRYKWMHGIRPEH----------------V-------------RGRRISLTFREL--SD----------------------E-IL----A---DAETSEV--VLS-AASTFV--------- TCIL3000_10_5510_Trypanosoma_congolense_IL3000_342184304 
CHCTGIRFC-GRC-VH-SD-------RAQS-------IINQR---IPLVKSSAVVS-QQ------YTGKRTSSCSF-T--CVE--PSR-RSIC--PVC-------LGI-----F-ETDGSL--IRECKDHM--------------------CLT----LCDD---I-----------I-M--DGFCLMTDFISLKLVTFFD-------------------------------D-P-AP-F-P--PWK--DSQSG--------RRKQ------DYGPKANFKKK----------KLK-LG--N--FQ--GMPQQMEE-L-LG-----------------------------------------------------------------------------RVTS-----------FVSRYTNK-EF-SV-AE-------------VSALE---YTTKC-SSLDPHVDDTWLWGDRIGGLNLLVDVVLTFVNA-S----------------------------------------------------G-------------------------------------------------------------------------IAVA-------AHIPRRSFFMLSKVCRYEWMHGIRRED----------------I-------------VGRRISVTFREL--AD----------------------K-ID----V---DEDLCRG--IKA-AASTFV--------- TVY486_1006420_Trypanosoma_vivax_Y486_340057250 CCCSGIRIC-RHC-VM-SD-------RAQS-------IINCH---VPLTKACDVVE-KQ------YSVERISSCSF-I--CIG--AST-HSFC--SNC-------GKI-----F-VFPERA--IRACADHN--------------------GLT----ADSE---T-----------T-M--RGLMVSPDFVSDIEEDYLLRFFD-------------------------------G---AH-H-S--RWK--VSQSG--------RRKQ------DYGPRANFKKR----------KLK-KG--D--GN--GMPIQLKD-I-IT-----------------------------------------------------------------------------RVNQ-----------FISNETMK-RY-QT-IE-------------VSVLE---YSTKCGSSIDTHIDDTWLWGDRIGGLNLLEDVVLTLVDS-K----------------------------------------------------G-------------------------------------------------------------------------TVAT-------VFVPRRSFFLLSGESRYNWMHGIRSED----------------I-------------KSRRISMTFREF--AD----------------------N-LE----V---DERLLQD--ILS-FSLTFV--------- LMJF_36_1970_Leishmania_major_strain_Friedlin_157876860 
CSCAGIRFC-AKC-RG-SS-------RVQQ-------LFNGS---VPLSSARSVIE-KQ------QDNERLSSCSF-A--VIG--KSS-LSFC--IEC-------ASV-----F-K-SEVP--IKSCADHQ--------------------G-A----VATG---I-----------T-I--SGLAVFRDTLTEEDETAVIRFLD-------------------------------D-S-RP-F-P--PWK--ESQSG--------RRKQ------DYGPKRNFKKK----------KIK-VA--E--IP--GMPLVFES-V-FA-----------------------------------------------------------------------------VISS-----------MVETFTGK-AY-RI-AE-------------VSALE---YMEGKMSNFDPHVDDTWLWGDRIAGLNLNEACAVTFVNP-E----------------------------------------------------G-------------------------------------------------------------------------VCCD-------VYLPRRAFFLMSGNCRYRWMHGIRPEH----------------V-------------RGRRISLTFREL--SD----------------------E-IL----V---DTKASEM--VLS-AASHFV--------- Tb10.70.0360_Trypanosoma_brucei_brucei_TREU927_71747664 CRCTGIRFC-QHC-ID-SD-------RAQS-------IVRRH---VSLANASDVVL-RQ------YTDQRMSSCSF-A--CVG--PSL-LSMC--CAC-------RTV-----F-ETAGGV--LKCCGDHK--------------------GFI----ARTD---I-----------E-L--SGLTIQPGFVSPNEEEYLVAFFD-------------------------------N-P-AP-F-A--AWK--VSQSG--------RRKQ------EYGPKANFKKR----------KLK-VG--D--FR--ALPHQMKT-T-LD-----------------------------------------------------------------------------RVRS-----------FVAEQTMR-EY-CI-VE-------------VSVLE---YTAEC-SCLDPHIDDTWLWGDRIGGLNLLDDVTLTFVSA-D----------------------------------------------------E-------------------------------------------------------------------------VAVT-------VFVPRGAFFLLTGVSRYEWMHGIRRED----------------V-------------KNRRVSVTFREF--AD----------------------N-LV----V---DQEILKT--IVM-SATTFI--------- LINJ_36_2080_Leishmania_infantum_JPCM5_146104297 
CSCAGIRFC-AKC-RD-SS-------RVQQ-------LFSGS---VPLSSARTVIE-KQ------HNDERLSSCSF-A--IIG--KSS-LSFC--IEC-------ASV-----F-K-SEVP--IKSCADHQ--------------------G-A----MATS---I-----------T-I--SGLAVLQHALTEEDEAAVIRFLD-------------------------------D-S-HP-F-P--PWK--ESQSG--------RRKQ------DYGPRRNFKKK----------KVK-VA--E--IP--SMPLVFES-V-FA-----------------------------------------------------------------------------VISS-----------MTETFTGK-AY-RI-AE-------------VSALE---YMEGKMSNFDPHVDDTWLWGDRIAGLNLNEACVVTFVNP-E----------------------------------------------------G-------------------------------------------------------------------------VCCD-------VYLPRRAFFLMSGNCRYKWMHGIRHEH----------------V-------------RGRRISLTFREL--SD----------------------E-IL----A---DTEASEM--VLS-AASNFV--------- TbgDal_X7900_Trypanosoma_brucei_gambiense_DAL972_788018057 CRCTGIRFC-QHC-ID-SD-------RAQS-------IVRRH---VSLANASDVVL-RQ------YTDQRMSSCSF-A--CVG--PSL-LSMC--CAC-------KTV-----F-ETAGGV--LKCCGDHK--------------------GFI----ARTD---I-----------E-L--SGLTIQPGFVSPNEEEYLVAFFD-------------------------------N-P-AP-F-A--AWK--VSQSG--------RRKQ------EYGPKANFKKR----------KLK-VG--D--FR--ALPHQMKT-T-LD-----------------------------------------------------------------------------RVRS-----------FVAEQTMR-EY-CI-VE-------------VSVLE---YTAEC-SCLDPHIDDTWLWGDRIGGLNLLDDVTLTFVSA-D----------------------------------------------------E-------------------------------------------------------------------------VAVT-------VFVPRGAFFLLTGVSRYEWMHGIRRED----------------V-------------KNRRVSVTFREF--AD----------------------N-LV----V---DQEILKT--IVM-SATTFT--------- GSEM1_T00000659001_Phytomonas_sp_isolate_EM1_588321322 
CQCRGIRFC-IFC-KE-SE-------RVRK-------LLSAE---GKLLDPSVVIS-HQ------QEEERTSSCSL-T--LLE--ESR-LRFC--INC-------QAI-----F-V-SSVP--LMSCSQHS--------------------S-S----LRSN---V-----------S-I--LGLHVVRDFLSDDEETYLVKFLD-------------------------------D-P-SP-Y-P--PWK--ESQSG--------RRVQ------EYGPKKNFKKK----------KIK-TT--E--FV--SIPLPFEN-I-LD-----------------------------------------------------------------------------KARG-----------LVERLTEK-PY-II-AE-------------VSVLE---YLRRRLSNFDPHIDDTWLWGDRIAGINLLEDCFITFVNS-E----------------------------------------------------G-------------------------------------------------------------------------VCLE-------VFLPRKCFFLMSGESRYSWMHAIRPEN----------------V-------------GNRRISFTIREL--SD----------------------A-FK----E---ENETACC--QITDAAKNYL--------- LMXM_36_1970_Leishmania_mexicana_MHOM/GT/2001/U1103_401420114 CSCAGIRFC-AKC-CD-SS-------RVQQ-------LFSGS---VPLSSASSVIE-KQ------HHEERLSSCSF-A--TIG--KSS-LSFC--IEC-------MSV-----F-K-SEVP--IKSCADHH--------------------G-A----MATS---I-----------T-I--SGLAVFQDALPEEDETAVIRFLD-------------------------------D-S-HP-F-P--PWK--ESQSG--------RRKQ------DYGPRRNFKKK----------KVK-VA--D--IP--AMPLVFEP-I-FS-----------------------------------------------------------------------------VISS-----------MTETFTGK-AY-RI-AE-------------VSALE---YMEGKMSNFDPHVDDTWLWGDRIAGLNLNEACAVTFVNS-E----------------------------------------------------G-------------------------------------------------------------------------VCCD-------VYLPRRTFFLMSGDCRYKWMHGIRPEH----------------V-------------RGRRISLTFREL--SD----------------------E-IL----A---DTEASEM--VLS-AASNFV--------- STCU_00887_Strigomonas_culicis_528254392 
CACSGIRFC-NRC-IN-SD-------RVKY-------LFSGN---VQLRNADDVIA-NQ------TSNDRTSSLSF-A--LLG--NSH-KSVC--IEC-------GVV-----Y-N-SSVL--ITSCADHK--------------------N-K----TDDK---V-----------D-VMVKGLFVERDVISDQEEADLIHFFD-------------------------------N-P-SP-F-P--DWK--ISQSG--------RRKQ------DFGPKRNFKKK----------KVK-PA--D--FP--HMPKVFEP-L-FC-----------------------------------------------------------------------------KVSK-----------EVSQCTASIPY-SI-AE-------------VSVLE---YTSENMSNFDPHIDDTWLWGDRIAGVNLLEDCVMTFVDS-N----------------------------------------------------G-------------------------------------------------------------------------NVVD-------AFLPRRCLFLMSSDCRSIWMHGIRPEN----------------I-------------KGRRVSITMREL--SD----------------------E-IK----Q---DLAVAAP--LLE-AARSFV--------- DQ04_02651050_Trypanosoma_grayi_686635833 CCCSGIRFC-DRC-IS-SD-------RAQA-------IIHQS---VDLTKASDVVS-QQ------YGSERTSSCSF-A--VVG--PSL-YSFC--WEC-------SSV-----F-QMHGRV--VRSCADHV--------------------ELS----SVTN---L-----------S-L--AGLFVAPDFLSLLEEEKLVMFFD-------------------------------N-P-SS-F-P--GWK--PSQSG--------RRKQ------DFGPRANFKKR----------KLR-VP--G--TP--WMPQQLKD-V-LG-----------------------------------------------------------------------------KVSS-----------FVTQQSGK-PF-CI-VE-------------ASVLE---YTSENSSSIDPHIDDTWLWGDRIGGLNLLEDAVMTFVHG-N----------------------------------------------------G-------------------------------------------------------------------------TAVD-------VFIPRGAFFLLSKDSRFIWMHGIRQEN----------------I-------------RNRRISITVREL--AE----------------------D-LE----V---DAELLKT--IME-NASSFV--------- Tc00.1047053510187.490_Trypanosoma_cruzi_strain_CL_Brener_71659461 
CCCSGIRFC-GRC-IE-SE-------RAQG-------ITHQN---VLLVKSSDVIS-QQ------YGAGRTSSCSF-T--CVE--LSH-YGLC--WQC-------NRI-----F-LMHHGV--FKSCADHE--------------------GTT----PNLD---I-----------R-I--EGLFVIPDFLSLLDEEKLVSFLD-------------------------------E-P-SS-L-S--GWK--HSQSG--------RRKQ------DFGPRANFKKR----------KLN-TS--G--IR--GMPKQLES-V-ME-----------------------------------------------------------------------------KVKS-----------FVRDITSK-EY-HI-VE-------------VSALE---YTSENSSSIDPHIDDTWVWGNRVGGLNLLEDTVMTFVNN-E----------------------------------------------------G-------------------------------------------------------------------------TAVD-------VFLPRGAFFLLSNGSRYDWLHGIRLEN----------------I-------------KHRRISFTFREF--SN----------------------D-LD----I---DREIIQN--VIK-ITYTFV--------- AGDE_03849_Angomonas_deanei_528266291 CTCTGIRFC-ALC-AD-SA-------RVRE-------VFENK---ELLRSADAVIA-NQ------SQKDRSSSYSF-A--LLN--KSK-YALC--AEC-------GAT-----F-KLRDHP--FRACAEHA--------------------G-K----EQEP---K-----------R-I--GGLHVMENIVTPDMEQSLLDFFD-------------------------------NTP-EP-F-G--GWK--VSQSG--------RRKQ------DYGPKRNFKKK----------KVK-PS--D--IP--HMPLVFKK-L-FA-----------------------------------------------------------------------------DVSQ-----------AVSGPTGK-PF-DT-VE-------------VSVLE---YTTERMSNFDPHVDDTWLWGDRIVGLNLLEDCIMTFVDA-E----------------------------------------------------G-------------------------------------------------------------------------DALD-------VCLPRRCLFIMSGESRYTWMHGIRPES----------------I-------------LNRRISMTMREL--SD----------------------E-LL----S---DTEAATM--IKE-AAHSFI--------- AGDE_09971_Angomonas_deanei_528238215 
CTCTGIRFC-ALC-AD-SA-------RVRE-------VFENK---ELLRSADAVIA-NQ------SQKDRSSSYSF-A--LLN--KSK-YALC--AEC-------GAT-----F-KLRDHP--FRACAEHA--------------------G-K----EQEP---K-----------R-I--GGLHVMENIVTPDMEQSLLEFFD-------------------------------NTP-EP-F-G--GWK--VSQSG--------RRKQ------DYGPKRNFKKK----------KVK-PS--D--IP--HMPLVFKK-L-FA-----------------------------------------------------------------------------DVSQ-----------AVSGPTGK-PF-DT-VE-------------VSVLE---YTTERMSNFDPHVDDTWLWGDRIVGLNLLEDCIMTFVDA-E----------------------------------------------------G-------------------------------------------------------------------------DALD-------VCLPRRCLFIMSGESRYTWMHGIRPES----------------I-------------LNRRISMTMREL--SD----------------------E-LL----S---DTEAATM--IKE-AAHSFI--------- OXYTRI_23312_Oxytricha_trifallax_403359506 CACTGVRYC-KDC-ND-PEFRK----QFKD-------LYPID---DILEAQQKVLT---------------------------------YVTC--GLC-------NRF-----K-L-KNEL--NISISDQNDGNNDKQDNQNDQSFIDQCIGHL----TAEQ---L-----------D-F--GGLYTIKEIISEDFEYNIVNKLQ-------------------------------D--------Y--KWV--DSQSG--------RKKI------DFGPQVNFKKQ----------KLK-YT--K--FT--GFPLFIKP-I-LD-------LISTLNDQNLQQKEEELKEEQKDQALL-------------------------------------------QLKQ-----------HLPSVLK--DF-QP-IE-------------VNVLE---YDEQRGSNIAPHKDDFWLWGERIIGINLLKDTFMTFQRD-S---------ENQL---------------------------------------G--------------QI---------------------------------------------------------VEIE-------VPVKRRMMYVISGKSRFEWMHGIKSEH----------------I-------------KGKRIVCTFREF--SD----------------------E-FK----S--QDNEDANK--IRE-IAKNFI--------- OXYTRI_08063_Oxytricha_trifallax_403343479 
CACTGVRYC-KDC-ND-PEFRK----QFKD-------LYPID---DILEAQQKVLT---------------------------------YVTC--ELC-------NRF-----K-L-KNEL--NISQSDQNNGNNDKQDDQNYQSFIDQCIGHL----TSEQ---L-----------D-F--GGLYTINDFISEEFEQDIVNKLQ-------------------------------D--------Y--KWV--DSQSG--------RKKI------DFGPQVNFKKQ----------KLK-YT--K--FT--GFPLFIKP-I-LD-------LIQTLNDEILQEKIEEQKNQQKPQ-----------------------------------------------LKE-----------NLPSVLK--DF-IP-IE-------------VNVLE---YDEQRGSNIAPHKDDFWLWGERIIGINLLQDTFMTFQRD-S---------QNQI---------------------------------------G--------------QI---------------------------------------------------------VEIE-------VPVKRRMLYIISGKSRFEWMHGIKSEH----------------I-------------KGKRIVCTIREF--SD----------------------E-FK----S--QDNEDANK--IRA-IAKTYI-QA--NIKY Pmar_PMAR003551_Perkinsus_marinus_ATCC_50983_294944511 CACTGVRSC-RLC---------------EE-------VTGRS---LKSTHPRP-----R------YLPDGVAD--------------------------------------------------------------------------------T----AKSD---I-----------V-P--PGLVVLADAITEAEEATLLGDIY-------------------------------A--------R--PWK--LSQSG--------RRKQ------DYGPQVNFKKR----------KLKCPD--N--FQ--GLPHSIDL-V-LP-----------------------------------------------------------------------------RIHT-----------GLGLLLDH-AW-H---E-------------MVVQE---YAVSRGSSIDLHVDHSWVWADGILDLSLAADCIMAFANPKE----------------------------------------------------G-------------------------------------------------------------------------VYYD-------VGLPRRSACLIAGLSQTQWMHGIKRDN----------------A----------CLGGDTRVSITLRVL--DG----------------------A-VA----L---TAEGQET--IRR-SRMRC---------- NCLIV_063160_Neospora_caninum_Liverpool_401412938 
LARPSLRD--PS--------------------------ESPA---APACPGRRGLSMKN------LEEELMSLLGLCGDGGAG--ISG-DRGCGAQGG-------SGV-----S---EMQP--MAGADARP--------------------ELV-AG-RRER-P-G-----------D-G-PFGVFLLPDALQPQEETEILAWADGGTEE-TQAREGDSRAEACGGE----GEPRRE-T-RAKEQG--FWA--LSQSG--------RRKI------DFGPKVNFKKK----------RLK-LG--L--FN--GFPPFTKR-L-LALHPDERSPASSCSRSGCSSPS--SSPSSPPPPPLSSRS-VRAFPFPVAASSVCTVER-------------AGVQGAERLTL--Q------A-FRKKLLS--TF-QP-VE-------------LCLLE---YVPSRGSHIEEHFDDFWLWGPRLVTFTLASSTILSFVSP-------VFCVPREL---FEAARPHSLCR-HGED------------TPSP----SSYPSPSSSSAASPEALQASPASAPHSVCGRSAKKSASSVSSSASRESSLPSLSLGPSSLPSSSSLASECGRVRVEIR-------VLLPRRSLVVCEGPCRYTWTHAIRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGESAEVGQK--LLS-LAASFN-----GSPV TGMAS_246140_Toxoplasma_gondii_MAS_672578582 VELQALEH--ATT-----R-------RDRA-------LRAPV---PPVSEGKRGLS-SD------LEEALLPLLGL----SEA--HSD-AEAC--EEG-------GEK-----R---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPENASSAPPSCSDSFLPSPADPSSPTVSSAPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTPGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASTSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-QSAFFN-----GSPV TGGT1_246140_Toxoplasma_gondii_GT1_523576915 
VELQALEH--ATT-----R-------RDRA-------LRAPA---PPVSEGKRGLS-SD------LEEALLPLLGL----SEA--HSD-AEAC--EEG-------GEK-----R---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPGNASSAPPSCSDSFLPSPADPSSPTVSSAPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTPGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASSSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV TGFOU_246140_Toxoplasma_gondii_FOU_672285053 VELQALEH--ATT-----R-------RDRA-------LRAPA---PPVSEGKRGLS-SD------LEEALLPLLGL----SEA--HSD-AEAC--EEG-------GEK-----R---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPGNASSAPPSCSDSFLPSPADPSSPTVSSAPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTPGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASSSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV TGVAND_246140_Toxoplasma_gondii_VAND_672573839 
VELQALEH--ATT-----R-------RDRA-------LRAPV---PPVSEGKRGLS-SD------LEEALLPLLGL----SEA--HSD-AEAC--EEG-------EEK-----S---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMGTGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPENASSAPPSCSDSFLPSPADPSSPTVSSPPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTPGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-CLSSKAVSSSAVASTSGSSSASCSSAAPSDSCSLPSLGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV TGRUB_246140_Toxoplasma_gondii_RUB_672301308 VELQALEH--ATT-----R-------RDRA-------LRAPA---PPMSEGKRGLS-SD------LEEALLPLLGL----SEA--HSD-AEAC--EEG-------GEK-----R---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPENASSAPPSCSDSFLPSPADPSSPTVSSPPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTPGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASTSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV HHA_246140_Hammondia_hammondi_675123610 
VELQALEQ--ATT-----K-------RERV-------LRAAV---SPVSEGKLGLS-SD------LEEVLIPLLGL----SDA--HSD-ADDC--EEG-------GEN-----R---EAEV--VVRKRDSS--------------------QFA-TR-RRDG-L-A-----------E-S-PFGVFLLPDALERQEEAAILAWADGNMETGAQSREATHKPRPRGDTREYGGETETE-P-RA---G--FWA--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFTKA-L-LSQPEKASCAPPSRSDSFLPSPSNPSSPDVSSVPSV------------FSPRATSLMESPRAAASNVRTPAGTGKRGVGRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VVCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-QLSPASSCSP-SLSPAAVSSSSGASPSCSSSDPCSSSDPSDSCSLPAPCLASGEFGRVRVEIR-------VVLPRRSLVVVEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGKNEEVGQK--LLS-LSAFFN-----GSPV TGVEG_246140_Toxoplasma_gondii_VEG_557738285 VELQALEP--ATT-----R-------RDRA-------LRAPV---PPVSEGKPGLS-SD------LEEALLPLLGL----SEA--HSD-TEAC--EEG-------GEK-----R---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPENASSAPPSCSDSFLPSPADPSSPTVSSPPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTSGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASSSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV TGME49_046140_Toxoplasma_gondii_ME49_237835449 
VELQALEP--ATT-----R-------RDRA-------LRAPV---PPVSEGKPGLS-SD------LEEALLPLLGL----SEA--HSD-TEAC--EEG-------GEK-----R---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAKTE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPENASSAPPSCSDSFLPSPADPSSPTVSSPPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTSGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASSSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV TGP89_246140_Toxoplasma_gondii_p89_672276401 VELQALEH--ATI-----R-------RDRA-------LRAPV---PPVSEGKRGLS-SD------LEEALLPLLGL----SEA--HSD-AEAC--EEG-------EEK-----S---EAEV--VMRKRDSS--------------------QFA-MR-RRDG-L-V-----------E-S-PFGVYLLPDALERQEEAAILAWADGNMETGAQSREATHKPQPRGDTREYGGDAETE-P-RA---G--LWV--LSQSG--------RRKI------DFGPQVNFKKK----------RLK-PG--R--FN--GFPPFAKA-L-LSQPENASSAPPSCSDSFLPSPADPSSPTVSSPPSVSSSSGVSSSPDAFSPRVACITESPRGAASDVRTSGGTGKRGVDRLTP--P------A-FRAELLA--NF-QP-VE-------------LCLLE---YVPDRGSHIEEHFDDFWLWGPRLVTFNLASSTILSFVSP-------VFCVSREL---FEVARAQRLFR-SHQS------------PLSP----S--PVFSSEAPETP-PLSPASSCSP-SLSSKAVSSSAVASSSGSSSASCSSAAPSDSCSLPSPGLASGEFGRVRVEIR-------VLLPRRSLVVFEGHCRYTWTHAVRSHH----------------I-------------FSRRVAVTLREL--AP----------------------E-FL----PGGENEEVGQK--LLS-LSAFFN-----GSPV PTSG_09879_Salpingoeca_rosetta_514683301 
MTMVVMEKK-KDD----RE-------EAAP-------PPATV---SAP-------------------------------------HVQ-YRWC--EA------------------C-GKLR--IMPPGDVP--------------------CTGASTCASAESLGL-----------PAF--EGVHVFREFVTADEEQALLSQMD-------------------------------E--------W--PWK--LSQSG--------RWKQ------DFGPKVNFKRK----------KVK-VG--N--FT--GLPSSSRD-L-VA-----------------------------------------------------------------------------RMQA-LP------C-LQ-------DF-EP-VE-------------QCHLD---YRPERGAAIDMHFDDDWIWGERLVTVNLLSETRLSFEHP-Q----------------HE---------------------------------GA--------------------------------------------------------------------------QVY-------VVLPARSLVAVQGSARTQWKHAVHRGA----------------I-------------TDRRIAVTLREL--GP----------------------D-FV----AGAGRAEEGRH--LLD-IAHTFK-----GTPL OT_ostta04g01970_Ostreococcus_tauri_693500295 KGCGRLASS-EKR-RE-VV-D-----DFRG---------------RRAHTGARTTG---------WVAKDDLAGGW-I-------DRR-------DGG-------GAT-----V---GKST--PVDAEGTK----D---------------AFA-EM-ARAM-K-R----VGAV---K-M--SGHFLLLDFITEDEERAIVEYLD-------------------------------A-D-TS---R--PWK-DSSFNG--A-----HEGK------KYGVEPNLLKR----------TVE-PT-----KV--PIPKILKQLV-IK-----------------------------------------------------------------------------KFAS-AH--E---T-LR-------RF-EP-NE-------------CNAIN---YRKDLGSVLTPHCDDRQLSSDILVNLSLVSDCTMTYIHE------------------------------KHPE----------------------------------------------------------------------------------------------RRVE-------VYLPRRSLQIQSGSTRYDYMHSIANEN----------------L-----------H-GDRRVSVTFRES--GA----FTAKKK---------------------------------------------------- Ot04g02100_Ostreococcus_tauri_308802830 
KGCGRLASS-EKR-RE-VV-D-----DFRG---------------RRAHTGARTTG---------WVAKDDLAGGW-I-------DRR-------DGG-------GAT-----V---GKST--PVDAEGTK----D---------------AFA-EM-ARAM-K-R----VGAV---K-M--SGHFLLLDFITEDEERAIVEYLD-------------------------------A-D-TS---R--PWK-DSSFNG--A-----HEGK------KYGVEPNLLKR----------TVE-PT-----KV--PIPKILKQLV-IK-----------------------------------------------------------------------------KFAS-AH--E---T-LR-------RF-EP-NE-------------CNAINCTFYS----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- Bathy06g04110_Bathycoccus_prasinos_612393879 HHVVRLQEHVNEC-LD-ER-A-KR--EEKE---------------EKEEREHSVIT-DA------EKTETLASKNC-R-------KRK-------AFE-------EKE-----E---MPSIPIPTIPNAKN---ND---------------AFL-KI-KRGS-I-EMSKISSPS---H-I--SGHKVIENFITEEEETELLRAVY-------------------------------D-M-SE---Q--TWN-DRNASGNGR-----HNGK------SWGVNADRARR----------KVF-KA-----KR--EMPEAFQR-IVLE-----------------------------------------------------------------------------KLRA-NE--YGYKG-LR-------EFGFACNE-------------CNAIE---YVREKGHELRPHVDDRILSSDIIINLSMVGTCIMRYRMN------------------------------KNQG----------------------------------------------------------------------------------------------IEIA-------KKLPRFSLQIQGGRCRFEWEHGIRNED----------------L-----------I-DGKRVSLTFRCS--GRGNGKDGVYKD---------VVFE-EP----PRGYVNDLEGR--------------------- MICPUCDRAFT_60114_Micromonas_pusilla_CCMP1545_303282765 
GGETAAAAV-NVH-GQ-IA-H-----TARG---------------TPWRRSMAASGMDA------LEDDDARRRGD-D-------DDD-------DDDVCVSRDGNDD-----I---ANDA--PGDASSNV---KD---------------TLA-RL-MRAS-A-A----TSTSNTPS-L--PGHHLLLDFITEDEENALVAFLD-------------------------------D-G-ER---GIHDWK-PSTFNG--A-----HRGK------AWGVRVDLKRR----------TVS-PP-----TR--EMPPRLLA-V-AE-----------------------------------------------------------------------------KMRG-AH--A---L-LA-------RF-SP-NE-------------ANAIS---YDKRLGDRLLSHVDDRQLSSDVLVNLSLCGECVMTYERT----------------T-TRSSGGG-----TRGS----------------------------------------------------------------------------------------------DRVD-------VRLPRRSLQIQSGDARYAFAHSIANEN----------------L-----------L-DPRRVSITFRES--RT----PSTRTT---------------------------TRGK--------------------- OSTLU_86981_Ostreococcus_lucimarinus_CCE9901_145345998 DEAGKMPSS-EKR-RD-VR-E-----DYRG---------------RRAHTGRTTTG---------FVAKDDLEGGW-I-------ARA-------GEC-------GAR------------------SDGAR---SD---------------AFA-AL-ARGA-K-L----QAKV---K-L--PGHYILENFITEDEERRIVDWLD-------------------------------D-DIAA---G--PWR-DSSFNG--A-----HQGK------KYGVEPNLLKR----------CVE-PA-----RV--PMPKILRDLV-VA-----------------------------------------------------------------------------KFAA-AH--E---T-LK-------HF-TP-NE-------------CNAIN---YRKDLGSVLTPHCDDRQLSSDILVNLSLCSDCTMTYSHE------------------------------KFAS----------------------------------------------------------------------------------------------KRVD-------VRLPRRSLQIQSGSTRYDYMHSIANEN----------------L-----------H-GNRRVSVTFRES--GV----LQKSPQTPKWRPNQ-------------------------------------------- MICPUN_62550_Micromonas_sp_RCC299_255085018 
RDETPRASV-------------------------------------PDQEEPAATG-EK------VSDQTTTKTGE-L-------DDD-------DDD--------------------DTK--P-----------S---------------ALA-EL-MRAS-A-T----APT----R-L--PGHHLIPDFITPDEESRLIEYLD-------------------------------R-D-ES---DTNPWR-PSNFNG--K-----HRGK------KWGVEVDLKRR----------TVA-PE-----RR--PLPALVRA-V-AD-----------------------------------------------------------------------------RMPA-AH--P---A-LR-------GF-VP-NE-------------ANAID---YDRRGGAELLPHVDDRQMSTDLIVNLSLAGDCVMTYVED----------------A-GRDGRRGGWEGVPAGA----------------------------------------------------------------------------------------------RRVD-------VFLPRRSLQVQSGPCRFNFAHSIRNEN----------------L-----------R-APRRVSITFRRS--QM----PRTRTR---------------------------VRGE--------------------- CHLREDRAFT_95290_Chlamydomonas_reinhardtii_159473956 --------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MRNK------RWGVQPDYHRR----------GVA-PA-----AH--PLPPLLLA-L-AR-----------------------------------------------------------------------------RMRA-AV----GGP-LK-------DF-SP-NE-------------ANAID---YRRSHGSWLKPHADDRILSGEVIVNLSLAGDCVMTLARL-------------------APQSGS-----GGGG--GGGG-----------------------------VAGAAGGGGA-------------------------------------------------DEFR-------IRLPRRSLQIMSKAARYSYSHGIWPAD----------------L-----------L-DERRVSITFRHS--KP-R--PCAK------------------------------------------------------ VOLCADRAFT_119009_Volvox_carteri_f_nagariensis_302847355 
SGGGASASC-RHG-LN-EH-G-----LDPG----PVAGNIPL---HPMFLPRGAVA-SR------RVCTTWGPKPA-C-------RQE-------GAE-------EEE-----E---AARA--AVPEVIAE---AA---------------AVA-GE-MRAE-V-S-HP--------A-L--EGQYLVLEFVTPAEEAELLAMCD-------------------------------D-P-VL-KPSWSPWI--GQMYG--NATAQKTRGK------RWGVLPDYHRR----------GVA-PV-----EH--PLPPLLRI-L-TE-----------------------------------------------------------------------------RMRV-QV----G-L-LR-------CF-QP-NE-------------ANAID---YWRSRGSWLRPHVDDRILSGDLIVNLSLGGAAVMTFARE-------------------RDKDEG-----GHPGLQQHQQ-----------------------------QQQHPRQSAG-------------------------------------------------DEVR-------VRLAPRSLQILSRAARYSYTHAIAASD----------------L-----------L-DARRVSITFRRS--EL-R--PFERAE--R------------------------------------------------- Ot04g02090_Ostreococcus_tauri_308802828 ------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------MFAH-AT--D--------------------------------------------RKDLGSVLTPHCDDRQLSSDILVNLSLVSDCTMTYIHE------------------------------KHPE----------------------------------------------------------------------------------------------RRVE-------VYLPRRSLQIQSGSTRYDYMHSIANEN----------------L-----------H-GDRRVS-TVKHT--DA----DSVCAR------NHLLLHT-LP----RRTPAMSSRDP--------------------- SELMODRAFT_403903_Selaginella_moellendorffii_302757669 
RPTSKVAPK-KRA-LK-KS-S-----SQQQ---------------RLVNTGASSFV-EC------PACQFLLSAGR-D-------ETI-------CLD-------EQA-----S---CSSV-------------------------------FV-AL-SSSR-V-Q----PKPS---V-M--NGYQLVENFISCEEEEKLIFAMD-------------------------------S-D-AR---N--LWK-PYSFTG--L-----NNGK------SYGLVMALGKRFVEINAFVHRKIL-AP-----KF--EMPHVLEP-I-ME-----------------------------------------------------------------------------RMRS-IP--L-----LA-------EF-FP-NE-------------MNSLE---YIRESGHFLRPHVDDRQLSGTLIVNLSMCGECYMTFKRE------------------------------RGAY----------------------------------------------------------------------------------------------ECHK-------IRLKQRSLQILTGDSRYNFTHEIENRD----------------L-----------L-SPRRVSITFREV--IS-------------------------------------------------------------- SELMODRAFT_406367_Selaginella_moellendorffii_302763503 RPTSKVAPK-KRA-LK-KS-S-SRASPQQQ---------------RLVNTGASSFV-EC------PACQVLLSAGR-E-------ETI-------CLD-------EQA-----S---CSSV-------------------------------FV-AL-SSSR-V-Q----PKPS---V-M--NGYQLVENFISCEEEEKLIFAMD-------------------------------S-D-AR---N--LWK-PYSFTG--L-----NNGK------SYGLVMALGKRFVEINAFVHRKIL-AP-----KF--EMPHVLEP-I-ME-----------------------------------------------------------------------------RMRS-IP--L-----LA-------EF-FP-NE-------------MNSLE---YIRESGHFLRPHVDDRQLSGTLIVNLSMCGECYMTFKRE------------------------------RGAY----------------------------------------------------------------------------------------------ECHK-------IRLKQRSLQILTGDSRYNFTHEIENRD----------------L-----------L-SPRRVSITFREV--IS-------------------------------------------------------------- AURANDRAFT_66792_Aureococcus_anophagefferens_676394053 
RDAAAEAAA-LDD-VA-SR-A-----ARRE------ALCLMC--CEKRYTNCHREG----------LATALVARG------------------------------------------VRVV--HLRADGDA----E---------------AHP-RP-LAAL-R-A----ARHE---A-L--AGQYTVANFLSEAEEASLLAFLD-------------------------------G---EP---G-HPFV-RRDFNG--P-----ARGK------AWGVRTDLKRR----------TFA-EP-----AR--AMPDIFAP-L-VR-----------------------------------------------------------------------------RMRT-IP------E-LR-------SF-RP-NE-------------ANALD---YRRSEGHYLGAHCDDRQLSGPILVNLCLAGDATMTYTRD----------------------QAGR----GSAG----------------------------------------------------------------------------------------------ETVR-------ARLPRRALQIQSGSVRFDYRHGITNAD----------------F-----------H-ADRRVSITFRMN--KH----PGHRFG----------------------------RMMAGWTP-LRAWLI-----VATP EMIHUDRAFT_458739_Emiliania_huxleyi_CCMP1516_551571747 RETAHGPAA-TAG-AQ-LR-A-----TPPRLQLLWAALSPACGRFAAEHVQRHALG----------PCQRSVGRG------------------------------------------SESA--VVNAAAGG----R---------------CEV-RA---------------HA---V-L--RGLFLVHDFVTEEEEAALLQWMD-------------------------------G-Q-Q----P--GWR-LRHFNG--P-----ALGM------RWGATTDLRRR----------SVT-LG-----AP--MPPPLLA--L-TA-----------------------------------------------------------------------------RMRT-LPVPS---P-LA-------GF-EA-NE-------------ANALR---YVRAEGHFLGPHCDDRQLSGDTLVNLSLAGEATMTYAHD----------------------RDG-------SR----------------------------------------------------------------------------------------------PPVR-------VRLPRRSLQIQTEDVRFNHTHGIAAED----------------L-------------PELRVSVTFRRA--KL----TQ-------------------------------------------------------- GUITHDRAFT_150889_Guillardia_theta_CCMP2712_551671246 
--------------------------------------------------------------------------------------------------------------------------------------------------------------MKPK-A-I-R---------D-V--PGLFQFPDFITEDEEGLLLQSLD-------------------------------T--------G-NKWQ-LSSFNG--E-----CMTQ------RWGVVTDLKRR----------SVR-PCSIERGEE--DLPSFLRA-I-IE-----------------------------------------------------------------------------KWIN-RC----E-V-IA-------QF-HP-NE-------------ANANS---YEKHKGHSLAAHFDDRFLSGDILVNLSLGADCHMTFARK--------------------------------------------------------------------------------------------------------------------------------DKIK-------VLVPRRSLQVVTGRARFEHTHGIDLDD----------------F-----------H-GPRRVSITFRRA--KL-T--CT-------------------------------------------------------- MONBRDRAFT_26078_Monosiga_brevicollis_MX1_167524358 ---------------------------------------------------------------------------------------------------------------------------------------------------------------------M-----------P-F--TGVRVWLDFVTEAEETALVAQMD-------------------------------A--------W--PWT--DSQSG--------RRKQ------DFGPKANFKKK----------KVK-AA--N--FT--GLPALARP-L-IP-----------------------------------------------------------------------------RLQA-LT------ADLK-------DF-VP-VE-------------QCHLD---YEPSRGAAIVPHFDDFWLWGERLITLNLLSTTFLCFQLP-D----------------EP---------------------------------DA-------------------------------------------------------------------------LEIA-------VPLPPRSLLEVKGVARMAWHHAVHRQD----------------V-------------SQRRIAMTWREL--TP----------------------E-FL----SGE-RQAEGHA--LLD-LAATYT-----GLPM consensus/100% 
...................................................................................................................................................................................................................................................................................................................................................................................................................h............................................................................................................................................................................................................................................................................................................................................................................ consensus/95% .......................................................................................................................................................................................G.......l.......hl..h.............................................h....p..G...........b.......aG...sh.+b...........l..............hP........................................................................................h.......................a.....E..............s.bp...Y.......h..H.DD.bl.u..l.shsh...s.hsh..............................................................................................................................................h.h....h.h....sR..a.H.l...p................h...............pRls.ThR.................................................................... 
consensus/90% .......................................................................................................................................................................................G.....phlp...E..ll..hp............................................W....sbsG........pb.b......paGsp.sh.+b..........pl...s..........hP...b..l.h..............................................................................ph.......................a.....E..............s.lp...Y.......h..HhDD.bl.u..l.shsl...s.hoh.....................................................................................................................................h........l.ls..sh.l....sRa.a.Hul..pp................l...............+RlshThRc................................................................... consensus/85% .....h.................................................................................................................................................................................G..lh.shlp..-E..ll..hD............................................W....SbsG........+c.b......caGsp.sh+++..........plp..s.....b....hP...c..l.h..............................................................................ph.............h.........a....sE..............s.l-...Y...p.u.h..HhDD.bl.G.blsshsL..ss.hoh.p...................................................................................................................................h........l.lPpbuh.lb.s.sRapa.Hul..cp................l...............RRlshThRc................................................................... 
consensus/80% ..h.sh.................................................................................................................................................................................Gh.lh.shlp..-E..ll.bhD...............................p............W....SboG........++.b......caGsp.Nh+K+..........+lp..s.....b...shP...c..l.h..............................................................................+hp............h.........a.p..sE.............hs.L-...Y..pp.u.lp.HhDD.hL.G.blsslsLh.ss.hohsp...................................................................................................................................h........l.lP+Ruh.lhpG.sRapa.Hulp.cp................l..............sRRluhThREh..ss.............................................................. consensus/75% s.h.ulb........p......p.....p......................................................................................................................................................h...Gl.lh.shlo.pEE..ll.hhD...............................p........s...W....SQSG........RRpb......caGPpsNFKK+..........+lp.ss.....h...shP.hhc..l.h..............................................................................+hps...........h.........a.p..sE.............hssL-...Y..ppuS.lcsHhDD.WLWGpblsslsLhpsshhoasps..................................................................................................................................lp.......V.LPRRuhhlhpG.sRapW.Hulc.cp................l..............sRRluhThREh..us........................h..........................h.......... 
consensus/70% s.hpulc........p.pp...p.....p.........................................................p.....s....s......................s.......s........................s...........h.............h...Gl.lh.-hloppEE..ll.hhD...............................p........s...W....SQSG........RRKb......-aGPpsNFKK+..........+lc.ss.....h...uhP.hhc..l.h..............................................................................+hps...........h........sF.ps.sE.............hssL-...Y..pcuS.l-sHhDD.WLWG-RllslNLhssshhoasps................................................................................................................................splc.......V.LPRRSlhlhpG.sRapW.HuI+.cc................l..............sRRlulThREh..us......................p.h.........p.p..p...lb...s..a.......... hCG_40699_Homo_sapiens_119587492 -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------A-L-P--PGLMVVEEIISSEEEKMLLESV--------------------------------------------DWTEDTDNQN-SQKSLK-HRRV-K----HFGYEFHYENN----------NVD-KD-KPL-SG--GLPDICES-F-LE-----------------------------------------------------------------------------KWLR---------K----------GY-IK-HK----PD-------QMTIN-Q-YEPGQG--IPAHIDTHSAFEDEIVSLSLGSEIVMDFKHP-D----------------------------------------------------G-------------------------------------------------------------------------IAV--P-----VMLPRRSLLVMTGESRYLWTHGITCRKFDT-VQASE-SLKSG-IITSDVGDLTLSK-RGLRTSFTFRKV--------------------------------R-------QTPCN--CRA----------------\AlkBH8 used for comparison GLOINDRAFT_52996_Rhizophagus_irregularis_DAOM_181602_552928730 
---------------------------------------------------------------------------------------------------------------------------------------------------------------------------------P-I--DGLFLIDDFISEQEEFELINSID-----------------------------------LC------EWSGNGIPPN-PE---M-RRRT-Q----HYGYEFSYRYR----------KVV-QN-----LG--VLPNFLDF-L-IK-----------------------------------------------------------------------------RFIE---------K----------KF-IQ-STEQEYPN-------MCIIN-E-YQAGQG--IMPHTDSPEIFGPVILSLSILTSCLITFTHI-Q----------------------------------------------------D-------------------------------------------------------------------------SSNQSI-----ILLKPRSLLVMTKSSRFDYKHSISKDAIEY-YNGEE-------I-----------K-RDRRVSLTFRTI----------------------------V-------------------------------------| RirG_035650_Rhizophagus_irregularis_DAOM_197198w_595492923 -----------------------------------------------------------------------------------------------------------------M---TTSE--QLNSQSFP--------TTSQS--------------YLSS---L-N-------S-P-I--DGLFLIDDFISEQEEFELINSID-----------------------------------LC------EWSGNGIPPN-PE---M-RRRT-Q----HYGYEFSYRYR----------KVV-QN-----LG--VLPNFLDF-L-IK-----------------------------------------------------------------------------RFIE---------K----------KF-IQ-STEQEYPN-------MCIIN-E-YQAGQG--IMPHTDSPEIFGPVILSLSILTSCLITFTHI-Q----------------------------------------------------D-------------------------------------------------------------------------SSNQSI-----ILLKPRSLLVMTKSSRFDYKHSISKDAIEY-YNGEE-------I-----------K-RDRRVSLTFRTI----------------------------VN----FNENNENMEKG--GCK-FNNNSV-HY--K---| LCOR_06335.1_Lichtheimia_corymbifera_JMRC:FSU:9682_661182209 
-----------------------------------------------------------------------------------------------------------------------------------------------M--------------HTID---Y-G-------T-Q-I--PGLLVIEDFVTDEEEVALVTEVD-----------------------------------SR------TWCGLGVSPN-PE---L-KRRT-Q----QYGHLFSYRYR----------KVL-EK-----YG--PLPAFTHT-V-VT-----------------------------------------------------------------------------RIME---------N----------KL-MP-KE----PD-------HLLVN-E-YNAGQG--IMPHTDAPALFGPAILSLSLLSACVMKFTHV-E----------------------------------------------------K-------------------------------------------------------------------------GNSV-D-----ILLPRRSMLVMTGDARYLYKHSISKDLVESSSEGVT-------V-----------H-RDRRVSFTFREI----------------------------IA----WEVPPEDPSCG--CTK-SCSNK----------| EMIHUDRAFT_240478_Emiliania_huxleyi_CCMP1516_551578395 -------------------------------------------------------------------------------------------------------M-QPE-----A---CHGV--PLANLEAP--------------------------------------------S-A-V--EGLALHPDFVSEDEERELLRTVD-----------------------------------AQ------PWDC--------T---I-RRRT-Q----HYGRRFDYHNK----------TVG-DE-----AT--PLPAEIRR-L-IL-----------------------------------------------------------------------------ERVD-------VQP----------AL-LP-WR----PAAGREGSLQCTVN-E-YPPGVG--IAPHIDTHSAFEDGIASLSLGGGCAFRLRRG-S----------------------------------------------------D-------------------------------------------------------------------------GADH-S-----VWLPPRCLLVMSGAARYEWQHSISGRKFDR-VEHRESPAGWEWV-----------P-RERRISVTLRRV-LGS----------------------G-TA----------GTGQS--AW-----------------| ACA1_127620_Acanthamoeba_castellanii_str_Neff_470376461 
--------------------------------------MSRN---QRVQPGGGSRG-GK-----PSGRGRGGAKPA----VPGW-APV-ATTS--TDS-A-----PPL-----A---LAYT--PSTSSSSS--------SGL-R--------------QAAA---L-P--------------PDLEYIEDFITADEERALVQAID-----------------------------------AQ------EWSE--------K---L-HRRT-Q----HYGYEFDYSRQ----------DIN-TS-----VPI-ELPVFAQQ-I-IE-----------------------------------------------------------------------------KMRQ---------R----------GL----PQ----FD-------QLIIN-E-YTPGQG--INPHIDKTHCFGPCVVSLSLLSTCVMTFTSL-E----------------------------------------------------T-------------------------------------------------------------------------GEKI-P-----VVLRPRSLVVLRGQARYGWQHGIEPKRADI-VAGKH-------T-----------P-RARRVSLTYRTV----------------------------AK----------SAGNA---------------------| MICPUN_86885_Micromonas_sp_RCC299_255086679 GDVDGVDMF--------DP---S----MAR--CVV--TMSRV---EDAIAAQAATH-ET-------CRPDLGDRRL----WVRF-SSD-PNAT--GG--------EPE--SR-A---KQEE--TWCAATRD--------SAT---------------------------------L-G-V--PGVTLITDFVTEEEEREMLACVD-----------------------------------SD-E----RWQG----------L-A-KRRV-L----HYGYAFDYGTR----------DAR-DK-----TS--PMPAFVAG-L-LG-----------------------------------------------------------------------------RAAS---------C----------GA-PG-AC----ES--VHCD-QLTVN-E-YVAGVG--IAPHVDTHSAFGPTILSLSLAGRAVMEFRLH-E----------------------------------------------------G-------------------------------------------------------------------------GEKE-PRERRAISMPPRSLLVLHGEARYRWLHYIPHRKRDA-IVGED---ECE-A-----------R-EERRVSFTFRRR--RE----------------------G-------------ACGCE--WPE-ACDSRE-GA--AQRL| NAEGRDRAFT_58773_Naegleria_gruberi_strain_NEG-M_290982964 
--------------------------------------------------MKRTLN-DF-L----QTSDKKKQSTL---------KKI-------KSN-------SEV---------SGAP--VKQISSYK---------------------------TDSE-----------------I--GGLYIIENIIDVAEERKLVKFID-----------------------------------SQ------KWN---DEIS--------RRTQ------HYGVSYNYGAR----------GVK-EA-LK--VP--PVPSEFSD-L-LE-----------------------------------------------------------------------------EIKN-KE-GL-D-S-IRNLMEGI-DF--------------K----QVIIN-E-YKGAKQG-ISKHVDHCQDFGPLILILSLGDECVMKFHKL-E------------------QVKEEDLKK-KKVK--------RT--EVSP----S-------------------------------------------------------------------------ECYD-------RRMPRRSLIILSGDARYQYQHEIPKTM----VFKID-GKQF--L-KRS-------E-SYRRVSITYRSL--TT----------------------D---------------------------------------| SAMD00019534_100830_Acytostelium_subglobosum_LB1_735850808 ----------------------------------------------------------------------------------------------------------------------------MATVEVV----D----------------------MDDG---V-----------Q-L-PPGLSLLTDFITEEEERILVDNID-----------------------------------KS------EWK---TEIA--------RRTQ------QYGYHYCYRLR--GVDELDD-QGQ-PM-----TP--PIPQYLQF-L-VD-----------------------------------------------------------------------------RLAA-TP------Q-------------IP-VG----MD-------QIIIN-E-YEPGQQ--IKPHIDSTKDWDACVVSLSCLSDWRMVFIPE-D----------------------------DDKS----------------------------------------------------------------------------------------------KEVS-------MVLPKRSLLVLKGDARYKWKHGIRSQ-----------------V-----------K-VGRRVSLTFRHY----------------------------IG----SGGNS---------------------------| BATDEDRAFT_87049_Batrachochytrium_dendrobatidis_JAM81_575476790 
---------------------------------------------------------------------------------------------------------MDSSVGLDA---ADDP--TRNHANLP--------S---L--------------HKHP---F-E-------P-V-I--SGLRLIPDFITQQEELDLIASID-----------------------------------AH------PWSGYGIPPN-PE---L-KRHT-Q----QYGFLFSFRTR----------TIT-EC-----LG--SLPAFSSF-V-ID-----------------------------------------------------------------------------RMLL---------P----------EF-NV-FPNDP-PN-------HVLVN-E-YQPGQG--IMPHVDSQDTFGDVVTSLSLWSSCVMSFGNK-M----------------------------------------------------T-------------------------------------------------------------------------GEKV-H-----LELPRRSLLILTGDARTHYTHAIPKEDMLF-AGNEC-------V-----------D-RGRRVSLTIRSI----------------------------LK----SAIP----------------------------| MVEG_06150_Mortierella_verticillata_NRRL_6337_672822524 -----------------------------------------------------------------------------------------------------------------------------------------------------------------------M-------A-T-I--PGLEVILDFITEEEEQQLITELD-----------------------------------AG------HWAGRGIEPN-PE---M-KRRH-Q----HYGGVFSYRLR----------RVV-GD-----ME--KLPGMFDF-I-TE-----------------------------------------------------------------------------RLLQ---------R----------RI-YD-RS----PN-------SIIVN-E-YEAGQG--IMPHVDAPKLFGKTITALSLLSACVMTFQHV-K----------------------------------------------------D-------------------------------------------------------------------------PSQIYH-----IHLPQRSLVVMNGSSRYDFKHSISKDLIEH-VDGLE-------I-----------V-RARRVSITYRDM----------------------------LVEDRQQDRESDEAGSS--CKE-LCGNGI-SS--CTRS/Back to Contents
General notes
# 1; 25148697 CELE_F09F7.7 291 eukaryota>metazoa>nematoda Caenorhabditis elegans F09F7.7, isoform a [Caenorhabditis elegans]. 24583140 Dmel_CG4036 304 eukaryota>metazoa>hexapoda Drosophila melanogaster CG4036, isoform A [Drosophila melanogaster]. 585709738 LOC100367945 271 eukaryota>metazoa>hemichordata Saccoglossus kowalevskii PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4-like [Saccoglossus kowalevskii]. 156402493 NEMVEDRAFT_v1g34785 267 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein, partial [Nematostella vectensis]. 688557483 alkbh4 321 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 isoform X1 [Danio rerio]. 823470605 ALKBH4 402 eukaryota>metazoa>chordata>vertebrata Taeniopygia guttata PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Taeniopygia guttata]. 311251081 LOC100520070 302 eukaryota>metazoa>chordata>vertebrata Sus scrofa PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4-like [Sus scrofa]. 68372246 alkbh4 315 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 isoform X2 [Danio rerio]. 612002272 ALKBH4 304 eukaryota>metazoa>chordata>vertebrata Monodelphis domestica PREDICTED: alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Monodelphis domestica]. 8923019 ALKBH4 302 eukaryota>metazoa>chordata>vertebrata Homo sapiens alpha-ketoglutarate-dependent dioxygenase alkB homolog 4 [Homo sapiens]. 386783769 287 eukaryota>metazoa Schmidtea mediterranea alpha ketoglutarate dependent dioxygenase ABH4 [Schmidtea mediterranea]. 407409697 MOQ_003722 304 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi marinkellei hypothetical protein MOQ_003722 [Trypanosoma cruzi marinkellei]. 
407849132 TCSYLVIO_004974 304 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi hypothetical protein TCSYLVIO_004974 [Trypanosoma cruzi]. 731709183 LPMP_352050 297 eukaryota>euglenozoa>kinetoplastida Leishmania panamensis alpha-ketoglutarate-dependent dioxygenase AlkB-like, putative [Leishmania panamensis]. 154345804 LBRM_35_2190 297 eukaryota>euglenozoa>kinetoplastida Leishmania braziliensis MHOM/BR/75/M2904 conserved hypothetical protein [Leishmania braziliensis MHOM/BR/75/M2904]. 342184304 TCIL3000_10_5510 306 eukaryota>euglenozoa>kinetoplastida Trypanosoma congolense IL3000 unnamed protein product [Trypanosoma congolense IL3000]. 340057250 TVY486_1006420 299 eukaryota>euglenozoa>kinetoplastida Trypanosoma vivax Y486 conserved hypothetical protein [Trypanosoma vivax Y486]. 157876860 LMJF_36_1970 297 eukaryota>euglenozoa>kinetoplastida Leishmania major strain Friedlin conserved hypothetical protein [Leishmania major strain Friedlin]. 71747664 Tb10.70.0360 304 eukaryota>euglenozoa>kinetoplastida Trypanosoma brucei brucei TREU927 hypothetical protein [Trypanosoma brucei brucei TREU927]. 146104297 LINJ_36_2080 297 eukaryota>euglenozoa>kinetoplastida Leishmania infantum JPCM5 conserved hypothetical protein [Leishmania infantum JPCM5]. 788018057 TbgDal_X7900 304 eukaryota>euglenozoa>kinetoplastida Trypanosoma brucei gambiense DAL972 hypothetical protein, conserved [Trypanosoma brucei gambiense DAL972]. 588321322 GSEM1_T00000659001 297 eukaryota>euglenozoa>kinetoplastida Phytomonas sp. isolate EM1 unnamed protein product [Phytomonas sp. isolate EM1]. 401420114 LMXM_36_1970 297 eukaryota>euglenozoa>kinetoplastida Leishmania mexicana MHOM/GT/2001/U1103 conserved hypothetical protein [Leishmania mexicana MHOM/GT/2001/U1103]. 528254392 STCU_00887 304 eukaryota>euglenozoa>kinetoplastida Strigomonas culicis alkylated DNA repair protein alkB like protein 4 [Strigomonas culicis]. 
686635833 DQ04_02651050 296 eukaryota>euglenozoa>kinetoplastida Trypanosoma grayi alkylated DNA repair protein alkB like protein 4 [Trypanosoma grayi]. 71659461 Tc00.1047053510187.490 304 eukaryota>euglenozoa>kinetoplastida Trypanosoma cruzi strain CL Brener hypothetical protein [Trypanosoma cruzi strain CL Brener]. 528266291 AGDE_03849 297 eukaryota>euglenozoa>kinetoplastida Angomonas deanei alkylated DNA repair protein alkB like protein 4 [Angomonas deanei]. 528238215 AGDE_09971 297 eukaryota>euglenozoa>kinetoplastida Angomonas deanei alkylated DNA repair protein alkB like protein 4 [Angomonas deanei]. 403359506 OXYTRI_23312 335 eukaryota>alveolata>ciliophora Oxytricha trifallax hypothetical protein OXYTRI_23312 (macronuclear) [Oxytricha trifallax]. 403343479 OXYTRI_08063 342 eukaryota>alveolata>ciliophora Oxytricha trifallax hypothetical protein OXYTRI_08063 (macronuclear) [Oxytricha trifallax]. 294944511 Pmar_PMAR003551 252 eukaryota>alveolata Perkinsus marinus ATCC 50983 conserved hypothetical protein [Perkinsus marinus ATCC 50983]. 401412938 NCLIV_063160 951 eukaryota>alveolata>apicomplexa Neospora caninum Liverpool conserved hypothetical protein [Neospora caninum Liverpool]. 672578582 TGMAS_246140 1030 eukaryota>alveolata>apicomplexa Toxoplasma gondii MAS hypothetical protein TGMAS_246140 [Toxoplasma gondii MAS]. 523576915 TGGT1_246140 1030 eukaryota>alveolata>apicomplexa Toxoplasma gondii GT1 hypothetical protein TGGT1_246140 [Toxoplasma gondii GT1]. 672285053 TGFOU_246140 1030 eukaryota>alveolata>apicomplexa Toxoplasma gondii FOU hypothetical protein TGFOU_246140 [Toxoplasma gondii FOU]. 672573839 TGVAND_246140 1030 eukaryota>alveolata>apicomplexa Toxoplasma gondii VAND hypothetical protein TGVAND_246140 [Toxoplasma gondii VAND]. 672301308 TGRUB_246140 1030 eukaryota>alveolata>apicomplexa Toxoplasma gondii RUB hypothetical protein TGRUB_246140 [Toxoplasma gondii RUB]. 
675123610 HHA_246140 803 eukaryota>alveolata>apicomplexa Hammondia hammondi hypothetical protein HHA_246140 [Hammondia hammondi]. 557738285 TGVEG_246140 1033 eukaryota>alveolata>apicomplexa Toxoplasma gondii VEG hypothetical protein TGVEG_246140 [Toxoplasma gondii VEG]. 237835449 TGME49_046140 1033 eukaryota>alveolata>apicomplexa Toxoplasma gondii ME49 hypothetical protein TGME49_046140 [Toxoplasma gondii ME49]. 672276401 TGP89_246140 1030 eukaryota>alveolata>apicomplexa Toxoplasma gondii p89 hypothetical protein TGP89_246140 [Toxoplasma gondii p89]. 514683301 PTSG_09879 270 eukaryota>choanoflagellida Salpingoeca rosetta hypothetical protein PTSG_09879 [Salpingoeca rosetta]. 167524358 MONBRDRAFT_26078 207 eukaryota>choanoflagellida Monosiga brevicollis MX1 hypothetical protein [Monosiga brevicollis MX1]. 693500295 OT_ostta04g01970 351 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri Alpha-ketoglutarate-dependent dioxygenase AlkB-like [Ostreococcus tauri]. 308802830 Ot04g02100 245 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri unnamed protein product [Ostreococcus tauri]. 612393879 Bathy06g04110 317 eukaryota>viridiplantae>chlorophyta Bathycoccus prasinos predicted protein [Bathycoccus prasinos]. 303282765 MICPUCDRAFT_60114 377 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 145345998 OSTLU_86981 347 eukaryota>viridiplantae>chlorophyta Ostreococcus lucimarinus CCE9901 predicted protein [Ostreococcus lucimarinus CCE9901]. 255085018 MICPUN_62550 353 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. 159473956 CHLREDRAFT_95290 170 eukaryota>viridiplantae>chlorophyta Chlamydomonas reinhardtii predicted protein [Chlamydomonas reinhardtii]. 302847355 VOLCADRAFT_119009 753 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_119009 [Volvox carteri f. nagariensis]. 
308802828 Ot04g02090 147 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri LOC553475 protein (ISS) [Ostreococcus tauri]. 302757669 SELMODRAFT_403903 268 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_403903 [Selaginella moellendorffii]. 302763503 SELMODRAFT_406367 272 eukaryota>viridiplantae Selaginella moellendorffii hypothetical protein SELMODRAFT_406367 [Selaginella moellendorffii]. 676394053 AURANDRAFT_66792 2180 eukaryota>stramenopiles Aureococcus anophagefferens hypothetical protein AURANDRAFT_66792 [Aureococcus anophagefferens]. 551571747 EMIHUDRAFT_458739 375 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_458739 [Emiliania huxleyi CCMP1516]. 551671246 GUITHDRAFT_150889 193 eukaryota>cryptophyta Guillardia theta CCMP2712 hypothetical protein GUITHDRAFT_150889 [Guillardia theta CCMP2712].
RES RQPSLARASVA------------RASVD-SLTSLGAMMA----------------------------HRLI-EAAPAALS---VEAASG--------EWLLADLCD-DGTILHLPADG--------GVPQRFRSPGGFVRFLRPWLSAAS-------VEGGSWTAVHYK-----------------GL----------------PLDLIRKAATSDADALGAV---A FINAL ------------------------------HHHHHHHHH-------------------------------------EEEE---EEE------------EEEEEE-----EEEEE---------------EEE--HHHHHHHHHHH-----------------EEEEEE------------------------------------HHHHHHHHHHHHH---------- ALIGN ------------------------------HHHHHHHHH-------------------------------------EEEE---E---------------EEEEEE----EEEEE---------------------HHHHHHHH--------------------EEEEE-------------------------------------HHHHHHHHHHH----------- HMM ------------------------------HHHHHHHHH------------------------------EE-E---EEEE---EE-------------EEEEEEE----EEEEE---------------E----HHHHHHHHHH---------------EEEEEEEEE------------------------------------HHHHHHHHHHHH----------- FREQ ------------------------------HHEEEEHHH--------------------------------------HHH---HHHH----------------------EEEEEE-E---------E--EEE---HHHHHHHHHH-----------------EEEEE---------------------------------------HHHHHHHHHHH---------- PSSM --------------------------------EHHHHH--------------------------------------EEEE---EE-------------EEEEEE-----EEEE---------------------HHHHHHHHHHH------------------EEEEE-------------------------------------HHHHHHHHHHHH---------- RES KPNPND-IELL------SLE--IKPPK---VPMKTLIE-----------------------------ADFL-RVGQTLFD-------KN--------ENAICIVTQ-DGNVKDN------------E---ETLSIHKMSAKYLNK------------TNNNGWDYFYLFR----------------NNN-------------FITLDSLRYEYTNQ FINAL -----H-HHHH---------------------HHHHHH-----------------------------H--------EEE-------------------EEEEEEEE---EEEE---------------------HHHHHHHHH--------------------EEEEEE-----------------------------------HHHHHHHHHHH- ALIGN 
------------------------------HHHHHHHH-----------------------------H---------EE--------------------EEEEEE-----EEE----------------------HHHHHHHH--------------------EEEEE-----------------------------------HHHHHHHHH---- HMM ------------------------------HHHHHHHH-----------------------------HHH--H---EEEE------------------EEEEEEE----EEEE---------------------HHHHHHHHH-------------------EEEEEEE-------------------H-------------HHHHHHHHHH---- FREQ ----HH-HHHH------H----------------EEEE-------------------------------------HEEE----------------------EEEH-----EEE--------------------EEEEEE-HH--------------------EEEEEE-------------------------------------HHHHHHHHHH- PSSM ---------------------------------HHHHH-----------------------------HH-------EE---------------------EEEEEEE---EEE----------------------HHHHHHHHH--------------------EEEEE-------------------------------------HHHHHHHH--- MPND_Homo_sapiens_664805955 AGGCGGPGGAL--------------TRR-AVTLRVLLK-----------------------------DALL-EPGAGVLSI----YYLG--------KKFLGDLQP-DGRIMWQET----------G--QTFNSPSAWATHCKKLVNPAK-------KSGCGWASVKYK-----------------GQ----------------KLDKYKATWLRLHQLHT------------------------------------------ LOC101862270_Aplysia_californica_524883584 EGALHKAKQAI-------------TCR--SVTLNTLME-----------------------------DGVI-HPGKGVLSI----DYLG--------QRFEADLLE-SGKIKWQ------------GSEEEFGTPSAWALHCKRLVNPIK-------RSGCGWASVKYN-----------------NK----------------KLDIWKSIWARKHRVSSPF---KASSDSSNSSPTKPPQPSTSPAMLASPTLSIDGEKCS Dmel_CG4751_Drosophila_melanogaster_19921138 DDETKENYEGF-----------NGTGR--TVTLQTLMA-----------------------------ANVL-QPGLGLMTI----EYLG--------QKFVGDLLA-DGKIKSHET----------E--TIFLTPSAWAMHCKRIINPDK-------KSGCGWASVKYK-----------------GK----------------KLDAYKNTYLRKCALQK------------------------------------------ DAPPUDRAFT_23970_Daphnia_pulex_321479075 
--------GGF-------------TGR--GVTLQMLLE-----------------------------DNIL-QPKDGAMSL----EYMG--------QKFNGDLLA-DGKIFSAEV----------Q--EVFSSPSAWALRCKKIVNPEQ-------KYGCGWSSVRYC-----------------GR----------------PLDVYKNQWMRKRRLEQ------------------------------------------ NEMVEDRAFT_v1g22327_Nematostella_vectensis_156394960 QVRSRKPRSFL-------------TGR--GVTLAMLME-----------------------------DGIM-QPGEKLLSI----DYLG--------QKFQADLLP-DGKIKWPEA----------N--KVFNSPSAWAIYCKKLVNPSK-------KSGCGWASVKYK-----------------GR----------------KLDQYKSTWFRKQRAQT------------------------------------------ BRAFLDRAFT_221394_Branchiostoma_floridae_260806149 ----------L-------------TGR--GVTMAMLLS-----------------------------DGIL-EPGDACLSI----DYLG--------QKFVGDLLP-NGKIKWQ------------GSNRVFNSPSAWAIHCKKMVNPSK-------KSGCGWASVKYK-----------------GK----------------KLDIYKTIWFRKQRGVN------------------------------------------ LOC100209039_Hydra_vulgaris_449665213 KKKVLSKSAML-------------TGR--GVTTQMLIE-----------------------------ENIL-EAGENNLTI----NYLG--------NKFVGDLKE-DGTINCK------------NANRIFSSPSAWAMHCKKQVNPDK-------KSGCGWASVKYK-----------------GR----------------KLDEFKSTWFRKQKIEQ------------------------------------------ LOC105319737_Crassostrea_gigas_762155186 DKQGSLAKAAI-------------TGR--SVTLQMLIS-----------------------------EKII-EPGPALLSL----KYLG--------RRFIADLLP-DGKIQMP------------GTKETFTSPSAWAIHCKKQVNPHK-------KSGCGWASVMYK-----------------ER----------------KLDIWKAAWFRKHRSSS------------------------------------------ CAPTEDRAFT_110378_Capitella_teleta_443694149 KPFEQRAHHAI-------------TGR--GVTLAMLMA-----------------------------DGFI-QPGNDTMSI----DYLG--------QNFRADLLD-GGRIREN------------G--KVFGTPSAWAIHCKNIVNPGK-------KSGCGWASVKYK-----------------GK----------------KLDAYKLSWLSKHRPLAIA---AVSLTVSNYSYFPYINQCYDIFLEDIKP--------- TRIADDRAFT_54406_Trichoplax_adhaerens_196002159 
TKNSNKLPYNA-------------AGR--GVSIAVLIK-----------------------------DGIL-KPRKKCLSL----EYLK--------KTFYGDLLP-NGKIQSS------------TTEDIFNSPSAWAIHCKRLVNPAK-------KSGCGWASVKYN-----------------GV----------------KLDEFKATWYKKKKLSC------------------------------------------ LOC587071_Strongylocentrotus_purpuratus_390342613 TTSPPKKEKVL-------------TGR--GVTLSMLMG-----------------------------DGVV-EAGKDCLSI----EYLG--------SKFTADLMT-DGRIFWS------------KEKQIFNSPSAWAIHIKSILNPGK-------RSGCGWASVKYN-----------------GK----------------KLDVVKSQWFRNMKIPY------------------------------------------ mpnd_Maylandia_zebra_498992415 LRSSSGRGSLL--------------TRR-GITLRVLLK-----------------------------DGLV-EPGDGVLAI----HYLG--------KNFVGDLLT-DGKIRWVET----------G--QIFNSPSAWATHCKRLVNPAK-------KSGCGWASVRYR-----------------GQ----------------KLVQYKTTWLHKYQPSA------------------------------------------ LOC100902139_Metaseiulus_occidentalis_391344637 GERLEEREPRK-------------------LTLETLLR-----------------------------EGVL-ESGDGVLSM----EYMG--------MRFTGDLLS-DGAIRWGES----------G--EVFPSPSAWAIHCKRITQPDK-------KTSCGWSQVKYK-----------------GR----------------KLELYKQEWLHRHQTAK------------------------------------------ LOC100641685_Amphimedon_queenslandica_761911769 --------MAV-------------AGR--SISMFSLIT-----------------------------DNIL-EPGDEVLTF----DYLG--------KRYTADLLP-EGTIRGN------------G--QIYASPYAWASYCKNEINPDQ-------KTAIGWGHIRYR-----------------GI----------------KLSQYKNLYLKKHKLCS------------------------------------------ LOC100178477_Ciona_intestinalis_198425307 SLDSNSSRKDI---SPRSSR--GGTSR--SVTLQTLVQ-----------------------------EGVL-EPGNGVLSI----DYLS--------HKYLGDLLP-NGKILWD------------N--VQFPSPSTWATHIKKKINPSK-------KSGCGWNSVKYK-----------------GK----------------KLDKLKANWFRKNAGVV------------------------------------------ ANKRD31_Gallus_gallus_513229710 
----NYGYKECEQKQRKHAR----KNKK-KLQLIDLLE-----------------------------LGRI-KPGENVLEF----TLKE--------FTCKATLLT-NGKIKTSK-----------N--KIFQNPVQWVKDLLGSDIYV--------TWKYAWNKVVYR-----------------GT----------------QLSKLVVENAPVSNDLEIP---S------------------------------------ ANKRD31_Homo_sapiens_256574792 ----GSGQQDTIKKALNYST---APKKK-CIQIKDLIL-----------------------------LGRI-NPGNNILEF----KTQE--------TTHKASILL-NGKLKVES-----------G--QIYKNPVTWLKDLLGGNSYV--------TWNYAWSKVTYL-----------------GK----------------ELLRYVSEDAPILPEPNSV---P------------------------------------ LOC101480116_Maylandia_zebra_499004945 GIALEHFSIMIRRKNVLIQN---RAVDN-SRRLSVLIQ-----------------------------RGII-SPGSA-LQL----LLKG--------HCHFANVLA-DGSILSK------------G--KVHLAPECWLKSILGKNIPV--------SSAYAWDKVTFR-----------------GR----------------SLSFYLLNMEGDENTPQRC---L------------------------------------ GSONMT00016346001_Oncorhynchus_mykiss_642092905 ---HGNTPDMGSNVLQQGTA---SDGEE-NRKLIRLIK-----------------------------RGVI-TPGEDVLQL----MWRG--------CVHQASLLL-EGWIRDSVT----------G--REFQAPELWVAAILGNNIPV--------SSAYAWDKVNTT-----------------QQ----------------SS--------------------------------------------------------- CHLREDRAFT_187079_Chlamydomonas_reinhardtii_159476834 ---GAVSTRALPSPPRNSRG---VSAAA-AGSSARALQ-----------------------------ASLL--VGNQDV--------------------VDVVVHA-DGRIDCQ------------Y--GEFRSVSSLALKVLRQRNPNR-------MACDGWQEVKLN-----------------GV----------------RLDEMRQEAGRLLAREAAA---S------------------------------------ CHLREDRAFT_150353_Chlamydomonas_reinhardtii_159478713 DSPKSAPARSGGVVGKDGKR---VYNRT-VPTFGDIIS-----------------------------HGLF-PPGPCRWTV----GTIK--------EEVSVEVRP-SGEILYC------------G--NAYPSISAFALVVLRSRNSER-------IACDGWREVRHN-----------------GI----------------KMEVLRKECLRMMLENG------------------------------------------ MONBRDRAFT_26083_Monosiga_brevicollis_MX1_167524112 
--------AMSAPGTDRSKL---QYNPE-SVLLRCLVE-----------------------------CRLV-TPGQRQLRL----DHGP--------FQYAADLSP-DGLVTCA------------N--RVFASLTDFVAHCQREVATELGP-----AQLDPWASVRHL-----------------GT----------------PLAELVTNCPPTARASPCL---N------------------------------------ COCSUDRAFT_48713_Coccomyxa_subellipsoidea_C-169_545357568 ------------------------MTDRRAFTLQPLVE-----------------------------AGVF-EPGENVLSC----SVGG--------VEYFADLGP-AGEIIFE------------G--QFFKSPSAFSVFVKRKVNPSR-------KADDGWTSCKYR-----------------GE----------------LLSIYRPQ-LENLLGGDGR---SG----------------------------------- CHLREDRAFT_188109_Chlamydomonas_reinhardtii_159471497 APRAPPKPPAAKGGTGAKPKAAPKRPAG-AITLRTLLD-----------------------------AGFL-VPGSKVLYV----EYKG--------LITWADLTE-EATIMCD------------G--QTFESPSAFSIFVKRKLNPER-------KADDGWKAVKYA-----------------GK----------------LLEHYKEQYLRQQLAAS------------------------------------------ COCSUDRAFT_46372_Coccomyxa_subellipsoidea_C-169_545371515 -LLERQRAAVAKKRGPGTGK---PRAGG-GITLKLLID-----------------------------EDIL-QPGDNILSV----EYKS--------SMTYASLEH-DGRISCFVQ----------GQHLTFESPSAFSIYLKRLVNPAR-------KADDGWKTVKCN-----------------GR----------------FLEQYKLELARRRFGKP------------------------------------------ MICPUCDRAFT_60251_Micromonas_pusilla_CCMP1545_303282299 KPKPKKERPPA------KAP---SGGAS-GVTLAHLID-----------------------------ANLI-APGVDVVST----LYNG--------VTELATLRD-DGAIAWD------------G--REFHSVSAFSLAVKRRTNPDR-------KADDGWKCVKCD-----------------GV----------------ALDAVKRRYEATVAAGG------------------------------------------ MPVG_00227_Micromonas_pusilla_virus_12T_472342784 ---------------MAPSP----------VSLKTLID-----------------------------AGLL-TPGHNVLQI----KINR------KKVTQCASLTN-EGVIVFK------------N--KQYHHPSEWSLYVKNIYNPTL-------TSDRGWTSILYQ-----------------GS----------------TLNVIRSSYISRRD--------------------------------------------- OSTLU_32332_Ostreococcus_lucimarinus_CCE9901_145347713 
-RKNRAAGEPLVQLGERGKA---GIGKI-SCSLVDLIQ-----------------------------SGLL-KSGAEKMFI----VYQD--------NVWKGDLGE-DGVITFQ------------G--QRFTSPSAWAIFAKRLTNPTK-------KADDGWKSARYGDP--------------DGP----------------TLDQVKGEYARINQLKLA----------------------------------------- MICPUCDRAFT_47146_Micromonas_pusilla_CCMP1545_303277185 --------GTVGAPKRPARSG--VPGRS-TVSLKMLVD-----------------------------DGKC-NPGHDALWI----TYQN--------QTWSGELSA-DGIITFQ------------G--KTFNSPSAWAIYVKRLANPGK-------KADDGWKSVRYGHE--------------EGP----------------MLDDLKHEYLRGESGAGL----------------------------------------- Bathy04g03050_Bathycoccus_prasinos_612396523 NKKKENPNNLMTTKERKLHE---RGPYD-GITLFTLIQ-----------------------------LKLL-KVGANNLLA--SYEYGG------EKVQVVGSLTE-NGHIYVQ------------KEDEMFRNPRSMSIDVKRRLNPEI-------KFTEGWGHVRYVNTGSWHLDGKNDEK--NGQ----------------TLKEIRERIPIKDVPSLV----------------------------------------- PHYSODRAFT_347238_Phytophthora_sojae_695436758 SPRPRPRPQLVRPPPRGLATSF-IMGDA-RIVLTTLID-----------------------------EKLL-SPGPKKLYV----SYYR--------KRVYADLLP-DGSIRFK------------D--QVYTSPVPCALHMKRTLNPSL-------KTDAGWSSMYSAA---------------SGE----------------SLKDIKDRLNIRKRGTNA----------------------------------------- CAOG_001362_Capsaspora_owczarzaki_ATCC_30864_765549348 --QAAPPKPRA---PQRLE----RKDRI-QITLSDLID-----------------------------AGLL-KPGTVLSS--------G---------SAQCLLQA-DSSVVSQPE----------G--KPYASAQAWLATVYTKE-----------QRPSMWSRVSAK-----------------GM----------------VLNIYREMYIKRAESA------------------------------------------- GUITHDRAFT_132396_Guillardia_theta_CCMP2712_551676387 CEDKTWNPEDL-KKNAEDL----ARDPS-SVTLKQLLD-----------------------------AGFL-SLDAELYWQ-KKHPEKG------LLGPVFGKITL-DGQIEFE------------G--QKHKTPAAFASAACKSLTGRSLK-----RKHDGWSSVRYR-----------------NR----------------TLEYYKSQYIDRNSRKN------------------------------------------ DI09_127p30_Mitosporidium_daphniae_692170605 
-------SDKI---KAANLS---AGSKSETFSLMDLIS-----------------------------SGTV-PIGSVAYCF-----------------DCTAKITS-SGSLIDEID----------L--QEYFDPSDWATYVVRCVENRRL------SRQDGLKAIKVG-----------------GK----------------TLEELYSSVFVQQSTTRND---P------------------------------------ MVEG_11466_Mortierella_verticillata_NRRL_6337_672817350 -----ASSATIGLAVNNTVK---LAHQL-RVTLHNLVS-----------------------------TGYL-PAETRVIFR-----------------DHSAIVTA-KGTLIPIYSELNCATHCPWLQ-GEYETPSAWATAVVKGARTGK-------VAVNGWSAIKVNVHQNPALVKMFSGQGLPEV----------------SLDVLRKRYLMDMVDDG------------------------------------------ Gasu_54090_Galdieria_sulphuraria_545702290 ---DKMEKSSV-----SDISSP-RSKEA-QQKLLELTK-----------------------------HGLL-KVGEKVQFY-----YKA--------KEFAAQVTS-EGCLLYQGEN---------GESELFLSPSAFVNTLAKRQGPSSRGKSKPKLNLNGWEFCFVA-----------------GV----------------SLSQLKQQLESILQAEGT----------------------------------------- DDB_G0293300_Dictyostelium_discoideum_AX4_66800521 GKIKRKRGIVI------DNR---RKKVP-DITIDDLMK-----------------------------KDLL-RIGDTLCYC-----IGG--------VNHFALLLR-DGYIEYD------------S--LRLPSVHAFVIHVLSNLEKNKKFR-----WFSPWDSVSVR-----------------GK----------------SLNFIRSIFQSKFYKGG------------------------------------------ Sthe_2269_Sphaerobacter_thermophilus_DSM_20745_269787547 ASVGALWHIPA-SSPQNKGR---YKHSY-GVHLSDLVK-----------------------------SGVL-PAGTPVILV----GPRNK-------DLAHAEVSQ-DGHIIWG------------G--KRYRSLSDRAFCAAFS--PPR-------VSFNGWKHWYAVL--------------PRGRV---------------QLAELREEYLRAVADQK------------------------------------------ DDB_G0272516_Dictyostelium_discoideum_AX4_66823477 -----------------------MVNEH-NVTLSNLID-----------------------------FGLI-KPNQEVKYS-----YRG--------VSYTGVILL-NGEIHTN------------G--VSFTNPTHWTRTISG-------------NNCSGWGTVKLSG--------------ASGP----------------PLLKLKREYLFRVGSTGK----------------------------------------- CAPTEDRAFT_211270_Capitella_teleta_443691830 
QYVSPQPKNQV-------KQ---SKRVE-LPTMKTLLK-----------------------------RGII-ISGTKVLSV----HTQE--------GIKFASVDV-EGNIITPT-----------G--HKFVSPLRWAMCLKGVGH-MK--------RSTAYKMILYQ-----------------GA----------------TLYDLTQSSAGSNPLILTS---S------------------------------------ LOC100181315_Ciona_intestinalis_459177714 TKSPNTSQHPA----QADIQ---QDKLD-L-NLSYLLK-----------------------------NKII-QQGSNALKL----KSMG--------TEHVASLTS-NGSILTSE-----------F--QLYVTPVAWIKGVTGRSY-SK---------VNAFKMVTYL-----------------DE----------------PLYNISMRVAQKTQAAV------------------------------------------ BRAFLDRAFT_71623_Branchiostoma_floridae_260823234 HTQGLSTPALM------EGQ---TAGTE-VPTVSELLR-----------------------------AQLI-QPGKDVLSC----KGKA--------GLQFASLQP-DGAVMTQG-----------G--LSFSTVAQWHRAIWGHRT-GQ-------KRAMVFRQVCYK-----------------GT----------------PLADVSSKFTPAAKTITP----------------------------------------- LOTGIDRAFT_237598_Lottia_gigantea_676429688 -------------------T---KKYKK-FPSLKRMMD-----------------------------RGLL-KPGKNKLSI----IRKG--------ERVTATLLN-SGMILDAT-----------G--PGFSTATKWFSAVTGTQL-TT--------KAKAYRMVCYE-----------------NT----------------PLMEFKKQYDKLGNVNLV----------------------------------------- LOC105318954_Crassostrea_gigas_762153126 NMDIGCEQSQY-------EK---YGGLK-FPPLKSLID-----------------------------LHVL-YPAENVLAA----LYMG--------KVFTASLTM-MGNIEGKG-----------G--EVFNTPMKWLSAVKGGEV-VK--------KAQAYREIKYD-----------------GH----------------SLKSYVDGEQQTSIEKINV---L------------------------------------ Tsp_04073_Trichinella_spiralis_339246505 FSQRRGRRKSI-----RGKS---KAQLY-VRNIRQLIK-----------------------------EGIL-EAGVNILVY---PQDQD--------NVHYASLLP-NGRVQANVRL---------G--PLFSSIQRWVGFCLGHQNTSR-------TPLFELLRVRYR-----------------NV----------------SLFKINIILGVGLSRDEVV---E------------------------------------ HELRODRAFT_165788_Helobdella_robusta_675890068 
KPKMAPNYKIV--------K---QQEIK-FRSIEQLIL-----------------------------LKVI-EPGANVLSI----QREK--------VNHLASILP-DGLILEDKS----------G--RVHNSLISWYRFITNSKQ-SK-------IAMNQLMEVKYD-----------------GK----------------PLALRLKEANYRRSEITNI---------------------------------------- _Haemophilus_influenzae_127456 KPNPND-IELL------SLE--IKPPK---VPMKTLIE-----------------------------ADFL-RVGQTLFD-------KN--------ENAICIVTQ-DGNVKDN------------E---ETLSIHKMSAKYLNK------------TNNNGWDYFYLFR----------------NNN-------------FITLDSLRYEYTNQ----------------------------------------------- MOMA_RS00825_Moraxella_macacae_497185310 VVNPND-IEWL------SLE--TKPPK---VAMKTLVA-----------------------------ANYL-NIGQALFD-------KN--------QNRICTVLA-DGKVTDS------------V---DTLSIHKMSAKYLNK------------TNHNGWDYFYVIK----------------DNK-------------LITLDSLRYDYASKMGK-------------------------------------------- AAR27819.1_Staphylococcus_sp_L1_38906136 NQVVID-DDYV----NAVFD--KKLIR---VPFKKLVE-----------------------------EGFI-DKNEYIYF-------NN--------TEEYAVISD-DKELLYN------------G----KHSIHSLAGILKGL------------ERANGWNYWYVKR----------------NNK-------------IYFYRSLS----------------------------------------------------- HMPREF1766_RS00325_Fusobacterium_nucleatum_696308956 YQKNMI-TELL-------LE--VKPPK---VPLKKLVE-----------------------------KGYL-KENQVLYN-------SL--------GEAKVTVLS-NGDVFDG------------N---EKLSIHKMSAKILNK------------TNNNGWDYFYVMN---------------NGK--------------LIPLNDLRYQYDKEVNNEK------------------------------------------ CCAN_RS08180_Capnocytophaga_canimorsus_503763750 NVVPEI-DLFS----QLELE--VKPPR---ISMKELIT-----------------------------KGFL-KIGQQLFS-------KD--------KKYSVTICQ-NGNVSDG------------E---EMLSIHKMSAKLLKR------------TNNNGWDYFWTDY---------------KGE--------------FISIDSLRYLANKQEKI-------------------------------------------- DV59_RS09130_Helicobacter_pylori_446268888 
NTRDKS-DFIT----NLELE--TKPPK---IPMSLLIS-----------------------------KQLL-KIGDFLYS-------PN--------KEKICQVLE-NGQVRDN------------E--NYETSIHKMSAKYLNK------------TNHNGWKFFYAYY----------------QNQ-------------FLLLDELRYICQRDS---------------------------------------------- F811_RS0110085_Brachyspira_innocens_518849236 NTYVEM-TDLI----NLDYE--VKPPK---VPIKNLIE-----------------------------KGYL-KANQALYS-------KK--------GDEVCKLNG-NGNV-EN------------E--LGNFSIHQMSAKLQNL------------SKYNGWNYFYTYY----------------KDK-------------FISIDELRYIYIGDNHE-------------------------------------------- G500_RS22855_Flexibacter_roseolus_737789152 QVKPLS-KNVL----EYKID--RKKPR---IPFGNLVE-----------------------------KGYV-SIGETLYS-------KD--------KKLTAVVQA-NASIIAN------------G--TAVGSIHKVSSVLLNK------------ATNNGWTFWYVMR----------------ENE-------------LISIDELR----------------------------------------------------- _Thermoplasma_acidophilum_499204035 NTASYQ-QKLL----DYPLE--IRPKR---VPFGSLIE-----------------------------NGYV-KAGEYLYS-------PD--------GEARALVLA-NGTLSYE------------D---KYGSIHKISAMILNK------------PANNGWAFWYVKR---------------DGK--------------LVSINDLRQKLLKDQYANH------------------------------------------ ANT_RS09810_Anaerolinea_thermophila_503325698 QIEPYP-QQAL----ALPVR--SRKSR---LPFGRLVE-----------------------------QNLV-QPGQILFF------DRN--------PEIRAVVLS-DGHLSVN------------G---WKGSIHMTAEKICG-------------HPTNGWERWFFLD--------------EQGI--------------FQPISILRQKYLSNVSIEN------------------------------------------ HAUR_RS15710_Herpetosiphon_aurantiacus_501142320 -ESPSS-TDAL---QALPSN-KRRIPR---IPFGNLLE-----------------------------HGLL-QAGQQLWF------NRD--------PNLVATLLA-DASLRMS------------DG--TRGSIHKLGTILTGQ------------PSCNGWEHWFFQA--------------SDGT--------------LTSIDVLRQEVRRLREQTP------------------------------------------ RCAS_RS10325_Roseiflexus_castenholzii_501069455 
-TPVSVCDDAM----LATRS-KRDMPR---VGFGQLVE-----------------------------AQYL-RVGQNLYS-------SD--------RNVVAIVRA-DSQLQWG------------N---ITSSIHRIAALAQHK------------PAFNGWEYWHYED--------------QAGR--------------LVSIDSLREQYRFDQGVAD------------------------------------------ CCALI_RS04915_Chthonomonas_calidirosea_512724354 -TPFCGVDERE---LLITPS-KRAAPR---VAFGQLVE-----------------------------AGYL-KVGTVLYS-------RD--------RRIVAYVKA-DSLLRWD------------S---KEGSIHQIAALAEGK------------PACNGWEYWYYED--------------EDGQ--------------LISIDVLRARYRAENGLE------------------------------------------- OSCT_3182_Oscillochloris_trichoides_DG-6_308225152 ---AIS-ATCLEHGELLTRS-KRNAPR---ISFGQLLE-----------------------------AQYI-SVGQPIFS-------QD--------RAVTAIVKA-DAQLICN------------D---QTGSIHKIAASVQNR------------AAANGWEYWYYED--------------AAGN--------------LVSIDELRERYRHENHVN------------------------------------------- K355_RS0107980_Thalassospira_lucentensis_550983501 KVQMLD-GDSL----EVTES-KRSLPR---IPFGAVIE-----------------------------RGLL-SPGEKIYD--------NR-------GNVAAMVRA-DGSISHK------------D---NAGSIHQIGARVQGA------------EACNGWTYWHYKC---------------DGR--------------LVSIDNLRSQLRKEMGQVP------------------------------------------ MGMSR_RS10205_Magnetospirillum_gryphiswaldense_568205957 KVRPVS-ELSL----LSTPS-KKSEPR---VPFGTVVE-----------------------------RGLL-EVGTVLYG-------NGK-------DSLTAKVRA-DGTLISA------------D---HRGSIHKVGALVQNA------------PACNGWTFWHLKQ----------------GDE-------------LVPIDVLRQKIRAELH--------------------------------------------- L902_RS0138340_Agrobacterium_tumefaciens_665867244 AVEPLG-KAEL----TVMTG-KKAEPR---VAFNTLVE-----------------------------SGLV-RPGQVLTD--------AK-------RRYSAIIRA-DGTIASG------------G---TAGSIHRLGAKVQGL------------DACNGWTFWHFED----------------GDA-------------LKPIDDLRTIIRSELAKAE------------------------------------------ EX02_RS05975_Rhizobium_leguminosarum_739196468 
AVEPLG-KAEL----TVMTG-KKQEVR---VAFNVLVE-----------------------------SGLI-KPGQVLTD--------AR-------RRHSAIVRA-DGTVASG------------G---EAGSIHRLGAKVQGL------------DACNGWTFWHFDD----------------GKS-------------LRPIDDLRSVIRSDLAKAE------------------------------------------ _Brucella_melitensis_bv_1_str_16M_81851486 AVEPLG-KAEL----TVMTG-KRAEPR---VAFTSVME-----------------------------AGLL-RPGTVLCD--------ER-------RRFAAIVRA-DGTLTAN------------G---EAGSIHRIGARVQGF------------DACNGWTFWHFEE---------------NGV--------------LKPIDALRKIIREQMAAAG------------------------------------------ BIND_RS03830_Beijerinckia_indica_501352122 AIDPLP-SEAI----ASFPN-KRTEPR---IPFMTLIE-----------------------------SGLL-AAGETLTD--------EK-------GRHEAVVRA-DGTLAVG------------P---IIGSIHKIGALVQGL------------PACNGWTFWHFQR---------------DGQ--------------KHPLDRLRIQLRESGKEPV------------------------------------------ BJ6T_73150_Bradyrhizobium_japonicum_USDA_6_354959883 AVEPLP-EESL----APFMT-AREAPR---VAFSELIE-----------------------------RGMI-MPGTKLFD--------AK-------KKLGALVRA-DGAIMLG------------D---KVGSIHRIGAVAQGA------------QACNGWTFWHVET--------------KKG---------------LKLIDELRAEIRAGMGAE------------------------------------------- QH73_RS47605_Scytonema_millei_748143394 SISPAT-REVL----AVTQS-KRAEPR---IPFGNLVE-----------------------------RGLV-KPGDTLYC--------PR-------GERTARVRA-DGTLISG------------R---STGSIHKVGAEIQKA------------PSCNGWTFWHVRV---------------RGG--------------FQPLDTLRAQIRETMAV-------------------------------------------- YY1_RS0113660_Mastigocoleus_testarum_654345013 AIEPIE-GAVL----ETEKS-KKSLPR---VPFGALLE-----------------------------SGWL-KPGDRLFS--------PQ-------RRYQARIRV-DGSLTTG------------S---VSGSIHRLGAHVQQA------------PACNGWTYWHYED-------------PKRN---------------LAPIDLLRRRYREEMGLN------------------------------------------- HPO_RS06710_Hyphomonas_polymorpha_737625856 
AITPLE-GEVL----ETERS-KKSLAR---VPFGALIE-----------------------------TGWL-KPGDRLFS--------PQ-------RRHQARIRV-DGSLTTG------------A---ITGSIHRLGAQVQQA------------PACNGWTYWHYET-------------EKRD---------------LAPIDLLRRRYREEMGLA------------------------------------------- HMPREF1019_RS05155_Campylobacter_sp_10_1_50_496651971 DIVFED-SDIA----HAKFD-KK-PLK---VNLDQMID-----------------------------ANFL-NLGERFYL--------KN-------SDEFAILKR-GSRLEYN------------N---ILYDIHSLAAKLKSAKS----------ERLNGFKFWHVMR---------------DNK--------------KILLDDIRSHFREINA--------------------------------------------- EMIHUDRAFT_114853_Emiliania_huxleyi_CCMP1516_551589812 RQPSLARASVA------------RASVD-SLTSLGAMMA----------------------------HRLI-EAAPAALS---VEAASG--------EWLLADLCD-DGTILHLPADG--------GVPQRFRSPGGFVRFLRPWLSAAS-------VEGGSWTAVHYK-----------------GL----------------PLDLIRKAATSDADALGAV---A------------------------------------ SFUL_6650_Streptomyces_fulvissimus_DSM_40593_485098347 -EPQTATADGH------------RVDL---R-TLVAALP----------------------------PAAF-SAGGIALT------GRGKR------GPAKATLLE-DGRIMCF------------R--QPMNTPTSAARMAAGD------------DTVDGWAFWSLT---------------VDGKAR--------------TLADLRDTLIAQRG--------------------------------------------- G407_RS0122780_Salinarimonas_rosea_655991010 DAPAEAVGAAP-----------RRKRR---LMRTTEMIE----------------------------RGLL-RAGMRLTI-------KGR-------PGSEAVVV--DGRHVEF-------------------GGERMSFNDWG-------------CRVTGWTAIQIYEWAEM----------PDGR----------------LLSALREA--------------------------------------------------- Haur_5215_Herpetosiphon_aurantiacus_DSM_785_159894763 HAVVGEATEQP-----------S-EGR---KPRFNDMVQ----------------------------AKKV-VPGDQLYT-----KKY---------PQRRATVV--DGETVEY-------------------DGVRYPINVWG-------------EKVTGWSSINIYDSVILE---------RTGK----------------PLRSLREEG-------------------------------------------------- M657_RS0116450_Bacillus_megaterium_651957888 
SSIKRDNSNRI-----------SKKRY---LPRMKELFE----------------------------WGIL-SPNQKVSI-------KNQ-------DNSDATVVD-EKTVSFN--------------------ENIMSYNQWG-------------KVVTGWSAMSVYEWIIPE---------GQTK----------------TLHELRLERLDNRQWD------------------------------------------- _Methanococcus_maripaludis_500693945 ----KYVSKKT-----------KDITRR-SLPKISDMLE----------------------------WGVV-KPGDVIKA------KD---------HNAEAILLK-NGNVSVIPTEMSIGSADVRKIPMQTAVATEMSMQTWL-------------KSVYGWSSIQTYNFAVLK---------ETGE----------------TLSKIREKYMEKLSSENT----------------------------------------- HMPREF1040_RS01335_Megasphaera_sp_UPII_135-E_494634130 HSNLHLERTIK-----------KI-GK---VEADGSQTS----------------------------EGFVVFKGSHISL-----ADDNTIPA-VIKERRKNALIDEQGVLQED---------------MLFTSPSYAAMFVIG-------------KSANGLTSWKTAD----------------GK----------------TLKSLESSHDTEQTDET------------------------------------------ PTSG_07559_Salpingoeca_rosetta_514687557 EEEAQPSYERE-----------TE-TF---CSAVRELHA----------------------------NGML-QTGDVCEF-----K------------GVVAELDG-DGDLVLA----------D-G--QVFPTPGMMACIMQWSG-----------FSSNGWSCVKFYSSSPQLMDDGPTSSTKRGKRARKSSAAAKPAYSMLTIAQLRKQLLKKEQERRQL---KREE--------------------------------- JONANDRAFT_RS02920_Jonquetella_anthropi_495798143 WNKVPDGQYFI-----SGSK--KNFGK---IKATMHVEN----------------------------GTFIVERGSVCVP-----FADG---------KVQPGFIG-NARIEE-------NILQE-D--VPCSSPSAAGALVLG-------------RSTNGWTAWKDSS----------------GR----------------LIDVYRMPEPDEKEVAQKG---H------------------------------------ MBVG_RS02290_Mycoplasma_bovigenitalium_490555808 SKCIPNGEYFL-----ERNV--KGFGR---VEGRARVHD----------------------------GVFTLLKGSYCAD-----YND----------KYPSQLRK-NAKFKN-------NFLQE-D--IICKSPSAASLIVVG-------------KSTNGWDWWKNID----------------GE----------------SIDIYRK-----KEISD------------------------------------------ B437_00700_Fusobacterium_hwasookii_ChDC_F128_402258132 
--DMEKITFVL-------------KGR---VTSGTGRLLSN--------------------------EKFEILKGTSIVL----EVKSENPTTFKRNKNLIDDLLR-KNLIEKSG---DKYIFKE-N--YIATSPSAAAILVLG-------------RSANGWSEWKTYE----------------GK----------------LLSEYRR---------------------------------------------------- CLO_RS14710_Clostridium_botulinum_489467259 --EIMDIDFYC-----------QG-SR---GKGAGKLKK----------------------------GKFIVLKGSRASK----FFYDSV---KTSNTKLVNKLMN-EGKLREEN---EYYILIE-E--CIFTSPSAAAKFILG-------------RSANGWSEWKTYE----------------GD----------------TLDNFRDKEE------------------------------------------------- IF25_RS0123745_Streptomyces_sp_NRRL_B-5680_739980329 --RLPSGGPVA------------R-GR--LLPEKGSNGS----------------------------QKFLVYAGAPARG----KVVPSYSERRASSSRLRTQLID-EGRLRPSERWPGHLEVAE-D--VEFGSPSAAAEVLLG-------------RSANGWTRWRTKD----------------DR----------------PLSEFMPGVWAGPNRAWLV---R------------------------------------ NTHER_RS04135_Natranaerobius_thermophilus_501422597 -------VFVC------------K-GK--EAYAEGDYID----------------------------EGFVVYKGSKANL----TETQSAGEW---LINLRKQLIE-SGVLVKRD---EIYEFSS-N--YIFSSPSAAAATVLA-------------RRANGWTEWKNKD----------------GK----------------TLDDVKRKSD------------------------------------------------- N690_RS02735_Brevibacillus_thermoruber_737312527 -------LLYC------------K-GK--DAKAVGEYTD----------------------------EGLVVLKGSTANL----TESPSSSNT---LKALRKKLID-EGVLVQEG---NVYVFTK-D--YIFGSPSSAADVVLA-------------RSANGWTEWKRED----------------GK----------------TLYQLKRE--------------------------------------------------- H630_RS36845_Salinibacillus_aidingensis_763304606 ----NDNLFYC------------K-GR--GIQAVAEYTE----------------------------EGMVVLKNSEMAR----DTTESFKEYMTSSGMRRDTLIK-DGIVKLTG---DVYTFQE-D--YIFTSPSAAAKTVLG-------------RAANGWTKWKTKD----------------GK----------------TLDQIKRKGK------------------------------------------------- X941_RS08425_Burkholderia_pseudomallei_759571980 
--VAKEEVFYC------------KTSN---YDAIAQYTT----------------------------EGMVVMKGSKARV----EMVPSAGES---QQKRRQQLIA-EGVLKLED---GFYVFQR-D--VLFKSPSGASDAITG-------------ASTNGWQLWKTKE----------------GK----------------TLDELKRQPASS----------------------------------------------- MA05_RS01915_Comamonas_aquatica_772620269 NESVEADTFYL------------RSTK---YDAVGEYTA----------------------------EGMVVLKGSKARI----DIATSMAQT--PLVPKRQALIE-DGALKLEG---DFYVFQR-D--VLFKSPSGAAAMVRG-------------ASSNGWVEWVSET----------------GK----------------TLDELKRQAPVNKSVS------------------------------------------- K225_RS0108435_Acidovorax_sp_JHL-9_736707052 GEPQPTELFYC------------K-GPD--ASGVGEYTP----------------------------EGFVVHKGSTARI----GNVASIQGT--SQERFREQLVT-DGVLKLQG---TQYVFTR-D--YLFSSPSMAAIAVLG-------------RSANGWMEWKTEQ----------------GQ----------------TLDGAKRQVMNAVN--------------------------------------------- LI82_07720_Methanococcoides_methylutens_695943706 --------YFC------------KSKN---ANAIGEYTE----------------------------EGFVVNKGSKSNM----EETTSLQPS---IRAFRANLLE-KGIVKEED---GVYVYQE-D--FTFSSPSMSASVVLG-------------RAANGWMLWKDKD----------------GK----------------TLDELVRQKGISNG--------------------------------------------- MPSY_RS00900_Methanolobus_psychrophilus_504865027 --------YFC------------KSKD---ADAVGEYTE----------------------------EGFIVNKGSKSNV----KETPSIQQS---IKTFRANLVD-KGILKEEN---GVYVFQE-D--FTFSSPSMAASVVLA-------------RTANGWAEWRVQD----------------GGNIK-------------TLDDVIRKKEKED---------------------------------------------- DSM3645_RS18720_Blastopirellula_marina_750016631 --STNTESFFL------------KTNE---CDAEGNFVE----------------------------DGFVVRAGAIARK----EITPSGIDL---IEPVRTLLIE-SGVLIDQG---ANLRFTQ-D--YLFNSPSRAAIVVLG-------------RRANGWTEWKDAL----------------GR----------------SLDEVYRADGDA----------------------------------------------- RS9917_RS10690_Synechococcus_sp_RS9917_494162448 
---AEVLSLRS-----------PSNG----VEAKGLYTA----------------------------EGFVVLAGSVGRG----DTAPSLGET---NERWRQRLLD-GGVMQPDDR--GRLVFPK-D--HLFKSPSGAAIALLG-------------RTANGWREWKSPQ----------------GP----------------TLHQLIREGCTDDQHP------------------------------------------- G442_RS0116315_Acetobacter_nitrogenifigens_651265592 SSDEEGVTVYC-----------LAPG----VEGQARYTE----------------------------EGLVVLSGSYGRS----EVSDSFSRH--NYYRKRQNLID-QGALRIDG---SRIVYTR-D--TLFKSPSPGAVYLLG-------------RSANGWFEWKDRT----------------GR----------------SLADVMERKQ------------------------------------------------- Thi970DRAFT_03846_Thiorhodovibrio_sp_970_380878131 --------LVC-----------RIKG----ARALGRPTP----------------------------DGFVVFKDSTAVL----HERPATPKRQPYVVALRKRLVD-EGILVEQD---GYLQFLH-D--AEFSSPSAAASVIHG-------------GGANGLTEWRTEL----------------VS------------------DNGANG--------------------------------------------------- HMPREF9303_0585_Prevotella_denticola_CRIS_18C-A_325482299 --PKEEHLFFT-------------KGR--GCDAKGFYHS----------------------------KGFTVLKGSTIVS----SSSPSFKWK-----NKREKMLS-EYTHSS-K---GKLELSS-D--TTFNSPSTAADFCIG-------------SSNNGWLVWKDKD----------------GN----------------TLDSVYRKQLE------------------------------------------------ M091_1691_Parabacteroides_distasonis_str_3776_D15_i_649528608 --LKEEHLFYT-------------KGR--GCNAKGFYNS----------------------------SGFTVLKGSIIVN----SSVPSFNWK-----EKREKLIN-EYTVTK-N---GLLVMES-D--KTFSSPSTAADFCIG-------------SSNNGWLVWKDKN----------------GQ----------------TLDSVYRKQLE------------------------------------------------ JCM15984_RS09020_Porphyromonas_macacae_640573949 --PKKEHLFYT-------------KGR--GCEANGFYSS----------------------------SGFTVLKGSIIAE----TPTPSFHWK-----EKRDRLIK-EYTSKK-D---GCFVVTS-D--ITFSSPSTAAMFCLG-------------RSANGWDEWKDEN----------------WK----------------TLDAIYRKQLE------------------------------------------------ M068_1150_Bacteroides_fragilis_str_J38-1_596132180 
--PKDKPIFQI-------------VSKK--CDAKGFYDT----------------------------SGFTVLKGSRISD----KSTDSLSWR-----DKRTKLIE-EYANNK-N-----FVINE-D--ITFSSPSKAADFCLG-------------SSSNGWIMWKTER----------------GQ----------------TLDSVYRKQLE------------------------------------------------ EH55_RS06500_Synergistes_jonesii_740127358 -NTKNAQILHT--------------TRN-GITALGVYSG----------------------------DKFDVLEGSEINM----DKPVHLPKY----NKQRQELLD-DGHIISEN---GKSILKI-T--LTFNTPSGASNFVLG-------------GSTNGWAEWKNSD----------------GK----------------TLDELFRKS-------------------------------------------------- HGEM_RS06435_Enorma_massiliensis_517958401 -ALADVKLFYT--------------SRR-GVRARGVYTG----------------------------DTFDVLEGSPVDL----KVKPKLDRY----EKLRQELLA-SGDLVQDG---DGGRLVK-T--VSFSTPSGAADFVLG-------------GSNNGWIEWKDGD----------------KQ----------------TLDALYRK--------------------------------------------------- CCUR_RS01535_Cryptobacterium_curtum_502483234 --PSVGSATFH--------------TKKLGVKAIGRYDKET--------------------------GKFIVFAGSQIAL-------DKSIIK-----NRIAITVR-AEQFGETT---ERTTLIN-D--VVFPSPSAAAVFVLG-------------GSQNGWTEWVDDN----------------GN----------------TLSDIYRTEEN------------------------------------------------ l13_07900_Neisseria_weaveri_ATCC_51223_343968291 TWTGRTVTVFC-------------RSRDKILRGKGLFNVET--------------------------KQILLLEGTMIYRKISETLPPGWK-------EVYQQWLA-SDFLADSKDP-DFYVLQS-E--QLCASPSMAAALVLG-------------NNRNGWQYWRTER----------------GL----------------TLDETYRKK-------------------------------------------------- T491_RS0113070_Prevotella_sp_P6B4_655526129 ---SHLIKCLL--------------TR--NASAQGLFNPAD--------------------------QSLTVLSGSKINPVHLNKISPAGR-------KKRDILFA-KYTELRN----GERIVKE-D--ICFDSPSGAAQFCVG-------------GSSNGWSQWKDEN----------------GK----------------ELDSYRSNEVAKVPAPD------------------------------------------ T495_RS0106435_Prevotella_sp_P6B1_697058363 
---SHLIKCLL--------------TR--NASAQGLFNPAD--------------------------QSLTVLSGSKINPVHLNKISPAGR-------KKRDILFA-KYTELRN----GERIVKE-D--ICFDSPSGAAQFCVG-------------GSSNGWSQWKDEN----------------GK----------------ELDSYRSNEVAKVPAPD------------------------------------------ BCH11DRAFT_RS20235_Burkholderia_sp_Ch1-1_494322155 -------------------------EP--A----WVW------------------------------KGVSFPAGTKLRA-----TFKG---------KEYMGEVH-NGAFLLD------------G--VEYTSPSAAAQSVTQ-------------SPVNGWTFWQCLR----------------PG----------------DT-QWIGINTLRR---------------------------------------------- Aam_125_009_Acidocella_aminolytica_101_=_DSM_11237_775294809 -------------------------EP--AAGMGWVS------------------------------KGVTFPDGTEFRA-----TYKG----------QHVTARVARGRLRGA------------GD-KVATSLSQAARMVTQ-------------TSVDGWTFWEVKR----------------PN----------------DL-QWQQAGTLRKKS-------------------------------------------- BI00_RS24945_Rhodococcus_fascians_739317412 IYVPANHHQRVN---AGNSS---PRQSY-DPDILALIE-----------------------------RGYM-MPGDVLVFY----QKKAQ-------RNYQARVRR-DGTITVG------------N--ETFTAVSTALGFCLG-------------YSVNGWQSWRLQR---------------TGE----------------LIHDLRLRVLGETH--------------------------------------------- G418_29712_Rhodococcus_qingshengii_BKS_20-40_452756494 --EQPASNAVV--------D---PEVTY-DDDILQLML-----------------------------FGYL-QVDQQLSY---HDIGRG--------LNFTATVTR-DGALEVN------------G--KRYGSPSAPLTELMG-------------QQRHGWRDWQLA----------------DGR----------------QLSQLRREGRTEMAW-------------------------------------------- SCP1.291c_Streptomyces_coelicolor_499350070 ---TLSLSTGA----QTSSS---SPVTP-HGPLAELMQ-----------------------------ADLI-KAGTVLTF---HQRRAK--------RSGRAVVTA-DGQLIVD------------GHASPFPSPSKAAEAVTG-------------NVINGWTLWHVEG---------------VGR----------------TLDDLRRELDSRTSR-------------------------------------------- TR51_RS33030_Kitasatospora_griseola_763031682 
---TLSLGEGA----ASAQA---APVTP-QGPLAGLMR-----------------------------ARLL-EPGAVLTF---RQRRAN--------RSGRAVVTA-DGQLVVD------------GHSSPFPSPSKAAEAVTG-------------NIINGWTLWRTS----------------DGS----------------TLDQLRQKLDAE----------------------------------------------- INTCA_RS15730_Intrasporangium_calvum_503259250 ADEDEPTPWAA------------TRTQI-PGTVADLLA-----------------------------EGLL-HAGTELRCI-----RGG--------RQGQGAIGS-DGQIIVD------------GV-G-YSTPSLAAGVSLGATNS---------TGYGGWEMWHVGS--------------LTGP----------------TLADLRAQLPKRANR-------------------------------------------- AMYBE_RS0132875_Amycolatopsis_benzoatilytica_522152436 -------GESA-----------GSAPRRAPDALAALIE-----------------------------AGLI-EVGEQLVW--------G---------GHTATVRA-GGVLHDGG-----------GHEFAVATVTSLATHLAGY-------------TANGWHLWHRAR---------------DHR----------------PLSALRTELGTPQ---------------------------------------------- OQ02_RS24335_Saccharothrix_sp_NRRL_B-16314_703488257 LHVEATGIAPLPTGPTEGRS--LGGFGG-NGALADLLA-----------------------------AGLL-YEGEEFIW---DLPGRG--------ARHTARIRS-DGTLVLAD-----------G--RAYANPSGALTALAGS------------FHGNGWVQWKRTS---------------DGR----------------SLAELRAELRTRRGLTIG----------------------------------------- A37G_RS0107850_Dehalobacter_sp_FTH1_736354164 ---------EA-----PPSP---KKPL-APGKLLPLVE-----------------------------AGLI-EEGDVLVH---ERPRKG--------DRFEATVTE-SGWLNAC------------G--VLYQYPSAALGNLVR-------------SQINGWLNWTHQP---------------SGK----------------TLRELELELEGVNKGGSN----------------------------------------- BLA_0918_Bifidobacterium_animalis_subsp_lactis_AD011_219621053 QGQSDRTGTSA------HRS--RRTSK--RVTVGEVVR-----------------------------AGLL-TVGDRFVW---NRPRKH--------EIWRITVTE-SGFRGED------------G--TEYATPTSAARAIGG-------------SSA-SLNVWKRES---------------DGR----------------ALSDIWKTYRTSM---------------------------------------------- _Bradyrhizobium_japonicum_742488091 
-VPPDHRSGFA------RAR--RRPGR--NVDLADLIN-----------------------------AGLL-QPGMSLVP---KRKK-F--------SHRVATLLA-DGRVEVD------------G--EAFANAREAATAIYG-------------KKTGGWWFFLTDP--------------ASGR----------------TLRAVRRDYIEAMAVD------------------------------------------- OP04_RS15495_Catenuloplanes_japonicus_703061045 PGRSTRTTYLI-------------DGR--RVVIRDLID-----------------------------AGLL-VPNTELSF---NRPRMR--------ESHRATVTE-TGRIRLDS-----------G--EEFQSPSRAAVAAANT------------RVLDGWRAWVVE---------------PERR----------------SLDALRQEFLDKAVADTP----------------------------------------- L942_RS08340_Amycolatopsis_orientalis_739497435 -------MHLI-------------GGR--RVTVSDLID-----------------------------AGLL-QAGERLRF---ARNRIG--------VAYDATVTA-EGRIRLASD----------G--EEFRSPSRAAMVAAGM------------RAVDGWRAWQVV---------------EQDR----------------LLDGLRQDFLDQALTGT------------------------------------------ AMIS_RS01730_Actinoplanes_missouriensis_754222116 ----------M-------------NGR--RVRILDLIK-----------------------------AGLL-NPGQELVF---ERPRIG--------EIHRAVVTD-NGRIRVAD-----------G--QEFASPSRAADDVSG-------------TGTDGWYAWRVG---------------DDGP----------------LLDQLRQELLKSAASQT------------------------------------------ SBI_RS18605_Streptomyces_bingchenggensis_759777256 ---MGRESYLL-------------EGR--RVTVGDLLE-----------------------------AGYL-NAGAKLTF---ERPRRG--------EKHHAELAA-NGKVQLSD-----------G--QLFRSPSRAAIAAVGG------------GSFDGWHAWTLD----------------DGR----------------TLDQLRQQLLDEAAQQA------------------------------------------ VVMO6_RS10310_Vibrio_vulnificus_763144244 EDIDNLVRPLL--------------DS---LTV----------------------------------KGVTFPPGTQFRA-----SYKG--------SLHHGVVE--SGNIVVN------------G--VRCKSPSKAAEAITG-------------NSVNGWKFWECKL----------------PS----------------CN-HWVAISSLRES--------------------------------------------- SACT1_4469_Streptomyces_griseus_XylebKG-1_326658949 
SLNRYAERPRY-----------LTAGR--RVTMADLLD-----------------------------AGLI-TEGTQLTF---ER--AG--------ARYSARVTA-AGRLELVG-----------G--QQFPSPNRAAAAAVGE------------GTVDGWQSWALE----------------DGT----------------TLDRLRQRFLGTSTVASS----------------------------------------- C569_RS0103420_Micromonospora_sp_CNB394_648575907 -MSNGEPRPRRTH---------LLDGR--RVRISDLMD-----------------------------ANLL-KAGDDLYF---QQRIGD--------PPHQATVTE-RGRLRLQD-----------G--REFSTPSKAGAVVARR------------RAVAGWSAWQVG---------------IGGR----------------TLHQLRLQLLRDVAAEVTANEEVPR---AETA--------------------------- KUTG_05615_Kutzneria_sp_744_585093370 --LTGSRSPKA--PSARTKV--SAAER--ALTVADLID-----------------------------AGLL-STRRPLTA-----EWRR--------EQRQAELLS-DGSLRFN------------G--QNYKSLSSAGEAVKMDIAGLDLKEST--RATDGWEFWTAP-------------DPTSGEPE--------------RLKELRRRLADQS---------------------------------------------- Isova_2668_Isoptericola_variabilis_225_334108481 YGSYGSYDTGYPTYPHVP----TDDEE---DEDLVAL------------------------------AGRLGRPTALVWS----RPRRR--------QHFEGTLHP-DGTIELAD-----------G--RRYRNPDAAASAAAGTP-----------T-SDGWGVWRLG---------------AAGP----------------TLLEAYRQHFA------------------------------------------------ XCEL_RS15100_Xylanimonas_cellulosilytica_502643352 EPQPQQQAPMV-----------LDEDA---DADLAAL------------------------------AARFGVPTALVWE----RPRRG--------QRYDATLHP-DGTLELYG-----------G--GRYRHPDVAASAASGSY-----------T-ADGWTVWRVA---------------ATGE----------------TLAEAFRARFA------------------------------------------------ H291_RS0125805_Promicromonospora_sukumoe_649278169 GESIRASTPML-----------FEEED---DPDLEAL------------------------------ARSIGTPTRIVWS----RPRRN--------QHFEAMLLP-DGAIELAN-----------G--ARYRHPDSAATAASGSY-----------T-ADGWSVWRLG---------------DTGP----------------TLVEEFSRRFA------------------------------------------------ CELF_RS02725_Cellulomonas_fimi_503535640 
DDDEPQDAPTF-----------LDAP----HPELATL------------------------------AKRRRAVTTLVWV----RERRG--------QRFEAMLRP-DGYVELED-----------G--SVHADPDVAAAAVIGAE-----------SSVDGWRAWRLG---------------DGGP----------------TLAEATGVDRA------------------------------------------------ N866_19315_Actinotalea_ferrariae_CF5-4_601042837 TGPVPVNHPIAVVPGASAGSPATGEHRQGPDPRLVEI------------------------------ASRLDGPGRLVWV----RRRRG--------ERYDAVLHP-DGVVETR------------G--RRFGDPDRAAAFASGG------------TVVDGWSVWRVG--------------HDDGP----------------SLGDLLRGSQLDGARR------------------------------------------- HMPREF1979_RS00220_Actinomyces_johnsonii_545335724 RGSADALDPVAEALGGDPS-KSLVVQE---TAELAAV------------------------------AATLGEPTQLIWQ----RLRRG--------IYHEAMLSV-EGVITLSD-----------G--RSFTDPTSAANAAQDV------------TDADGWRVWRVG---------------VRGA----------------HLGDLRDDLADRSS--------------------------------------------- W5W_RS0104910_Actinomyces_massiliensis_515761987 DRNHRAAGPVL--GAQPTVP--SGGKS---AAALASV------------------------------ASRINTPATLVWQ----RVRRG--------IHHEAVLNA-DGIITLSN-----------G--MRFRDPSAAANAAQHT------------QDIDGWRVWRIG---------------AQGP----------------ALRDFIDDQG------------------------------------------------- HMPREF0045_01501_Actinomyces_graevenitzii_C83_365257398 LSRRDSGQSTVSRSHP------SGEDS---RLALNAL------------------------------ASILSEPVQITWQ----SVTEG--------IFHTAQLRP-DGMIRVSD-----------G--TSFDEPGQAAHHCEPA------------KSVDGWDVWRFG---------------ADGP----------------SLYESLEELIAAAERSPRRPGRPVR---SRRQ--------------------G--R--- HMPREF0574_0913_Mobiluncus_curtisii_subsp_curtisii_ATCC_35241_304326663 GAGQSDEFDLL-----------YRDAT---GLGIIAQ------------------------------VTGEDTPLVALIDF----NGTP--------AEVTAILAE-RGVIILE------------G--REFHDPSDAAREL-G-------------QDVDGWEFWHLG--------------FSEGP----------------TLAEAQAEINAEIQRN---R--------------------------------------- AURANDRAFT_62586_Aureococcus_anophagefferens_676383955 
-----EPGKGA------------RAAK---ALTLGEMVA----------------------------RGLV-APGAGVIS---LAR-RP---------DVVADLL--DGGAIRH------------GA-ATYASPTAFATAVIGK------------SVRKGLKAVTYN-----------------GA----------------NLDELRGAAVRGTAAAPAP---P------------------------------------ BGIM_RS0130265_Zavarzinella_formosa_521961820 --------VLA----EGDLS-REARQRAAVIADDADLRLTPPREGTGEPKAFGGQTIATSQSLPQTRDRRL-PPAGTVLT----RAYQG--------RTIRVTVAA-DGF-EFD------------G--EMFGSLSAVAKSVTG-------------SHCNGFAFFKL------------------GG----------------KS--------------------------------------------------------- _Nocardioides_sp_JGI_0001009-J09_655234456 ---------LA----QGDLS-ERAKQRAAEIANDADLRLMPPVVAPGATPRPVPQPAASKSH-----DPRL-PPVGTILN----RPYKG--------RAVQVRVLT-DGF-ECD------------G--KVYSSLSSLAKEITG-------------SHCNGFAFFKLT----------------KGG----------------KE--------------------------------------------------------- B038_RS0115505_Martelella_mediterranea_516723200 -----------------SLA---ASPI--AEGRSWVGKGKS--------------------------AGLMLPHGTDLQM-----VYNG--------QHFTGHVD--NGSLVLE------------G--QRFSSPSGAADELCRTRDGKK-------TSLNGKELIQVRL----------------PG----------------ES-EWQL---------------------------------------------------- RSMK_RS04070_Ralstonia_solanacearum_489363811 -------ELAY-----GALPAGLRRHL---VEAGARLSKIKTATGRG--------------------SQLVLMPGTTLIR-----EWDE--------REYRVTVTP-DGLFELN------------G--QVFKSLSAAARHITG-------------TQWNGPRFFGLRD----------------GK----------------GGTR------------------------------------------------------- JL55_RS07420_Pseudomonas_chloritidismutans_757691291 -------EQAY-----GPLPAGVRRYL---VERGAQFSKIQQA-GRG--------------------TECHLMPGTVLVR-----EWDE--------REYRVTVTA-DGLYDLN------------G--QRFKSLSAAARHITG-------------TQWSGPKFFGLKP----------------GK----------------GGKQ------------------------------------------------------- MMC1_RS13375_Magnetococcus_marinus_500033502 
-------ELAW-----GGLSETTKAKL---EAQAAEEKTMDRTPNPIRN------------------DGLP-VAGTRLVR-----EWKG--------VEHSCTVLD-DGF-EYQ------------G--RKFKSLSAAARAVTG-------------TRWNGKIFW-LGG----------------KK--------------------------------------------------------------------------- HMPREF0731_0014_Roseomonas_cervicalis_ATCC_49957_296267968 -------ELAY-----GGLKPETVARL---EALGEQLDGGNVVLRRLRAG-----------------SDRP-VIGTRLIR-----EYQG--------VQHGVTVLA-DGF-EYE------------G--RPYRSLSAIARHITG-------------TRWNGWAFFGLKA----------------QR----------------GSA-------------------------------------------------------- B038_RS0116660_Martelella_mediterranea_516723468 --MLFK-EAQK-----IWLAPQRKDGF---VKTSEGLAGMTTEVR----------------------KGFP-KDGTKCDF-----TYGE--------TIYRGEIV--SGSIVLA------------GVAEQFGSFSAASKHITK-------------TSRNGWNDWYLDL----------------PN----------------GQ-RMLADHWRKSSEA------------------------------------------- IG10_RS0113630_Streptomyces_griseus_664177934 EQLIVE-GPTA-----PTLVG-AEIQE---LRTKRGVA-----------------------------IHAV-YEGQRVDA------------------YYDPLSRV-VRIPSGP------------GR-GEYETPSGAAVAVVHVLNPHVN------PNRNGWNFWTVTA---------------TGR----------------LLQSIR----------------------------------------------------- FJSC11DRAFT_3600_Fischerella_sp_JSC-11_353541191 SKQLHI-FESP-----TQEE-NAEHSQ---VRLSQLFD-----------------------------AGIT-KKGMSVRVKLKREVAKK----LQRDYINGLEISV-KGTIVYN------------G--EEFDKPSPLAAKING-------------GAINGWEYIEVKK----------------DDK-------------WVRLEELRKIWRKTNG--------------------------------------------- HMPREF1650_RS09705_Corynebacterium_freneyi_737135649 GVTSVPTVDSS-----AQLE--ERYDE---PTLAELVE-----------------------------KGLL-RPGALLDP-----VDPG--------WEVDAVIDD-DGTLVID------------GV-HQFDSLTEATHSLGV-------------TNMSGLSFWALET----------------SER-------------LVPLAELVASDTRIPR--------------------------------------------- UPA14_RS00020_Ureaplasma_parvum_493739841 
ISAKIG-VVEN-----SFFD--QKPIR---VSMFEMIS-----------------------------DGYF-KLGEYFIN-------SN--------GEKAKLAKA-NGWLEYQ------------G---EINSMHEVAAKMIGRE-----------RRVNAFNYLFVER---------------DGE--------------IISINKIRENYRQHLIAKS------------------------------------------ T403_RS0103085_Mycoplasma_collis_697093027 TQKQIG-NVEN-----AVFD--IKPIK---VNFSSMVE-----------------------------KNFF-FLNEKFYH-------KN--------GQSAELVDP-KGKLKYK------------N---DISSMHEIAAKMMNRN-----------LKVNAYDYLFVKR----------------NEK-------------LISIAKIREDYRKILKAN------------------------------------------- TREVI0001_RS02010_Treponema_vincentii_493197799 TKEMIG-NIEK-----ATFD--IKPIK---VDFIDLIK-----------------------------NNFL-LPDEKFFL-------KN--------SDSFAILKS-DGKIELP------------S--NIVTDIHKGAAILGNKKA----------ARVNGFDFWYVER----------------NNK-------------RKSIKDIREDYRKIIAG-------------------------------------------- RB2501_15584_Robiginitalea_biformata_HTCC2501_88784593 KEILTG--SYD-----KNFN-FFLSLR--AEAIFKVIKQEILDKKEEIQKEFYSIPQVDST------KEIA-IFGS-YYK-------KR---------IEARFNTK-TGSVFYN------------G--KLYETPSSAANQAKIDCGAHSG------ITSSGWTFWKFIN----------------ENDNT-----------EEQIDVLRKISEAY----------------------------------------------- WH82_RS16745_Geobacillus_thermoglucosidasius_755032699 TEIIDG--AYD-----DFYI-AFLEKR--AEAIFKLVEKYIIAKEQKIVDLFYQPPK--TK------GNIK-IFAS-YYN-------KK---------VEAIFDIE-TQQVHYN------------G---EVLSVSAAADKAKYNLSGKDN------TSTNGWRFWKYIN----------------EQN-Q-----------ERYIDDFRK---------------------------------------------------- BN613_00479_Cryptobacterium_sp_CAG:338_548076194 NISERG-VRLA-----DKCV--ATWPI--PKQYQSLVEVSDVNSGRRSPFRFS--------------MVGL-SEGDVVTF------ADD--------SSKTAVITD-DSHVEYC------------G---EIFSLTALAMKLLKSS-----------HSVQGPFYFMYDG----------------E-----------------RLSELREQVESGMF--------------------------------------------- consensus/100% 
.....................................................................................................................................................................h................................................................................................. consensus/95% ..........................................................................s............................h.............................s...h...h...................uh..h................................................................................................. consensus/90% ....................................h.....................................s..h.........................l...pu.h......................s.p.hs..h..................pua..h....................s..................l..h...................................................... consensus/85% ....................................h................................hh...s..h.......................s.l...pu.l...............s......s.p.hu..h..................sGa..h....................s..................l..hb..................................................... consensus/80% ....................................h...............................shh...Gp.h.......................s.l...pG.l...............s....h.o.p.hu..h..................sGW..h....................s..................l..hb..................................................... consensus/75% ..........h...............p....sh..hhp..............................shl...Gp.h..........s.........p..s.l...sG.l...............s....h.o.p.hu..h.................ssGW..hp...................u..................lp.hb..................................................... consensus/70% ..........h..............sp...hsh..hhp..............................shl..sGp.h..........s.........p..s.l.s.sG.l...............s....a.oss.hu..h...............psssGWp.hph..................Gp.................Lsphb.p...................................................
--Eukaryotic versions----
GI	Architectures	Pfam-archs	Gene name	Len	Taxonomy	Species	Genbank
# 174; MPND-like with fusions to JAB
167524112	RAMA+JAB	AvrE	MONBRDRAFT_26083	1097	eukaryota>choanoflagellida	Monosiga brevicollis MX1	hypothetical protein [Monosiga brevicollis MX1].
391344637	RAMA+JAB	Prok-JAB	LOC100902139	519	eukaryota>metazoa	Metaseiulus occidentalis	PREDICTED: MPN domain-containing protein-like [Metaseiulus occidentalis].
761911769	RAMA+JAB	Prok-JAB	LOC100641685	387	eukaryota>metazoa	Amphimedon queenslandica	PREDICTED: MPN domain-containing protein-like [Amphimedon queenslandica].
241173425	RAMA+JAB	JAB	IscW_ISCW016991	405	eukaryota>metazoa	Ixodes scapularis	MPN domain-containing protein, putative, partial [Ixodes scapularis].
443694149	RAMA+JAB	JAB	CAPTEDRAFT_110378	411	eukaryota>metazoa>annelida	Capitella teleta	hypothetical protein CAPTEDRAFT_110378 [Capitella teleta].
198425307	RAMA+JAB	FAP	LOC100178477	470	eukaryota>metazoa>chordata	Ciona intestinalis	PREDICTED: MPN domain-containing protein-like [Ciona intestinalis].
260806149	RAMA+JAB	JAB	BRAFLDRAFT_221394	432	eukaryota>metazoa>chordata	Branchiostoma floridae	hypothetical protein BRAFLDRAFT_221394, partial [Branchiostoma floridae].
612011725	RAMA+JAB	Prok-JAB	MPND	568	eukaryota>metazoa>chordata>vertebrata	Monodelphis domestica	PREDICTED: MPN domain-containing protein, partial [Monodelphis domestica].
444509502	RAMA+JAB	Prok-JAB	TREES_T100006078	567	eukaryota>metazoa>chordata>vertebrata	Tupaia chinensis	MPN domain-containing protein [Tupaia chinensis].
586477103	RAMA+JAB	Prok-JAB	MPND	547	eukaryota>metazoa>chordata>vertebrata	Chrysochloris asiatica	PREDICTED: MPN domain-containing protein [Chrysochloris asiatica].
635035272	RAMA+JAB	GRP	MPND	530	eukaryota>metazoa>chordata>vertebrata	Chlorocebus sabaeus	PREDICTED: MPN domain-containing protein [Chlorocebus sabaeus].
741880389	RAMA+JAB	Prok-JAB	MPND	529	eukaryota>metazoa>chordata>vertebrata	Bos taurus	PREDICTED: MPN domain-containing protein isoform X1 [Bos taurus].
513008759	RAMA+JAB	GRP	Mpnd	523	eukaryota>metazoa>chordata>vertebrata	Heterocephalus glaber	PREDICTED: MPN domain-containing protein isoform X1 [Heterocephalus glaber].
729729621	RAMA+JAB	Prok-JAB	MPND	523	eukaryota>metazoa>chordata>vertebrata	Haliaeetus leucocephalus	PREDICTED: MPN domain-containing protein [Haliaeetus leucocephalus].
543377065	RAMA+JAB	Prok-JAB	MPND	588	eukaryota>metazoa>chordata>vertebrata	Pseudopodoces humilis	PREDICTED: MPN domain-containing protein [Pseudopodoces humilis].
507541562	RAMA+JAB	Prok-JAB	Mpnd	511	eukaryota>metazoa>chordata>vertebrata	Jaculus jaculus	PREDICTED: MPN domain-containing protein [Jaculus jaculus].
569001445	RAMA+JAB	Prok-JAB	Mpnd	511	eukaryota>metazoa>chordata>vertebrata	Mus musculus	PREDICTED: MPN domain-containing protein isoform X1 [Mus musculus].
426386710	RAMA+JAB	DUF2763	MPND	508	eukaryota>metazoa>chordata>vertebrata	Gorilla gorilla gorilla	PREDICTED: MPN domain-containing protein [Gorilla gorilla gorilla].
632966309	RAMA+JAB	Prok-JAB	mpnd	507	eukaryota>metazoa>chordata>vertebrata	Callorhinchus milii	PREDICTED: MPN domain-containing protein [Callorhinchus milii].
525025711	RAMA+JAB	Adeno_terminal	MPND	503	eukaryota>metazoa>chordata>vertebrata	Ficedula albicollis	PREDICTED: MPN domain-containing protein [Ficedula albicollis].
344237591	RAMA+JAB	Prok-JAB	I79_008167	502	eukaryota>metazoa>chordata>vertebrata	Cricetulus griseus	MPN domain-containing protein [Cricetulus griseus].
564243961	RAMA+JAB	Prok-JAB	MPND	501	eukaryota>metazoa>chordata>vertebrata	Alligator mississippiensis	PREDICTED: MPN domain-containing protein [Alligator mississippiensis].
664805955	RAMA+JAB	DUF2763	MPND	501	eukaryota>metazoa>chordata>vertebrata	Homo sapiens	MPN domain-containing protein isoform 3 [Homo sapiens].
694972324	RAMA+JAB	GRP	MPND	501	eukaryota>metazoa>chordata>vertebrata	Pan troglodytes	PREDICTED: MPN domain-containing protein [Pan troglodytes].
667305367	RAMA+JAB	Prok-JAB	MPND	499	eukaryota>metazoa>chordata>vertebrata	Galeopterus variegatus	PREDICTED: MPN domain-containing protein [Galeopterus variegatus].
4581082	RAMA+JAB	DUF2763	-	498	eukaryota>metazoa>chordata>vertebrata	Homo sapiens	R31167_1, partial protein, partial [Homo sapiens].
676421995	RAMA+JAB	Prok-JAB	N302_03640	295	eukaryota>metazoa>chordata>vertebrata	Corvus brachyrhynchos	MPN domain-containing protein, partial [Corvus brachyrhynchos].
669290398	RAMA+JAB	Prok-JAB	MPND	322	eukaryota>metazoa>chordata>vertebrata	Corvus brachyrhynchos	PREDICTED: MPN domain-containing protein [Corvus brachyrhynchos].
395518304	RAMA+JAB	Prok-JAB	LOC100916797	333	eukaryota>metazoa>chordata>vertebrata	Sarcophilus harrisii	PREDICTED: MPN domain-containing protein-like, partial [Sarcophilus harrisii].
677295082	RAMA+JAB	Adeno_terminal	N310_04334	358	eukaryota>metazoa>chordata>vertebrata	Acanthisitta chloris	MPN domain-containing protein, partial [Acanthisitta chloris].
146134497	RAMA+JAB	Prok-JAB	Mpnd	487	eukaryota>metazoa>chordata>vertebrata	Mus musculus	MPN domain-containing protein [Mus musculus].
507697284	RAMA+JAB	Prok-JAB	MPND	372	eukaryota>metazoa>chordata>vertebrata	Echinops telfairi	PREDICTED: MPN domain-containing protein [Echinops telfairi].
664779050	RAMA	-	LOC103543755	376	eukaryota>metazoa>chordata>vertebrata	Equus przewalskii	PREDICTED: MPN domain-containing protein-like, partial [Equus przewalskii].
683465952	RAMA+JAB	Adeno_terminal	N321_06775	377	eukaryota>metazoa>chordata>vertebrata	Caprimulgus carolinensis	MPN domain-containing protein, partial [Caprimulgus carolinensis].
548473217	RAMA+JAB	Prok-JAB	MPND	377	eukaryota>metazoa>chordata>vertebrata	Capra hircus	PREDICTED: MPN domain-containing protein, partial [Capra hircus].
675756747	RAMA+JAB	Prok-JAB	LOC100402569	389	eukaryota>metazoa>chordata>vertebrata	Callithrix jacchus	PREDICTED: MPN domain-containing protein [Callithrix jacchus].
759183571	RAMA	-	MPND	393	eukaryota>metazoa>chordata>vertebrata	Pteropus vampyrus	PREDICTED: MPN domain-containing protein [Pteropus vampyrus].
641763616	RAMA+JAB	Prok-JAB	MPND	395	eukaryota>metazoa>chordata>vertebrata	Chrysemys picta bellii	PREDICTED: MPN domain-containing protein isoform X2 [Chrysemys picta bellii].
554528053	RAMA+JAB	Prok-JAB	MPND	399	eukaryota>metazoa>chordata>vertebrata	Myotis brandtii	PREDICTED: MPN domain-containing protein [Myotis brandtii].
478534656	RAMA+JAB	Prok-JAB	MPND	399	eukaryota>metazoa>chordata>vertebrata	Ceratotherium simum simum	PREDICTED: MPN domain-containing protein [Ceratotherium simum simum].
640778776	RAMA+JAB	Prok-JAB	MPND	400	eukaryota>metazoa>chordata>vertebrata	Tarsius syrichta	PREDICTED: MPN domain-containing protein [Tarsius syrichta].
585173801	RAMA+JAB	Prok-JAB	MPND	400	eukaryota>metazoa>chordata>vertebrata	Leptonychotes weddellii	PREDICTED: MPN domain-containing protein [Leptonychotes weddellii].
560906146	RAMA+JAB	Prok-JAB	MPND	400	eukaryota>metazoa>chordata>vertebrata	Camelus ferus	PREDICTED: LOW QUALITY PROTEIN: MPN domain containing, partial [Camelus ferus].
676589955	RAMA+JAB	Adeno_terminal	N303_15020	404	eukaryota>metazoa>chordata>vertebrata	Cuculus canorus	MPN domain-containing protein, partial [Cuculus canorus].
351711696	RAMA+JAB	Prok-JAB	GW7_06760	592	eukaryota>metazoa>chordata>vertebrata	Heterocephalus glaber	MPN domain-containing protein [Heterocephalus glaber].
281349777	RAMA+JAB	Prok-JAB	PANDA_018471	406	eukaryota>metazoa>chordata>vertebrata	Ailuropoda melanoleuca	hypothetical protein PANDA_018471, partial [Ailuropoda melanoleuca].
543725811	RAMA+JAB	Adeno_terminal	MPND	407	eukaryota>metazoa>chordata>vertebrata	Columba livia	PREDICTED: MPN domain-containing protein, partial [Columba livia].
511983578	RAMA+JAB	Prok-JAB	MPND	407	eukaryota>metazoa>chordata>vertebrata	Mustela putorius furo	PREDICTED: MPN domain-containing protein, partial [Mustela putorius furo].
395831687	RAMA+JAB	Prok-JAB	MPND	407	eukaryota>metazoa>chordata>vertebrata	Otolemur garnettii	PREDICTED: MPN domain-containing protein [Otolemur garnettii].
591296222	RAMA+JAB	-	MPND	408	eukaryota>metazoa>chordata>vertebrata	Panthera tigris altaica	PREDICTED: MPN domain-containing protein, partial [Panthera tigris altaica].
694857258	RAMA+JAB	Adeno_terminal	MPND	409	eukaryota>metazoa>chordata>vertebrata	Nipponia nippon	PREDICTED: MPN domain-containing protein [Nipponia nippon].
355702999	RAMA+JAB	Prok-JAB	EGK_09937	410	eukaryota>metazoa>chordata>vertebrata	Macaca mulatta	hypothetical protein EGK_09937, partial [Macaca mulatta].
555967973	RAMA+JAB	Prok-JAB	MPND	411	eukaryota>metazoa>chordata>vertebrata	Bos mutus	PREDICTED: MPN domain-containing protein, partial [Bos mutus].
532030006	RAMA+JAB	Prok-JAB	Mpnd	601	eukaryota>metazoa>chordata>vertebrata	Microtus ochrogaster	PREDICTED: MPN domain-containing protein [Microtus ochrogaster].
465951216	RAMA+JAB	-	UY3_18872	413	eukaryota>metazoa>chordata>vertebrata	Chelonia mydas	MPN domain-containing protein [Chelonia mydas].
532093187	RAMA+JAB	Prok-JAB	Mpnd	494	eukaryota>metazoa>chordata>vertebrata	Ictidomys tridecemlineatus	PREDICTED: MPN domain-containing protein [Ictidomys tridecemlineatus].
354479184	RAMA+JAB	Prok-JAB	Mpnd	487	eukaryota>metazoa>chordata>vertebrata	Cricetulus griseus	PREDICTED: MPN domain-containing protein isoform X2 [Cricetulus griseus].
524963049	RAMA+JAB	Prok-JAB	Mpnd	487	eukaryota>metazoa>chordata>vertebrata	Mesocricetus auratus	PREDICTED: MPN domain-containing protein [Mesocricetus auratus].
589939100	RAMA+JAB	Prok-JAB	Mpnd	487	eukaryota>metazoa>chordata>vertebrata	Peromyscus maniculatus bairdii	PREDICTED: MPN domain-containing protein isoform X2 [Peromyscus maniculatus bairdii].
211826729	RAMA+JAB	Prok-JAB	Mpnd	486	eukaryota>metazoa>chordata>vertebrata	Mus musculus	Mpnd protein, partial [Mus musculus].
146134361	RAMA+JAB	Prok-JAB	Mpnd	487	eukaryota>metazoa>chordata>vertebrata	Rattus norvegicus	MPN domain-containing protein [Rattus norvegicus].
641763614	RAMA+JAB	Prok-JAB	MPND	484	eukaryota>metazoa>chordata>vertebrata	Chrysemys picta bellii	PREDICTED: MPN domain-containing protein isoform X1 [Chrysemys picta bellii].
471381804	RAMA+JAB	Prok-JAB	MPND	482	eukaryota>metazoa>chordata>vertebrata	Trichechus manatus latirostris	PREDICTED: MPN domain-containing protein [Trichechus manatus latirostris].
557264364	RAMA+JAB	Prok-JAB	MPND	481	eukaryota>metazoa>chordata>vertebrata	Alligator sinensis	PREDICTED: MPN domain-containing protein [Alligator sinensis].
556970552	RAMA+JAB	JAB	MPND	480	eukaryota>metazoa>chordata>vertebrata	Latimeria chalumnae	PREDICTED: MPN domain-containing protein [Latimeria chalumnae].
602649789	RAMA+JAB	Prok-JAB	MPND	480	eukaryota>metazoa>chordata>vertebrata	Python bivittatus	PREDICTED: MPN domain-containing protein [Python bivittatus].
589939098	RAMA+JAB	Prok-JAB	Mpnd	488	eukaryota>metazoa>chordata>vertebrata	Peromyscus maniculatus bairdii	PREDICTED: MPN domain-containing protein isoform X1 [Peromyscus maniculatus bairdii].
533186386	RAMA+JAB	Prok-JAB	Mpnd	476	eukaryota>metazoa>chordata>vertebrata	Chinchilla lanigera	PREDICTED: MPN domain-containing protein [Chinchilla lanigera].
472351398	RAMA+JAB	API5	MPND	474	eukaryota>metazoa>chordata>vertebrata	Odobenus rosmarus divergens	PREDICTED: MPN domain-containing protein [Odobenus rosmarus divergens].
558220477	RAMA+JAB	Prok-JAB	MPND	474	eukaryota>metazoa>chordata>vertebrata	Pelodiscus sinensis	PREDICTED: MPN domain-containing protein [Pelodiscus sinensis].
513008765	RAMA+JAB	GRP	Mpnd	473	eukaryota>metazoa>chordata>vertebrata	Heterocephalus glaber	PREDICTED: MPN domain-containing protein isoform X4 [Heterocephalus glaber].
578833691	RAMA+JAB	DUF2763	MPND	472	eukaryota>metazoa>chordata>vertebrata	Homo sapiens	PREDICTED: MPN domain-containing protein isoform X1 [Homo sapiens].
296485713	RAMA+JAB	Prok-JAB	BOS_7727	585	eukaryota>metazoa>chordata>vertebrata	Bos taurus	TPA: CG4751-like [Bos taurus].
544507859	RAMA+JAB	Bacteriocin_IIc	MPND	471	eukaryota>metazoa>chordata>vertebrata	Macaca fascicularis	PREDICTED: MPN domain-containing protein isoform X1 [Macaca fascicularis].
513008761	RAMA+JAB	GRP	Mpnd	495	eukaryota>metazoa>chordata>vertebrata	Heterocephalus glaber	PREDICTED: MPN domain-containing protein isoform X2 [Heterocephalus glaber].
562889145	RAMA+JAB	Prok-JAB	MPND	718	eukaryota>metazoa>chordata>vertebrata	Tupaia chinensis	PREDICTED: MPN domain-containing protein [Tupaia chinensis].
545535184	RAMA+JAB	Prok-JAB	MPND	470	eukaryota>metazoa>chordata>vertebrata	Canis lupus familiaris	PREDICTED: MPN domain-containing protein [Canis lupus familiaris].
663260198	RAMA+JAB	Prok-JAB	MPND	489	eukaryota>metazoa>chordata>vertebrata	Calypte anna	PREDICTED: MPN domain-containing protein [Calypte anna].
74183910	RAMA+JAB	Prok-JAB	-	467	eukaryota>metazoa>chordata>vertebrata	Mus musculus	unnamed protein product [Mus musculus].
507966202	RAMA+JAB	Prok-JAB	MPND	467	eukaryota>metazoa>chordata>vertebrata	Condylura cristata	PREDICTED: MPN domain-containing protein [Condylura cristata].
589939102	RAMA+JAB	Prok-JAB	Mpnd	467	eukaryota>metazoa>chordata>vertebrata	Peromyscus maniculatus bairdii	PREDICTED: MPN domain-containing protein isoform X3 [Peromyscus maniculatus bairdii].
744621045	RAMA+JAB	Prok-JAB	MPND	490	eukaryota>metazoa>chordata>vertebrata	Camelus dromedarius	PREDICTED: MPN domain-containing protein [Camelus dromedarius].
156717812	RAMA+JAB	Prok-JAB	mpnd	466	eukaryota>metazoa>chordata>vertebrata	Xenopus (Silurana) tropicalis	MPN domain-containing protein [Xenopus (Silurana) tropicalis].
634876687	RAMA+JAB	GRP	MPND	466	eukaryota>metazoa>chordata>vertebrata	Orycteropus afer afer	PREDICTED: MPN domain-containing protein isoform X1 [Orycteropus afer afer].
625249800	RAMA+JAB	Prok-JAB	Mpnd	465	eukaryota>metazoa>chordata>vertebrata	Cricetulus griseus	PREDICTED: MPN domain-containing protein isoform X3 [Cricetulus griseus].
724844417	RAMA+JAB	Bacteriocin_IIc	MPND	465	eukaryota>metazoa>chordata>vertebrata	Rhinopithecus roxellana	PREDICTED: MPN domain-containing protein [Rhinopithecus roxellana].
731512762	RAMA+JAB	GRP	MPND	465	eukaryota>metazoa>chordata>vertebrata	Loxodonta africana	PREDICTED: MPN domain-containing protein isoform X2 [Loxodonta africana].
602697096	RAMA+JAB	GRP	MPND	463	eukaryota>metazoa>chordata>vertebrata	Lipotes vexillifer	PREDICTED: MPN domain-containing protein [Lipotes vexillifer].
752433833	RAMA+JAB	Prok-JAB	MPND	463	eukaryota>metazoa>chordata>vertebrata	Ailuropoda melanoleuca	PREDICTED: MPN domain-containing protein [Ailuropoda melanoleuca].
594035787	RAMA+JAB	GRP	MPND	461	eukaryota>metazoa>chordata>vertebrata	Bubalus bubalis	PREDICTED: MPN domain-containing protein isoform X2 [Bubalus bubalis].
335282406	RAMA+JAB	Prok-JAB	MPND	460	eukaryota>metazoa>chordata>vertebrata	Sus scrofa	PREDICTED: MPN domain-containing protein isoform 2 [Sus scrofa].
768397034	RAMA+JAB	Prok-JAB	MPND	491	eukaryota>metazoa>chordata>vertebrata	Aquila chrysaetos canadensis	PREDICTED: MPN domain-containing protein [Aquila chrysaetos canadensis].
742217815	RAMA+JAB	Prok-JAB	MPND	491	eukaryota>metazoa>chordata>vertebrata	Bison bison bison	PREDICTED: MPN domain-containing protein [Bison bison bison].
675752711	RAMA+JAB	Prok-JAB	MPND	457	eukaryota>metazoa>chordata>vertebrata	Pan paniscus	PREDICTED: MPN domain-containing protein [Pan paniscus].
699627942	RAMA+JAB	Prok-JAB	MPND	457	eukaryota>metazoa>chordata>vertebrata	Picoides pubescens	PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein [Picoides pubescens].
594035785	RAMA+JAB	GRP	MPND	491	eukaryota>metazoa>chordata>vertebrata	Bubalus bubalis	PREDICTED: MPN domain-containing protein isoform X1 [Bubalus bubalis].
358413003	RAMA+JAB	Prok-JAB	MPND	491	eukaryota>metazoa>chordata>vertebrata	Bos taurus	PREDICTED: MPN domain-containing protein isoform X2 [Bos taurus].
743741068	RAMA+JAB	Prok-JAB	MPND	492	eukaryota>metazoa>chordata>vertebrata	Camelus bactrianus	PREDICTED: MPN domain-containing protein [Camelus bactrianus].
696986250	RAMA+JAB	Adeno_terminal	MPND	492	eukaryota>metazoa>chordata>vertebrata	Cuculus canorus	PREDICTED: MPN domain-containing protein [Cuculus canorus].
426230708	RAMA+JAB	GRP	MPND	497	eukaryota>metazoa>chordata>vertebrata	Ovis aries	PREDICTED: MPN domain-containing protein [Ovis aries].
674057368	RAMA+JAB	Prok-JAB	Mpnd	492	eukaryota>metazoa>chordata>vertebrata	Nannospalax galili	PREDICTED: MPN domain-containing protein isoform X2 [Nannospalax galili].
586519820	RAMA+JAB	Prok-JAB	MPND	454	eukaryota>metazoa>chordata>vertebrata	Pteropus alecto	PREDICTED: MPN domain-containing protein [Pteropus alecto].
672063995	RAMA+JAB	Prok-JAB	Mpnd	493	eukaryota>metazoa>chordata>vertebrata	Rattus norvegicus	PREDICTED: MPN domain-containing protein isoform X1 [Rattus norvegicus].
513008763	RAMA+JAB	GRP	Mpnd	493	eukaryota>metazoa>chordata>vertebrata	Heterocephalus glaber	PREDICTED: MPN domain-containing protein isoform X3 [Heterocephalus glaber].
14042888	RAMA+JAB	DUF2763	-	451	eukaryota>metazoa>chordata>vertebrata	Homo sapiens	unnamed protein product [Homo sapiens].
229577180	RAMA+JAB	DUF2763	MPND	451	eukaryota>metazoa>chordata>vertebrata	Homo sapiens	MPN domain-containing protein isoform 2 [Homo sapiens].
544507861	RAMA+JAB	Bacteriocin_IIc	MPND	451	eukaryota>metazoa>chordata>vertebrata	Macaca fascicularis	PREDICTED: MPN domain-containing protein isoform X2 [Macaca fascicularis].
466045649	RAMA+JAB	GRP	MPND	493	eukaryota>metazoa>chordata>vertebrata	Orcinus orca	PREDICTED: MPN domain-containing protein [Orcinus orca].
731512760	RAMA+JAB	GRP	MPND	494	eukaryota>metazoa>chordata>vertebrata	Loxodonta africana	PREDICTED: MPN domain-containing protein isoform X1 [Loxodonta africana].
4581083	RAMA+JAB	DUF2763	-	448	eukaryota>metazoa>chordata>vertebrata	Homo sapiens	R31167_2, partial protein, partial [Homo sapiens].
530667319	RAMA+JAB	Prok-JAB	CB1_000413045	448	eukaryota>metazoa>chordata>vertebrata	Camelus ferus	hypothetical protein CB1_000413045 [Camelus ferus].
634876690	RAMA+JAB	GRP	MPND	446	eukaryota>metazoa>chordata>vertebrata	Orycteropus afer afer	PREDICTED: MPN domain-containing protein isoform X2 [Orycteropus afer afer].
676274283	RAMA+JAB	Prok-JAB	H920_09774	446	eukaryota>metazoa>chordata>vertebrata	Fukomys damarensis	MPN domain-containing protein [Fukomys damarensis].
440905924	RAMA+JAB	Prok-JAB	M91_12016	443	eukaryota>metazoa>chordata>vertebrata	Bos mutus	MPN domain-containing protein, partial [Bos mutus].
505852230	RAMA+JAB	Prok-JAB	MPND	441	eukaryota>metazoa>chordata>vertebrata	Sorex araneus	PREDICTED: MPN domain-containing protein [Sorex araneus].
591355969	RAMA+JAB	-	MPND	441	eukaryota>metazoa>chordata>vertebrata	Chelonia mydas	PREDICTED: MPN domain-containing protein [Chelonia mydas].
755695164	RAMA+JAB	Prok-JAB	MPND	441	eukaryota>metazoa>chordata>vertebrata	Felis catus	PREDICTED: MPN domain-containing protein, partial [Felis catus].
449281945	RAMA+JAB	Adeno_terminal	A306_02191	439	eukaryota>metazoa>chordata>vertebrata	Columba livia	MPN domain-containing protein, partial [Columba livia].
512916480	RAMA+JAB	Prok-JAB	Mpnd	438	eukaryota>metazoa>chordata>vertebrata	Heterocephalus glaber	PREDICTED: MPN domain-containing protein, partial [Heterocephalus glaber].
683931016	RAMA+JAB	Prok-JAB	MPND	437	eukaryota>metazoa>chordata>vertebrata	Serinus canaria	PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein, partial [Serinus canaria].
75517321	RAMA+JAB	Prok-JAB	Mpnd	436	eukaryota>metazoa>chordata>vertebrata	Rattus norvegicus	Mpnd protein, partial [Rattus norvegicus].
556717255	RAMA+JAB	Prok-JAB	MPND	436	eukaryota>metazoa>chordata>vertebrata	Pantholops hodgsonii	PREDICTED: MPN domain-containing protein, partial [Pantholops hodgsonii].
432116860	RAMA+JAB	Prok-JAB	MDA_GLEAN10011104	435	eukaryota>metazoa>chordata>vertebrata	Myotis davidii	MPN domain-containing protein [Myotis davidii].
529444158	RAMA+JAB	JAB	MPND	435	eukaryota>metazoa>chordata>vertebrata	Falco peregrinus	PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein [Falco peregrinus].
584067554	RAMA+JAB	Prok-JAB	MPND	435	eukaryota>metazoa>chordata>vertebrata	Myotis davidii	PREDICTED: MPN domain-containing protein, partial [Myotis davidii].
74147413	RAMA+JAB	Prok-JAB	-	434	eukaryota>metazoa>chordata>vertebrata	Mus musculus	unnamed protein product [Mus musculus].
507652018	RAMA+JAB	CD99L2	Mpnd	498	eukaryota>metazoa>chordata>vertebrata	Octodon degus	PREDICTED: MPN domain-containing protein [Octodon degus].
403296246	RAMA+JAB	Prok-JAB	MPND	433	eukaryota>metazoa>chordata>vertebrata	Saimiri boliviensis boliviensis	PREDICTED: MPN domain-containing protein, partial [Saimiri boliviensis boliviensis].
560968488	RAMA+JAB	Prok-JAB	MPND	433	eukaryota>metazoa>chordata>vertebrata	Vicugna pacos	PREDICTED: MPN domain-containing protein [Vicugna pacos].
731239925	RAMA+JAB	Prok-JAB	Mpnd	433	eukaryota>metazoa>chordata>vertebrata	Fukomys damarensis	PREDICTED: MPN domain-containing protein, partial [Fukomys damarensis].
625196764	RAMA+JAB	Prok-JAB	Mpnd	494	eukaryota>metazoa>chordata>vertebrata	Cricetulus griseus	PREDICTED: MPN domain-containing protein isoform X1 [Cricetulus griseus].
431922315	RAMA+JAB	Prok-JAB	PAL_GLEAN10006063	430	eukaryota>metazoa>chordata>vertebrata	Pteropus alecto	MPN domain-containing protein, partial [Pteropus alecto].
594631561	RAMA+JAB	Prok-JAB	MPND	430	eukaryota>metazoa>chordata>vertebrata	Balaenoptera acutorostrata scammoni	PREDICTED: MPN domain-containing protein, partial [Balaenoptera acutorostrata scammoni].
641718859 RAMA+JAB Prok-JAB MPND 430 eukaryota>metazoa>chordata>vertebrata Eptesicus fuscus PREDICTED: MPN domain-containing protein [Eptesicus fuscus]. 511904656 RAMA+JAB Prok-JAB MPND 427 eukaryota>metazoa>chordata>vertebrata Mustela putorius furo PREDICTED: MPN domain-containing protein, partial [Mustela putorius furo]. 617610563 RAMA+JAB GRP MPND 494 eukaryota>metazoa>chordata>vertebrata Erinaceus europaeus PREDICTED: MPN domain-containing protein [Erinaceus europaeus]. 514442534 RAMA+JAB Prok-JAB Mpnd 422 eukaryota>metazoa>chordata>vertebrata Cavia porcellus PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein, partial [Cavia porcellus]. 558208646 RAMA+JAB Prok-JAB MPND 422 eukaryota>metazoa>chordata>vertebrata Myotis lucifugus PREDICTED: MPN domain-containing protein, partial [Myotis lucifugus]. 593726834 RAMA+JAB Prok-JAB MPND 421 eukaryota>metazoa>chordata>vertebrata Physeter catodon PREDICTED: MPN domain-containing protein [Physeter catodon]. 297275824 RAMA+JAB Gly_rich LOC721854 420 eukaryota>metazoa>chordata>vertebrata Macaca mulatta PREDICTED: MPN domain-containing protein-like [Macaca mulatta]. 74186431 RAMA+JAB Prok-JAB - 417 eukaryota>metazoa>chordata>vertebrata Mus musculus unnamed protein product, partial [Mus musculus]. 677968406 RAMA+JAB Adeno_terminal MPND 417 eukaryota>metazoa>chordata>vertebrata Acanthisitta chloris PREDICTED: MPN domain-containing protein, partial [Acanthisitta chloris]. 704318525 RAMA+JAB Prok-JAB MPND 417 eukaryota>metazoa>chordata>vertebrata Caprimulgus carolinensis PREDICTED: MPN domain-containing protein, partial [Caprimulgus carolinensis]. 674057366 RAMA+JAB Prok-JAB Mpnd 496 eukaryota>metazoa>chordata>vertebrata Nannospalax galili PREDICTED: MPN domain-containing protein isoform X1 [Nannospalax galili]. 31542699 RAMA+JAB DUF2763 MPND 471 eukaryota>metazoa>chordata>vertebrata Homo sapiens MPN domain-containing protein isoform 1 [Homo sapiens]. 
551488218 RAMA+JAB JAB LOC102231554 454 eukaryota>metazoa>chordata>vertebrata>actinopterygii Xiphophorus maculatus PREDICTED: MPN domain-containing protein-like [Xiphophorus maculatus]. 642090869 RAMA+JAB JAB GSONMT00029460001 469 eukaryota>metazoa>chordata>vertebrata>actinopterygii Oncorhynchus mykiss unnamed protein product [Oncorhynchus mykiss]. 47215175 RAMA+JAB Prok-JAB GSTEN:00020207:G:001 466 eukaryota>metazoa>chordata>vertebrata>actinopterygii Tetraodon nigroviridis unnamed protein product, partial [Tetraodon nigroviridis]. 742103817 RAMA+JAB JAB mpnd 459 eukaryota>metazoa>chordata>vertebrata>actinopterygii Esox lucius PREDICTED: MPN domain-containing protein [Esox lucius]. 115497614 RAMA+JAB JAB mpnd 458 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio MPN domain-containing protein [Danio rerio]. 410921370 RAMA+JAB Prok-JAB mpnd 454 eukaryota>metazoa>chordata>vertebrata>actinopterygii Takifugu rubripes PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein [Takifugu rubripes]. 432853455 RAMA+JAB JAB mpnd 454 eukaryota>metazoa>chordata>vertebrata>actinopterygii Oryzias latipes PREDICTED: MPN domain-containing protein [Oryzias latipes]. 498992415 RAMA+JAB JAB mpnd 454 eukaryota>metazoa>chordata>vertebrata>actinopterygii Maylandia zebra PREDICTED: MPN domain-containing protein [Maylandia zebra]. 542212081 RAMA+JAB JAB LOC100696290 454 eukaryota>metazoa>chordata>vertebrata>actinopterygii Oreochromis niloticus PREDICTED: MPN domain-containing protein-like [Oreochromis niloticus]. 583990359 RAMA+JAB JAB LOC102785695 454 eukaryota>metazoa>chordata>vertebrata>actinopterygii Neolamprologus brichardi PREDICTED: MPN domain-containing protein-like [Neolamprologus brichardi]. 657555408 RAMA+JAB JAB mpnd 454 eukaryota>metazoa>chordata>vertebrata>actinopterygii Stegastes partitus PREDICTED: MPN domain-containing protein [Stegastes partitus]. 
736173885 RAMA+JAB JAB mpnd 454 eukaryota>metazoa>chordata>vertebrata>actinopterygii Notothenia coriiceps PREDICTED: MPN domain-containing protein [Notothenia coriiceps]. 617421732 RAMA+JAB JAB mpnd 451 eukaryota>metazoa>chordata>vertebrata>actinopterygii Poecilia formosa PREDICTED: MPN domain-containing protein isoform X1 [Poecilia formosa]. 657740336 RAMA+JAB Prok-JAB mpnd 449 eukaryota>metazoa>chordata>vertebrata>actinopterygii Cynoglossus semilaevis PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein [Cynoglossus semilaevis]. 734603143 RAMA+JAB JAB mpnd 622 eukaryota>metazoa>chordata>vertebrata>actinopterygii Larimichthys crocea PREDICTED: MPN domain-containing protein [Larimichthys crocea]. 617421735 RAMA+JAB Prok-JAB mpnd 365 eukaryota>metazoa>chordata>vertebrata>actinopterygii Poecilia formosa PREDICTED: MPN domain-containing protein isoform X2 [Poecilia formosa]. 573904867 RAMA+JAB Prok-JAB LOC102696675 485 eukaryota>metazoa>chordata>vertebrata>actinopterygii Lepisosteus oculatus PREDICTED: MPN domain-containing protein-like [Lepisosteus oculatus]. 156394960 RAMA+JAB Prok-JAB NEMVEDRAFT_v1g22327 423 eukaryota>metazoa>cnidaria Nematostella vectensis predicted protein, partial [Nematostella vectensis]. 449665213 RAMA+JAB JAB LOC100209039 529 eukaryota>metazoa>cnidaria Hydra vulgaris PREDICTED: MPN domain-containing protein-like [Hydra vulgaris]. 321479075 RAMA+JAB JAB DAPPUDRAFT_23970 413 eukaryota>metazoa>crustacea Daphnia pulex hypothetical protein DAPPUDRAFT_23970, partial [Daphnia pulex]. 390342613 RAMA+JAB JAB LOC587071 476 eukaryota>metazoa>echinodermata Strongylocentrotus purpuratus PREDICTED: MPN domain-containing protein [Strongylocentrotus purpuratus]. 170038365 RAMA+JAB Prok-JAB CpipJ_CPIJ005266 268 eukaryota>metazoa>hexapoda Culex quinquefasciatus MPN domain-containing protein [Culex quinquefasciatus]. 
755873328 RAMA+JAB DUF755 LOC101887400 1459 eukaryota>metazoa>hexapoda Musca domestica PREDICTED: MPN domain-containing protein CG4751 [Musca domestica]. 195030430 RAMA+JAB E_Pc_C Dgri_GH10765 1444 eukaryota>metazoa>hexapoda Drosophila grimshawi GH10765 [Drosophila grimshawi]. 198474706 RAMA+JAB DUF4557 Dpse_GA18404 1442 eukaryota>metazoa>hexapoda Drosophila pseudoobscura pseudoobscura GA18404 [Drosophila pseudoobscura pseudoobscura]. 195118722 RAMA+JAB DUF2968 Dmoj_GI20608 1441 eukaryota>metazoa>hexapoda Drosophila mojavensis GI20608 [Drosophila mojavensis]. 194759065 RAMA+JAB DUF755 Dana_GF14762 1384 eukaryota>metazoa>hexapoda Drosophila ananassae GF14762 [Drosophila ananassae]. 195434072 RAMA+JAB Secretin_N_2 Dwil_GK14895 1432 eukaryota>metazoa>hexapoda Drosophila willistoni GK14895 [Drosophila willistoni]. 19921138 RAMA+JAB TSA Dmel_CG4751 1412 eukaryota>metazoa>hexapoda Drosophila melanogaster CG4751 [Drosophila melanogaster]. 195340083 RAMA+JAB Atrophin-1 Dsec_GM18929 1410 eukaryota>metazoa>hexapoda Drosophila sechellia GM18929 [Drosophila sechellia]. 646722843 RAMA+JAB Vicilin_N L798_11172 678 eukaryota>metazoa>hexapoda Zootermopsis nevadensis MPN domain-containing protein [Zootermopsis nevadensis]. 195472102 RAMA+JAB TSA Dyak_GE18516 1410 eukaryota>metazoa>hexapoda Drosophila yakuba GE18516 [Drosophila yakuba]. 194861797 RAMA+JAB Med15 Dere_GG23709 1407 eukaryota>metazoa>hexapoda Drosophila erecta GG23709 [Drosophila erecta]. 478259688 RAMA+JAB JAB YQE_03995 746 eukaryota>metazoa>hexapoda Dendroctonus ponderosae hypothetical protein YQE_03995, partial [Dendroctonus ponderosae]. 91083749 RAMA+JAB Prok-JAB LOC659985 650 eukaryota>metazoa>hexapoda Tribolium castaneum PREDICTED: MPN domain-containing protein CG4751 isoform X1 [Tribolium castaneum]. 195578469 RAMA+JAB Prok-JAB Dsim_GD23765 1393 eukaryota>metazoa>hexapoda Drosophila simulans GD23765 [Drosophila simulans]. 
768415170 RAMA+JAB Prok-JAB LOC105380298 900 eukaryota>metazoa>hexapoda Plutella xylostella PREDICTED: MPN domain-containing protein CG4751 [Plutella xylostella]. 751770382 RAMA+JAB JAB LOC105223148 1201 eukaryota>metazoa>hexapoda Bactrocera dorsalis PREDICTED: MPN domain-containing protein CG4751 [Bactrocera dorsalis]. 751478045 RAMA+JAB JAB LOC105219911 1189 eukaryota>metazoa>hexapoda Bactrocera cucurbitae PREDICTED: MPN domain-containing protein CG4751 [Bactrocera cucurbitae]. 357602867 RAMA+JAB Prok-JAB KGM_00027 900 eukaryota>metazoa>hexapoda Danaus plexippus hypothetical protein KGM_00027 [Danaus plexippus]. 641676301 RAMA+JAB JAB LOC100164705 539 eukaryota>metazoa>hexapoda Acyrthosiphon pisum PREDICTED: MPN domain-containing protein isoform X1 [Acyrthosiphon pisum]. 498932490 RAMA+JAB JAB LOC101449034 1174 eukaryota>metazoa>hexapoda Ceratitis capitata PREDICTED: MPN domain-containing protein CG4751 isoform X1 [Ceratitis capitata]. 568252643 RAMA+JAB T4SS AND_006389 1600 eukaryota>metazoa>hexapoda Anopheles darlingi hypothetical protein AND_006389 [Anopheles darlingi]. 668454611 RAMA+JAB OEP ZHAS_00010716 1593 eukaryota>metazoa>hexapoda Anopheles sinensis hypothetical protein ZHAS_00010716 [Anopheles sinensis]. 642924394 RAMA+JAB Prok-JAB LOC659985 580 eukaryota>metazoa>hexapoda Tribolium castaneum PREDICTED: MPN domain-containing protein CG4751 isoform X2 [Tribolium castaneum]. 158299477 RAMA+JAB Gly-zipper_OmpA AgaP_AGAP008858 1417 eukaryota>metazoa>hexapoda Anopheles gambiae str. PEST AGAP008858-PA [Anopheles gambiae str. PEST]. 157117027 RAMA+JAB DUF3915 AaeL_AAEL007827 1313 eukaryota>metazoa>hexapoda Aedes aegypti AAEL007827-PA [Aedes aegypti]. 512926144 RAMA+JAB Prok-JAB LOC101745255 934 eukaryota>metazoa>hexapoda Bombyx mori PREDICTED: MPN domain-containing protein CG4751 isoform X1 [Bombyx mori]. 
641676305 RAMA+JAB JAB LOC100164705 471 eukaryota>metazoa>hexapoda Acyrthosiphon pisum PREDICTED: MPN domain-containing protein isoform X2 [Acyrthosiphon pisum]. 195385142 RAMA+JAB Herpes_BLLF1 Dvir_GJ13201 1399 eukaryota>metazoa>hexapoda Drosophila virilis GJ13201 [Drosophila virilis]. 157141845 RAMA+JAB SprA-related AaeL_AAEL015398 866 eukaryota>metazoa>hexapoda Aedes aegypti AAEL015398-PA, partial [Aedes aegypti]. 195148332 RAMA+JAB DUF4557 Dper_GL19545 1441 eukaryota>metazoa>hexapoda Drosophila persimilis GL19545 [Drosophila persimilis]. 676438501 RAMA+JAB JAB LOTGIDRAFT_140609 434 eukaryota>metazoa>mollusca Lottia gigantea hypothetical protein LOTGIDRAFT_140609 [Lottia gigantea]. 524883584 RAMA+JAB Prok-JAB LOC101862270 492 eukaryota>metazoa>mollusca Aplysia californica PREDICTED: MPN domain-containing protein-like [Aplysia californica]. 762155186 RAMA+JAB RNA_pol_3_Rpc31 LOC105319737 534 eukaryota>metazoa>mollusca Crassostrea gigas PREDICTED: MPN domain-containing protein-like isoform X1 [Crassostrea gigas]. 762155188 RAMA+JAB JAB LOC105319737 523 eukaryota>metazoa>mollusca Crassostrea gigas PREDICTED: MPN domain-containing protein-like isoform X2 [Crassostrea gigas]. 196002159 RAMA+JAB Prok-JAB TRIADDRAFT_54406 509 eukaryota>metazoa>placozoa Trichoplax adhaerens hypothetical protein TRIADDRAFT_54406 [Trichoplax adhaerens]. 255085490 RAMA+JAB JAB MICPUN_68730 333 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein, partial [Micromonas sp. RCC299]. 693497512 RAMA+JAB Prok-JAB OT_ostta12g00050 540 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri Tryptophan synthase beta subunit-like PLP-dependent enzymes superfamily [Ostreococcus tauri]. 145352800 RAMA+JAB Prok-JAB OSTLU_26664 458 eukaryota>viridiplantae>chlorophyta Ostreococcus lucimarinus CCE9901 predicted protein [Ostreococcus lucimarinus CCE9901]. 
303282299 RAMA+JAB Prok-JAB MICPUCDRAFT_60251 603 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 302836718 RAMA+JAB Cnd2 VOLCADRAFT_117378 903 eukaryota>viridiplantae>chlorophyta Volvox carteri f. nagariensis hypothetical protein VOLCADRAFT_117378, partial [Volvox carteri f. nagariensis]. 545371515 RAMA+JAB DUF2076 COCSUDRAFT_46372 733 eukaryota>viridiplantae>chlorophyta Coccomyxa subellipsoidea C-169 hypothetical protein COCSUDRAFT_46372 [Coccomyxa subellipsoidea C-169]. 541962068 RAMA+JAB Prok-JAB FSD1 871 eukaryota>metazoa>chordata>vertebrata Falco cherrug PREDICTED: fibronectin type III and SPRY domain-containing protein 1 [Falco cherrug].
# 94; ANKRD31-like
670997993 ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 2017 eukaryota>metazoa>chordata>vertebrata Ursus maritimus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Ursus maritimus]. 676274142 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 H920_09890 2013 eukaryota>metazoa>chordata>vertebrata Fukomys damarensis Ankyrin repeat domain-containing protein 31 [Fukomys damarensis]. 545185913 ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1993 eukaryota>metazoa>chordata>vertebrata Equus caballus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Equus caballus]. 742156297 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1957 eukaryota>metazoa>chordata>vertebrata Bison bison bison PREDICTED: putative ankyrin repeat domain-containing protein 31 [Bison bison bison]. 594655164 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1942 eukaryota>metazoa>chordata>vertebrata Balaenoptera acutorostrata scammoni PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Balaenoptera acutorostrata scammoni]. 528961800 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1940 eukaryota>metazoa>chordata>vertebrata Bos taurus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Bos taurus]. 
741883924 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1940 eukaryota>metazoa>chordata>vertebrata Bos taurus PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Bos taurus]. 694910754 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1931 eukaryota>metazoa>chordata>vertebrata Pan troglodytes PREDICTED: putative ankyrin repeat domain-containing protein 31 [Pan troglodytes]. 767935664 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1931 eukaryota>metazoa>chordata>vertebrata Homo sapiens PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Homo sapiens]. 675645695 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1930 eukaryota>metazoa>chordata>vertebrata Callithrix jacchus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Callithrix jacchus]. 767935666 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1930 eukaryota>metazoa>chordata>vertebrata Homo sapiens PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Homo sapiens]. 675645697 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1929 eukaryota>metazoa>chordata>vertebrata Callithrix jacchus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Callithrix jacchus]. 593725202 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1922 eukaryota>metazoa>chordata>vertebrata Physeter catodon PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Physeter catodon]. 426233803 ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1920 eukaryota>metazoa>chordata>vertebrata Ovis aries PREDICTED: putative ankyrin repeat domain-containing protein 31 [Ovis aries]. 532027299 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1913 eukaryota>metazoa>chordata>vertebrata Microtus ochrogaster PREDICTED: putative ankyrin repeat domain-containing protein 31 [Microtus ochrogaster]. 
752395703 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1913 eukaryota>metazoa>chordata>vertebrata Ailuropoda melanoleuca PREDICTED: putative ankyrin repeat domain-containing protein 31 [Ailuropoda melanoleuca]. 767935668 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1907 eukaryota>metazoa>chordata>vertebrata Homo sapiens PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Homo sapiens]. 675645699 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1906 eukaryota>metazoa>chordata>vertebrata Callithrix jacchus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Callithrix jacchus]. 431907833 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 PAL_GLEAN10024895 1894 eukaryota>metazoa>chordata>vertebrata Pteropus alecto Ankyrin repeat domain-containing protein 31 [Pteropus alecto]. 556748271 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1889 eukaryota>metazoa>chordata>vertebrata Pantholops hodgsonii PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Pantholops hodgsonii]. 731240079 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1883 eukaryota>metazoa>chordata>vertebrata Fukomys damarensis PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Fukomys damarensis]. 731240081 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1882 eukaryota>metazoa>chordata>vertebrata Fukomys damarensis PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Fukomys damarensis]. 426384418 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1881 eukaryota>metazoa>chordata>vertebrata Gorilla gorilla gorilla PREDICTED: putative ankyrin repeat domain-containing protein 31 [Gorilla gorilla gorilla]. 471368209 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 LOC101360661 1877 eukaryota>metazoa>chordata>vertebrata Trichechus manatus latirostris PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Trichechus manatus latirostris]. 
544437792 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1877 eukaryota>metazoa>chordata>vertebrata Macaca fascicularis PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Macaca fascicularis]. 544437794 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1876 eukaryota>metazoa>chordata>vertebrata Macaca fascicularis PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Macaca fascicularis]. 635028857 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1876 eukaryota>metazoa>chordata>vertebrata Chlorocebus sabaeus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Chlorocebus sabaeus]. 297294549 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1875 eukaryota>metazoa>chordata>vertebrata Macaca mulatta PREDICTED: ankyrin repeat domain-containing protein 31-like [Macaca mulatta]. 544437796 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1875 eukaryota>metazoa>chordata>vertebrata Macaca fascicularis PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Macaca fascicularis]. 635028859 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1875 eukaryota>metazoa>chordata>vertebrata Chlorocebus sabaeus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Chlorocebus sabaeus]. 332233855 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1874 eukaryota>metazoa>chordata>vertebrata Nomascus leucogenys PREDICTED: putative ankyrin repeat domain-containing protein 31 [Nomascus leucogenys]. 512006126 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1874 eukaryota>metazoa>chordata>vertebrata Mustela putorius furo PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Mustela putorius furo]. 
634855178 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1874 eukaryota>metazoa>chordata>vertebrata Orycteropus afer afer PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Orycteropus afer afer]. 767935670 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1874 eukaryota>metazoa>chordata>vertebrata Homo sapiens PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X4 [Homo sapiens]. 256574792 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1873 eukaryota>metazoa>chordata>vertebrata Homo sapiens putative ankyrin repeat domain-containing protein 31 [Homo sapiens]. 397478346 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1873 eukaryota>metazoa>chordata>vertebrata Pan paniscus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Pan paniscus]. 472393840 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1873 eukaryota>metazoa>chordata>vertebrata Odobenus rosmarus divergens PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Odobenus rosmarus divergens]. 532086706 ANK+ANK+ANK+ANK+ANK+SbcC+ANK+ANK+RAMA Ank_3 Ankrd31 1869 eukaryota>metazoa>chordata>vertebrata Ictidomys tridecemlineatus PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Ictidomys tridecemlineatus]. 585166062 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1869 eukaryota>metazoa>chordata>vertebrata Leptonychotes weddellii PREDICTED: putative ankyrin repeat domain-containing protein 31 [Leptonychotes weddellii]. 351698287 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Apc3 GW7_02855 1868 eukaryota>metazoa>chordata>vertebrata Heterocephalus glaber Ankyrin repeat domain-containing protein 31 [Heterocephalus glaber]. 403256456 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1868 eukaryota>metazoa>chordata>vertebrata Saimiri boliviensis boliviensis PREDICTED: putative ankyrin repeat domain-containing protein 31 [Saimiri boliviensis boliviensis]. 
560950350 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1868 eukaryota>metazoa>chordata>vertebrata Vicugna pacos PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Vicugna pacos]. 560927924 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1867 eukaryota>metazoa>chordata>vertebrata Camelus ferus PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Camelus ferus]. 743708583 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1867 eukaryota>metazoa>chordata>vertebrata Camelus bactrianus PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Camelus bactrianus]. 744549800 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1867 eukaryota>metazoa>chordata>vertebrata Camelus dromedarius PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Camelus dromedarius]. 759163878 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA+RAMA Ank_3 ANKRD31 1865 eukaryota>metazoa>chordata>vertebrata Pteropus vampyrus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Pteropus vampyrus]. 586560310 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA+RAMA Ank_3 ANKRD31 1863 eukaryota>metazoa>chordata>vertebrata Pteropus alecto PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Pteropus alecto]. 344272374 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1861 eukaryota>metazoa>chordata>vertebrata Loxodonta africana PREDICTED: putative ankyrin repeat domain-containing protein 31 [Loxodonta africana]. 507934536 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA+RAMA Ank_3 ANKRD31 1859 eukaryota>metazoa>chordata>vertebrata Condylura cristata PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Condylura cristata]. 
602699333 ANK+ANK+ANK+SbcC+ANK+ANK+ANK+SbcC+ANK+ANK+RAMA Ank_3 ANKRD31 1859 eukaryota>metazoa>chordata>vertebrata Lipotes vexillifer PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Lipotes vexillifer]. 395825692 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1857 eukaryota>metazoa>chordata>vertebrata Otolemur garnettii PREDICTED: ankyrin repeat domain-containing protein 31 [Otolemur garnettii]. 466014300 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1857 eukaryota>metazoa>chordata>vertebrata Orcinus orca PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Orcinus orca]. 568899965 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1857 eukaryota>metazoa>chordata>vertebrata Mus musculus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Mus musculus]. 568984809 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1857 eukaryota>metazoa>chordata>vertebrata Mus musculus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Mus musculus]. 640824341 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1857 eukaryota>metazoa>chordata>vertebrata Tarsius syrichta PREDICTED: putative ankyrin repeat domain-containing protein 31 [Tarsius syrichta]. 568899967 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1856 eukaryota>metazoa>chordata>vertebrata Mus musculus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Mus musculus]. 568984811 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1856 eukaryota>metazoa>chordata>vertebrata Mus musculus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Mus musculus]. 655893862 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1856 eukaryota>metazoa>chordata>vertebrata Oryctolagus cuniculus PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Oryctolagus cuniculus]. 
755691260 ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1856 eukaryota>metazoa>chordata>vertebrata Felis catus PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Felis catus]. 478492470 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1855 eukaryota>metazoa>chordata>vertebrata Ceratotherium simum simum PREDICTED: putative ankyrin repeat domain-containing protein 31 [Ceratotherium simum simum]. 544437798 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1853 eukaryota>metazoa>chordata>vertebrata Macaca fascicularis PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X4 [Macaca fascicularis]. 664736023 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1850 eukaryota>metazoa>chordata>vertebrata Equus przewalskii PREDICTED: putative ankyrin repeat domain-containing protein 31 [Equus przewalskii]. 345798574 ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1848 eukaryota>metazoa>chordata>vertebrata Canis lupus familiaris PREDICTED: putative ankyrin repeat domain-containing protein 31 [Canis lupus familiaris]. 296483786 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 BOS_10089 1847 eukaryota>metazoa>chordata>vertebrata Bos taurus TPA: ankyrin repeat domain 31 [Bos taurus]. 511894632 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1845 eukaryota>metazoa>chordata>vertebrata Mustela putorius furo PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Mustela putorius furo]. 533143043 ANK+ANK+MED12+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1844 eukaryota>metazoa>chordata>vertebrata Chinchilla lanigera PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Chinchilla lanigera]. 505839474 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1841 eukaryota>metazoa>chordata>vertebrata Sorex araneus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Sorex araneus]. 
554526760 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1838 eukaryota>metazoa>chordata>vertebrata Myotis brandtii PREDICTED: putative ankyrin repeat domain-containing protein 31 [Myotis brandtii]. 512830241 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1833 eukaryota>metazoa>chordata>vertebrata Heterocephalus glaber PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Heterocephalus glaber]. 584062428 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1833 eukaryota>metazoa>chordata>vertebrata Myotis davidii PREDICTED: putative ankyrin repeat domain-containing protein 31 [Myotis davidii]. 548481845 ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1826 eukaryota>metazoa>chordata>vertebrata Capra hircus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Capra hircus]. 555956017 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1825 eukaryota>metazoa>chordata>vertebrata Bos mutus PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Bos mutus]. 594102027 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1824 eukaryota>metazoa>chordata>vertebrata Bubalus bubalis PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Bubalus bubalis]. 585716479 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1822 eukaryota>metazoa>chordata>vertebrata Elephantulus edwardii PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Elephantulus edwardii]. 767935672 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1815 eukaryota>metazoa>chordata>vertebrata Homo sapiens PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X5 [Homo sapiens]. 504147955 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1813 eukaryota>metazoa>chordata>vertebrata Ochotona princeps PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Ochotona princeps]. 
586487981 ANK+ANK+ANK+ANK+ANK+ANK+SbcC+ANK+ANK+RAMA Ank_3 ANKRD31 1811 eukaryota>metazoa>chordata>vertebrata Chrysochloris asiatica PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Chrysochloris asiatica]. 589946321 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1806 eukaryota>metazoa>chordata>vertebrata Peromyscus maniculatus bairdii PREDICTED: putative ankyrin repeat domain-containing protein 31 [Peromyscus maniculatus bairdii]. 507697054 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1804 eukaryota>metazoa>chordata>vertebrata Octodon degus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Octodon degus]. 685546121 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1797 eukaryota>metazoa>chordata>vertebrata Papio anubis PREDICTED: putative ankyrin repeat domain-containing protein 31 [Papio anubis]. 617548503 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_5 ANKRD31 1795 eukaryota>metazoa>chordata>vertebrata Erinaceus europaeus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Erinaceus europaeus]. 674044047 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+MED7+ANK+ANK+RAMA Ank_3 Ankrd31 1777 eukaryota>metazoa>chordata>vertebrata Nannospalax galili PREDICTED: putative ankyrin repeat domain-containing protein 31 [Nannospalax galili]. 348551138 ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1776 eukaryota>metazoa>chordata>vertebrata Cavia porcellus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Cavia porcellus]. 558097829 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1762 eukaryota>metazoa>chordata>vertebrata Myotis lucifugus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Myotis lucifugus]. 625212140 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1753 eukaryota>metazoa>chordata>vertebrata Cricetulus griseus PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 isoform X1 [Cricetulus griseus]. 
507539698 ANK+ANK+ANK+ANK+ANK+ANK+EVH1+ANK+ANK+RAMA Ank_3 LOC101605591 1748 eukaryota>metazoa>chordata>vertebrata Jaculus jaculus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Jaculus jaculus]. 524922599 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1744 eukaryota>metazoa>chordata>vertebrata Mesocricetus auratus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Mesocricetus auratus]. 667286971 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1743 eukaryota>metazoa>chordata>vertebrata Galeopterus variegatus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Galeopterus variegatus]. 724797638 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1740 eukaryota>metazoa>chordata>vertebrata Rhinopithecus roxellana PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Rhinopithecus roxellana]. 591306729 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1703 eukaryota>metazoa>chordata>vertebrata Panthera tigris altaica PREDICTED: putative ankyrin repeat domain-containing protein 31 [Panthera tigris altaica]. 731240083 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1695 eukaryota>metazoa>chordata>vertebrata Fukomys damarensis PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Fukomys damarensis]. 297675470 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1693 eukaryota>metazoa>chordata>vertebrata Pongo abelii PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Pongo abelii]. 507534796 ANK+ANK+SWC3+ANK+ANK+RAMA Ank_3 Ankrd31 1655 eukaryota>metazoa>chordata>vertebrata Jaculus jaculus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Jaculus jaculus]. 625271191 ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 Ankrd31 1345 eukaryota>metazoa>chordata>vertebrata Cricetulus griseus PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 isoform X2, partial [Cricetulus griseus]. 
700388987 ANK+ANK+ANK+ANK+ANK+ANK+ANK+SbcC+ANK+ANK+RAMA Ank_3 ANKRD31 1664 eukaryota>metazoa>chordata>vertebrata Opisthocomus hoazin PREDICTED: putative ankyrin repeat domain-containing protein 31 [Opisthocomus hoazin]. 529449538 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1653 eukaryota>metazoa>chordata>vertebrata Falco peregrinus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Falco peregrinus]. 690452132 ANK+ANK+ANK+ANK+ANK+ANK+SbcC+ANK+ANK+RAMA Ank_3 ANKRD31 1625 eukaryota>metazoa>chordata>vertebrata Pygoscelis adeliae PREDICTED: putative ankyrin repeat domain-containing protein 31 [Pygoscelis adeliae]. 768382274 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1611 eukaryota>metazoa>chordata>vertebrata Aquila chrysaetos canadensis PREDICTED: putative ankyrin repeat domain-containing protein 31 [Aquila chrysaetos canadensis]. 543729927 ANK+ANK+ANK+ANK+ANK+ANK+ANK+Imm22+ANK+ANK+RAMA Ank_3 ANKRD31 1608 eukaryota>metazoa>chordata>vertebrata Columba livia PREDICTED: putative ankyrin repeat domain-containing protein 31 [Columba livia]. 729767935 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1593 eukaryota>metazoa>chordata>vertebrata Haliaeetus leucocephalus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Haliaeetus leucocephalus]. 541975648 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1578 eukaryota>metazoa>chordata>vertebrata Falco cherrug PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Falco cherrug]. 700426650 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1564 eukaryota>metazoa>chordata>vertebrata Leptosomus discolor PREDICTED: putative ankyrin repeat domain-containing protein 31 [Leptosomus discolor]. 696971827 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1552 eukaryota>metazoa>chordata>vertebrata Cuculus canorus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Cuculus canorus]. 
513229707 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1548 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X8 [Gallus gallus]. 513229710 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1547 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X9 [Gallus gallus]. 513229713 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1545 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X10 [Gallus gallus]. 527247521 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1544 eukaryota>metazoa>chordata>vertebrata Melopsittacus undulatus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Melopsittacus undulatus]. 513229716 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1538 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X7 [Gallus gallus]. 686604346 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1520 eukaryota>metazoa>chordata>vertebrata Aptenodytes forsteri PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Aptenodytes forsteri]. 513229721 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1514 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X11 [Gallus gallus]. 701430760 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1492 eukaryota>metazoa>chordata>vertebrata Chaetura pelagica PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Chaetura pelagica]. 663291212 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1491 eukaryota>metazoa>chordata>vertebrata Calypte anna PREDICTED: putative ankyrin repeat domain-containing protein 31 [Calypte anna]. 
675413736 ANK+ANK+ANK+Imm22+ANK+ANK+RAMA Ank_3 ANKRD31 1470 eukaryota>metazoa>chordata>vertebrata Manacus vitellinus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Manacus vitellinus]. 701288139 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1437 eukaryota>metazoa>chordata>vertebrata Nestor notabilis PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Nestor notabilis]. 525026971 ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1415 eukaryota>metazoa>chordata>vertebrata Ficedula albicollis PREDICTED: putative ankyrin repeat domain-containing protein 31 [Ficedula albicollis]. 695155706 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1400 eukaryota>metazoa>chordata>vertebrata Phalacrocorax carbo PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Phalacrocorax carbo]. 705661395 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1400 eukaryota>metazoa>chordata>vertebrata Chlamydotis macqueenii PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Chlamydotis macqueenii]. 697830380 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1397 eukaryota>metazoa>chordata>vertebrata Egretta garzetta PREDICTED: putative ankyrin repeat domain-containing protein 31 [Egretta garzetta]. 542159298 ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1389 eukaryota>metazoa>chordata>vertebrata Zonotrichia albicollis PREDICTED: putative ankyrin repeat domain-containing protein 31 [Zonotrichia albicollis]. 719764347 ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1389 eukaryota>metazoa>chordata>vertebrata Tinamus guttatus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Tinamus guttatus]. 727001773 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1378 eukaryota>metazoa>chordata>vertebrata Corvus cornix cornix PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X5 [Corvus cornix cornix]. 
669287973 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1356 eukaryota>metazoa>chordata>vertebrata Corvus brachyrhynchos PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Corvus brachyrhynchos]. 683923285 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1353 eukaryota>metazoa>chordata>vertebrata Serinus canaria PREDICTED: putative ankyrin repeat domain-containing protein 31 [Serinus canaria]. 513229724 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1333 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X12 [Gallus gallus]. 513229727 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1330 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X13 [Gallus gallus]. 704245972 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1304 eukaryota>metazoa>chordata>vertebrata Eurypyga helias PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31, partial [Eurypyga helias]. 699659375 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1291 eukaryota>metazoa>chordata>vertebrata Picoides pubescens PREDICTED: putative ankyrin repeat domain-containing protein 31 [Picoides pubescens]. 543281441 ANK+ANK+ANK+ANK+FUNDEAMN+ANK+ANK+RAMA Ank ANKRD31 1270 eukaryota>metazoa>chordata>vertebrata Geospiza fortis PREDICTED: putative ankyrin repeat domain-containing protein 31 [Geospiza fortis]. 727001756 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1250 eukaryota>metazoa>chordata>vertebrata Corvus cornix cornix PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Corvus cornix cornix]. 543357448 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1246 eukaryota>metazoa>chordata>vertebrata Pseudopodoces humilis PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Pseudopodoces humilis]. 
727001759 ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1198 eukaryota>metazoa>chordata>vertebrata Corvus cornix cornix PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Corvus cornix cornix]. 706133765 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1141 eukaryota>metazoa>chordata>vertebrata Colius striatus PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Colius striatus]. 700345054 ANK+ANK+ANK+ANK+ANK+RAMA Ank ANKRD31 1078 eukaryota>metazoa>chordata>vertebrata Haliaeetus albicilla PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Haliaeetus albicilla]. 675626531 ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1006 eukaryota>metazoa>chordata>vertebrata Merops nubicus PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Merops nubicus]. 697030258 ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1002 eukaryota>metazoa>chordata>vertebrata Fulmarus glacialis PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Fulmarus glacialis]. 698383920 ANK+ANK+ANK+SbcC+ANK+RAMA Ank_3 ANKRD31 998 eukaryota>metazoa>chordata>vertebrata Gavia stellata PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Gavia stellata]. 701383969 ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 858 eukaryota>metazoa>chordata>vertebrata Tyto alba PREDICTED: putative ankyrin repeat domain-containing protein 31 [Tyto alba]. 694641918 ANK+ANK+ANK+ANK+RAMA Ank LOC104029345 789 eukaryota>metazoa>chordata>vertebrata Pelecanus crispus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Pelecanus crispus]. 678179736 ANK+ANK+ANK+SbcC+ANK+RAMA Ank_3 N328_02606 744 eukaryota>metazoa>chordata>vertebrata Gavia stellata Putative ankyrin repeat domain-containing protein 31, partial [Gavia stellata]. 679141382 ANK+ANK+ANK+SbcC+ANK+RAMA Ank_3 AS28_06212 743 eukaryota>metazoa>chordata>vertebrata Pygoscelis adeliae Putative ankyrin repeat domain-containing protein 31, partial [Pygoscelis adeliae]. 
675312958 ANK+ANK+ANK+ANK+RAMA Ank_3 AS27_09918 742 eukaryota>metazoa>chordata>vertebrata Aptenodytes forsteri Putative ankyrin repeat domain-containing protein 31, partial [Aptenodytes forsteri]. 676819697 ANK+ANK+ANK+ANK+RAMA Ank_3 Z169_10261 741 eukaryota>metazoa>chordata>vertebrata Egretta garzetta Putative ankyrin repeat domain-containing protein 31, partial [Egretta garzetta]. 679004453 ANK+ANK+ANK+ANK+RAMA Ank N327_04278 741 eukaryota>metazoa>chordata>vertebrata Fulmarus glacialis Putative ankyrin repeat domain-containing protein 31, partial [Fulmarus glacialis]. 676584541 ANK+ANK+ANK+ANK+RAMA Ank_3 N303_00948 740 eukaryota>metazoa>chordata>vertebrata Cuculus canorus Putative ankyrin repeat domain-containing protein 31, partial [Cuculus canorus]. 678221323 ANK+ANK+ANK+ANK+RAMA Ank_3 N308_15574 740 eukaryota>metazoa>chordata>vertebrata Struthio camelus australis Putative ankyrin repeat domain-containing protein 31, partial [Struthio camelus australis]. 449278665 ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 A306_04927 739 eukaryota>metazoa>chordata>vertebrata Columba livia Ankyrin repeat domain-containing protein 31, partial [Columba livia]. 697419024 ANK+ANK+ANK+ANK+RAMA Ank_3 N309_15508 739 eukaryota>metazoa>chordata>vertebrata Tinamus guttatus Putative ankyrin repeat domain-containing protein 31, partial [Tinamus guttatus]. 678997602 ANK+ANK+ANK+ANK+RAMA Ank_3 N326_09043 738 eukaryota>metazoa>chordata>vertebrata Eurypyga helias Putative ankyrin repeat domain-containing protein 31, partial [Eurypyga helias]. 677552062 ANK+ANK+ANK+SbcC+ANK+RAMA Ank_3 N306_04945 737 eukaryota>metazoa>chordata>vertebrata Opisthocomus hoazin Putative ankyrin repeat domain-containing protein 31, partial [Opisthocomus hoazin]. 483522412 ANK+ANK+ANK+ANK+RAMA Ank_3 Anapl_02880 735 eukaryota>metazoa>chordata>vertebrata Anas platyrhynchos Ankyrin repeat domain-containing protein 31, partial [Anas platyrhynchos]. 
677470756 ANK+ANK+ANK+ANK+RAMA Ank N334_10293 735 eukaryota>metazoa>chordata>vertebrata Pelecanus crispus Putative ankyrin repeat domain-containing protein 31, partial [Pelecanus crispus]. 676420350 ANK+ANK+ANK+ANK+RAMA Ank_3 N302_11259 734 eukaryota>metazoa>chordata>vertebrata Corvus brachyrhynchos Putative ankyrin repeat domain-containing protein 31, partial [Corvus brachyrhynchos]. 679188311 ANK+ANK+ANK+RAMA Ank_3 N305_02338 734 eukaryota>metazoa>chordata>vertebrata Manacus vitellinus Putative ankyrin repeat domain-containing protein 31, partial [Manacus vitellinus]. 676781911 ANK+ANK+ANK+ANK+RAMA Ank_3 N300_04350 732 eukaryota>metazoa>chordata>vertebrata Calypte anna Putative ankyrin repeat domain-containing protein 31, partial [Calypte anna]. 683462737 ANK+ANK+ANK+ANK+RAMA Ank N338_06128 732 eukaryota>metazoa>chordata>vertebrata Podiceps cristatus Putative ankyrin repeat domain-containing protein 31, partial [Podiceps cristatus]. 704177536 ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 730 eukaryota>metazoa>chordata>vertebrata Buceros rhinoceros silvestris PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Buceros rhinoceros silvestris]. 677395853 ANK+ANK+ANK+ANK+RAMA Ank_3 N330_10098 729 eukaryota>metazoa>chordata>vertebrata Leptosomus discolor Putative ankyrin repeat domain-containing protein 31, partial [Leptosomus discolor]. 676701437 ANK+ANK+ANK+RAMA Ank_3 N320_10473 728 eukaryota>metazoa>chordata>vertebrata Buceros rhinoceros silvestris Putative ankyrin repeat domain-containing protein 31, partial [Buceros rhinoceros silvestris]. 697455688 ANK+ANK+Tox-REase-7+ANK+RAMA Ank N301_04126 723 eukaryota>metazoa>chordata>vertebrata Charadrius vociferus Putative ankyrin repeat domain-containing protein 31, partial [Charadrius vociferus]. 678199929 ANK+ANK+ANK+ANK+RAMA Ank_3 N307_02015 717 eukaryota>metazoa>chordata>vertebrata Picoides pubescens Putative ankyrin repeat domain-containing protein 31, partial [Picoides pubescens]. 
678120939 ANK+ANK+ANK+RAMA Ank_3 M959_03623 714 eukaryota>metazoa>chordata>vertebrata Chaetura pelagica Ankyrin repeat domain-containing protein 31, partial [Chaetura pelagica]. 677066606 ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 N325_11692 713 eukaryota>metazoa>chordata>vertebrata Colius striatus Putative ankyrin repeat domain-containing protein 31, partial [Colius striatus]. 698470854 ANK+ANK+ANK+RAMA Ank_4 LOC104167505 704 eukaryota>metazoa>chordata>vertebrata Cariama cristata PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31, partial [Cariama cristata]. 677277442 ANK+ANK+ANK+RAMA Ank_3 N322_03767 687 eukaryota>metazoa>chordata>vertebrata Cariama cristata Putative ankyrin repeat domain-containing protein 31, partial [Cariama cristata]. 677432816 ANK+ANK+ANK+ANK+POTRA+RAMA Ank_3 N331_11356 673 eukaryota>metazoa>chordata>vertebrata Merops nubicus Putative ankyrin repeat domain-containing protein 31, partial [Merops nubicus]. 677380247 ANK+ANK+ANK+ANK+RAMA Ank N329_03825 667 eukaryota>metazoa>chordata>vertebrata Haliaeetus albicilla Putative ankyrin repeat domain-containing protein 31, partial [Haliaeetus albicilla]. 677447578 ANK+ANK+ANK+ANK+RAMA Ank N333_13417 663 eukaryota>metazoa>chordata>vertebrata Nestor notabilis Putative ankyrin repeat domain-containing protein 31, partial [Nestor notabilis]. 694858262 ANK+RAMA Ank_4 LOC104019302 649 eukaryota>metazoa>chordata>vertebrata Nipponia nippon PREDICTED: putative ankyrin repeat domain-containing protein 31, partial [Nipponia nippon]. 677544370 ANK+RAMA Ank_2 Y956_09939 644 eukaryota>metazoa>chordata>vertebrata Nipponia nippon Putative ankyrin repeat domain-containing protein 31, partial [Nipponia nippon]. 677115369 ANK+ANK+ANK+ANK+RAMA Ank_3 N324_07686 633 eukaryota>metazoa>chordata>vertebrata Chlamydotis macqueenii Putative ankyrin repeat domain-containing protein 31, partial [Chlamydotis macqueenii]. 
677495871 ANK+ANK+ANK+SbcC+ANK+RAMA Ank N337_13309 632 eukaryota>metazoa>chordata>vertebrata Phoenicopterus ruber ruber Putative ankyrin repeat domain-containing protein 31, partial [Phoenicopterus ruber ruber]. 677223564 ANK+RAMA Ank_2 N323_01666 624 eukaryota>metazoa>chordata>vertebrata Cathartes aura Putative ankyrin repeat domain-containing protein 31, partial [Cathartes aura]. 679210087 ANK+ANK+ANK Ank_3 N336_07707 589 eukaryota>metazoa>chordata>vertebrata Phalacrocorax carbo Putative ankyrin repeat domain-containing protein 31, partial [Phalacrocorax carbo]. 678131372 ANK+RAMA - N340_11409 575 eukaryota>metazoa>chordata>vertebrata Tauraco erythrolophus Putative ankyrin repeat domain-containing protein 31, partial [Tauraco erythrolophus]. 676240185 RAMA - N312_04000 537 eukaryota>metazoa>chordata>vertebrata Balearica regulorum gibbericeps Putative ankyrin repeat domain-containing protein 31, partial [Balearica regulorum gibbericeps]. 677478221 RAMA - N335_11233 534 eukaryota>metazoa>chordata>vertebrata Phaethon lepturus Putative ankyrin repeat domain-containing protein 31, partial [Phaethon lepturus]. 733925440 RAMA - LOC104914976 462 eukaryota>metazoa>chordata>vertebrata Meleagris gallopavo PREDICTED: putative ankyrin repeat domain-containing protein 31 [Meleagris gallopavo]. 440910685 ANK+ANK+ANK+RAMA Ank_3 M91_07742 738 eukaryota>metazoa>chordata>vertebrata Bos mutus Ankyrin repeat domain-containing protein 31 [Bos mutus]. 15207865 ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 - 733 eukaryota>metazoa>chordata>vertebrata Macaca fascicularis hypothetical protein [Macaca fascicularis]. 281339402 ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 PANDA_005463 733 eukaryota>metazoa>chordata>vertebrata Ailuropoda melanoleuca hypothetical protein PANDA_005463, partial [Ailuropoda melanoleuca]. 355691398 ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 EGK_16593 733 eukaryota>metazoa>chordata>vertebrata Macaca mulatta hypothetical protein EGK_16593 [Macaca mulatta]. 
528757498 ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 CB1_001305002 703 eukaryota>metazoa>chordata>vertebrata Camelus ferus Ankyrin repeat domain protein 11 (Ankyrin repeat-containing cofactor-1)-like protein [Camelus ferus]. 537215000 ANK+RAMA Ank_5 H671_2g8049 599 eukaryota>metazoa>chordata>vertebrata Cricetulus griseus ankyrin repeat domain-containing protein 31 [Cricetulus griseus]. 470622172 RAMA - ANKRD31 489 eukaryota>metazoa>chordata>vertebrata Tursiops truncatus PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Tursiops truncatus]. 637259745 ANK+ANK+ANK+RAMA Ank_3 ankrd31 1122 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Anolis carolinensis]. 637259749 ANK+ANK+ANK+RAMA Ank_3 ankrd31 1115 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Anolis carolinensis]. 637259753 ANK+ANK+ANK+RAMA Ank_3 ankrd31 1085 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Anolis carolinensis]. 637259758 ANK+ANK+ANK+RAMA Ank_3 ankrd31 1053 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X4 [Anolis carolinensis]. 637259762 ANK+ANK+ANK+RAMA Ank_3 ankrd31 1038 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X5 [Anolis carolinensis]. 637259766 ANK+ANK+ANK+RAMA Ank_3 ankrd31 981 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X6 [Anolis carolinensis]. 499004945 ANK+ANK+ANK+ANK+RAMA Ank_3 LOC101480116 695 eukaryota>metazoa>chordata>vertebrata>actinopterygii Maylandia zebra PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Maylandia zebra]. 
542224388 ANK+ANK+ANK+ANK+RAMA Ank_3 LOC102077672 679 eukaryota>metazoa>chordata>vertebrata>actinopterygii Oreochromis niloticus PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Oreochromis niloticus]. 768948668 ANK+ANK+ANK+RAMA Ank_3 LOC101078418 621 eukaryota>metazoa>chordata>vertebrata>actinopterygii Takifugu rubripes PREDICTED: tankyrase-like [Takifugu rubripes]. 548357001 ANK+ANK+ANK+ANK+RAMA Ank_3 LOC102212411 510 eukaryota>metazoa>chordata>vertebrata>actinopterygii Pundamilia nyererei PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Pundamilia nyererei]. 554822723 ANK+ANK+ANK+ANK+RAMA Ank_3 LOC102308989 510 eukaryota>metazoa>chordata>vertebrata>actinopterygii Haplochromis burtoni PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Haplochromis burtoni]. 632974189 ANK+ANK+ANK+ANK+ANK+RADICAL-SAM+ANK+ANK Ank_3 ankrd31 1518 eukaryota>metazoa>chordata>vertebrata Callorhinchus milii PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Callorhinchus milii]. 632974191 ANK+ANK+ANK+ANK+ANK+RADICAL-SAM+ANK+ANK Ank_3 ankrd31 1511 eukaryota>metazoa>chordata>vertebrata Callorhinchus milii PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X2 [Callorhinchus milii]. 632974193 ANK+ANK+ANK+ANK+ANK+RADICAL-SAM+ANK+ANK Ank_3 ankrd31 1425 eukaryota>metazoa>chordata>vertebrata Callorhinchus milii PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X3 [Callorhinchus milii]. 697522283 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1822 eukaryota>metazoa>chordata>vertebrata Struthio camelus australis PREDICTED: putative ankyrin repeat domain-containing protein 31 [Struthio camelus australis]. 641759975 ANK+ANK+ANK+ANK+RAMA Ank LOC101940798 1434 eukaryota>metazoa>chordata>vertebrata Chrysemys picta bellii PREDICTED: putative ankyrin repeat domain-containing protein 31 [Chrysemys picta bellii]. 
699661481 ANK+ANK+ANK+Tox-REase-7+ANK+RAMA Ank ANKRD31 1406 eukaryota>metazoa>chordata>vertebrata Charadrius vociferus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Charadrius vociferus]. 672015439 ANK+ANK+ANK+SFII-RAD3+ANK+RAMA Ank_3 Ankrd31 1048 eukaryota>metazoa>chordata>vertebrata Rattus norvegicus PREDICTED: putative ankyrin repeat domain-containing protein 31 isoform X1 [Rattus norvegicus]. 672041375 ANK+ANK+ANK+SFII-RAD3+ANK+RAMA Ank_3 Ankrd31 1177 eukaryota>metazoa>chordata>vertebrata Rattus norvegicus PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 isoform X2 [Rattus norvegicus]. 734646682 ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 LOC104937873 385 eukaryota>metazoa>chordata>vertebrata>actinopterygii Larimichthys crocea PREDICTED: ankyrin repeat domain-containing protein 11-like [Larimichthys crocea]. 602639254 ANK+ANK+ANK+ANK+ANK+RAMA TctB ANKRD31 1273 eukaryota>metazoa>chordata>vertebrata Python bivittatus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Python bivittatus]. 449514661 ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1062 eukaryota>metazoa>chordata>vertebrata Taeniopygia guttata PREDICTED: putative ankyrin repeat domain-containing protein 31 [Taeniopygia guttata]. 507649885 ANK+ANK+ANK+ANK+ANK+ANK+MND1+ANK+ANK+RAMA Ank_3 ANKRD31 1616 eukaryota>metazoa>chordata>vertebrata Echinops telfairi PREDICTED: putative ankyrin repeat domain-containing protein 31 [Echinops telfairi]. 558168115 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 2337 eukaryota>metazoa>chordata>vertebrata Pelodiscus sinensis PREDICTED: putative ankyrin repeat domain-containing protein 31 [Pelodiscus sinensis]. 465967576 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 UY3_11445 1388 eukaryota>metazoa>chordata>vertebrata Chelonia mydas Ankyrin repeat domain-containing protein 31 [Chelonia mydas]. 
521020823 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK Ank_3 D623_10035941 1209 eukaryota>metazoa>chordata>vertebrata Myotis brandtii Ankyrin repeat domain-containing protein 31 [Myotis brandtii]. 620941514 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1693 eukaryota>metazoa>chordata>vertebrata Ornithorhynchus anatinus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Ornithorhynchus anatinus]. 641695633 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_5 ANKRD31 1705 eukaryota>metazoa>chordata>vertebrata Eptesicus fuscus PREDICTED: putative ankyrin repeat domain-containing protein 31 [Eptesicus fuscus]. 557323251 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1726 eukaryota>metazoa>chordata>vertebrata Alligator sinensis PREDICTED: LOW QUALITY PROTEIN: putative ankyrin repeat domain-containing protein 31 [Alligator sinensis]. 612003933 ANK+ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 2041 eukaryota>metazoa>chordata>vertebrata Monodelphis domestica PREDICTED: putative ankyrin repeat domain-containing protein 31 [Monodelphis domestica]. 591380064 ANK+ANK+ANK+ANK+ANK+ANK+ANK+RAMA Ank_3 ANKRD31 1198 eukaryota>metazoa>chordata>vertebrata Chelonia mydas PREDICTED: putative ankyrin repeat domain-containing protein 31 [Chelonia mydas]. 688556915 ANK+ANK+ANK+ANK+ANK Ank_3 LOC103911116 847 eukaryota>metazoa>chordata>vertebrata>actinopterygii Danio rerio PREDICTED: putative ankyrin repeat domain-containing protein 31 [Danio rerio]. 395510523 ANK+ANK+ANK+ANK Ank_3 LOC100921662 1245 eukaryota>metazoa>chordata>vertebrata Sarcophilus harrisii PREDICTED: ankyrin repeat domain-containing protein 31-like [Sarcophilus harrisii]. # 16; 695436758 RAMA+PHD DDRGK PHYSODRAFT_347238 1408 eukaryota>stramenopiles Phytophthora sojae hypothetical protein PHYSODRAFT_347238 [Phytophthora sojae]. 566028887 RAMA+PHD DUF1675 F443_04913 1314 eukaryota>stramenopiles Phytophthora parasitica P1569 hypothetical protein F443_04913 [Phytophthora parasitica P1569]. 
567966240 RAMA+PHD DUF1675 L915_04781 1314 eukaryota>stramenopiles Phytophthora parasitica hypothetical protein L915_04781 [Phytophthora parasitica]. 567994957 RAMA+PHD DUF1675 L916_04727 1314 eukaryota>stramenopiles Phytophthora parasitica hypothetical protein L916_04727 [Phytophthora parasitica]. 568024215 RAMA+PHD DUF1675 L917_04629 1314 eukaryota>stramenopiles Phytophthora parasitica hypothetical protein L917_04629 [Phytophthora parasitica]. 568054639 RAMA+PHD DUF1675 L914_04731 1314 eukaryota>stramenopiles Phytophthora parasitica hypothetical protein L914_04731 [Phytophthora parasitica]. 570991851 RAMA+PHD DUF1675 F442_04975 1314 eukaryota>stramenopiles Phytophthora parasitica P10297 hypothetical protein F442_04975 [Phytophthora parasitica P10297]. 675186757 RAMA+PHD DUF1675 PPTG_09144 1314 eukaryota>stramenopiles Phytophthora parasitica INRA-310 hypothetical protein PPTG_09144 [Phytophthora parasitica INRA-310]. 566028888 RAMA+PHD DUF4407 F443_04913 1307 eukaryota>stramenopiles Phytophthora parasitica P1569 hypothetical protein, variant 1 [Phytophthora parasitica P1569]. 567966241 RAMA+PHD DUF1675 L915_04781 1307 eukaryota>stramenopiles Phytophthora parasitica hypothetical protein, variant 1 [Phytophthora parasitica]. 567994958 RAMA+PHD DUF1675 L916_04727 1307 eukaryota>stramenopiles Phytophthora parasitica hypothetical protein, variant 1 [Phytophthora parasitica]. 568024216 RAMA+PHD DUF1675 L917_04629 1307 eukaryota>stramenopiles Phytophthora parasitica hypothetical protein, variant 1 [Phytophthora parasitica]. 568054640 RAMA+PHD DUF4407 L914_04731 1307 eukaryota>stramenopiles Phytophthora parasitica hypothetical protein, variant 1 [Phytophthora parasitica]. 570991852 RAMA+PHD DUF1675 F442_04975 1307 eukaryota>stramenopiles Phytophthora parasitica P10297 hypothetical protein, variant 1 [Phytophthora parasitica P10297]. 
675186763 RAMA+PHD OmpH PPTG_09144 1307 eukaryota>stramenopiles Phytophthora parasitica INRA-310 hypothetical protein, variant 1 [Phytophthora parasitica INRA-310]. 301095381 RAMA+PHD Lipase_chap PITG_17280 1304 eukaryota>stramenopiles Phytophthora infestans T30-4 conserved hypothetical protein [Phytophthora infestans T30-4]. 669167767 RAMA+PHD DUF605 SDRG_15055 763 eukaryota>stramenopiles Saprolegnia diclina VS20 hypothetical protein SDRG_15055 [Saprolegnia diclina VS20]. 641537267 RAMA+PHD PHD SPRG_04269 758 eukaryota>stramenopiles Saprolegnia parasitica CBS 223.65 hypothetical protein SPRG_04269 [Saprolegnia parasitica CBS 223.65]. 698784133 RAMA+PHD MSP1_C H257_02952 737 eukaryota>stramenopiles Aphanomyces astaci hypothetical protein H257_02952 [Aphanomyces astaci]. 698784135 RAMA+PHD MSP1_C H257_02952 713 eukaryota>stramenopiles Aphanomyces astaci hypothetical protein, variant [Aphanomyces astaci]. 301090511 RAMA+PHD V_ATPase_I PITG_20831 1037 eukaryota>stramenopiles Phytophthora infestans T30-4 conserved hypothetical protein [Phytophthora infestans T30-4]. # 9; 470649823 RAMA - LOC101339899 237 eukaryota>metazoa>chordata>vertebrata Tursiops truncatus PREDICTED: MPN domain-containing protein-like [Tursiops truncatus]. 675365883 RAMA - X975_21913 212 eukaryota>metazoa Stegodyphus mimosarum MPN domain-containing protein, partial [Stegodyphus mimosarum]. 705695653 RAMA Adeno_terminal LOC104483765 212 eukaryota>metazoa>chordata>vertebrata Chlamydotis macqueenii PREDICTED: MPN domain-containing protein-like, partial [Chlamydotis macqueenii]. 685554980 RAMA DUF2763 LOC103886366 210 eukaryota>metazoa>chordata>vertebrata Papio anubis PREDICTED: MPN domain-containing protein-like isoform X1 [Papio anubis]. 637356646 RAMA - LOC103280867 205 eukaryota>metazoa>chordata>vertebrata Anolis carolinensis PREDICTED: MPN domain-containing protein-like, partial [Anolis carolinensis]. 
685554982 RAMA DUF2763 LOC103886366 200 eukaryota>metazoa>chordata>vertebrata Papio anubis PREDICTED: MPN domain-containing protein-like isoform X2 [Papio anubis]. 675756739 RAMA API5 LOC100401478 193 eukaryota>metazoa>chordata>vertebrata Callithrix jacchus PREDICTED: MPN domain-containing protein-like [Callithrix jacchus]. 504183573 RAMA - LOC101526050 170 eukaryota>metazoa>chordata>vertebrata Ochotona princeps PREDICTED: MPN domain-containing protein-like, partial [Ochotona princeps]. 565301529 RAMA - L345_15667 127 eukaryota>metazoa>chordata>vertebrata Ophiophagus hannah MPN domain-containing protein, partial [Ophiophagus hannah]. # 3; 308805448 RAMA - Ot06g04230 441 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri 26S proteasome regulatory complex, subunit RPN11 (ISS) [Ostreococcus tauri]. 693499678 RAMA - OT_ostta06g03960 409 eukaryota>viridiplantae>chlorophyta Ostreococcus tauri unnamed product [Ostreococcus tauri]. 145347713 RAMA - OSTLU_32332 341 eukaryota>viridiplantae>chlorophyta Ostreococcus lucimarinus CCE9901 predicted protein [Ostreococcus lucimarinus CCE9901]. # 2; 765549348 RAMA+PHD+PHD+PHD+BROMO+PHD+SJA/FYR+SET Atrophin-1 CAOG_001362 1884 eukaryota Capsaspora owczarzaki ATCC 30864 mixed-lineage leukemia protein [Capsaspora owczarzaki ATCC 30864]. 470324569 RAMA+PHD+PHD+PHD+BROMO+PHD+SJA/FYR+SET FYVE_2 CAOG_01362 1858 eukaryota Capsaspora owczarzaki ATCC 30864 mixed-lineage leukemia protein, partial [Capsaspora owczarzaki ATCC 30864]. 514687557 RAMA+PHD+PHD+PHD+PHD+SJA/FYR+SET DUF2413 PTSG_07559 2027 eukaryota>choanoflagellida Salpingoeca rosetta mixed-lineage leukemia protein [Salpingoeca rosetta]. # 2; 669313787 RAMA - M513_01753 410 eukaryota>metazoa>nematoda Trichuris suis hypothetical protein M513_01753 [Trichuris suis]. 669222357 RAMA - TTRE_0000461101 320 eukaryota>metazoa>nematoda Trichuris trichiura hypothetical protein TTRE_0000461101 [Trichuris trichiura].
# 2; 751771263 RAMA - LOC105226825 197 eukaryota>metazoa>hexapoda Bactrocera dorsalis PREDICTED: MPN domain-containing protein CG4751-like [Bactrocera dorsalis]. 751479038 RAMA - LOC105220345 195 eukaryota>metazoa>hexapoda Bactrocera cucurbitae PREDICTED: MPN domain-containing protein CG4751-like [Bactrocera cucurbitae]. # 1; 545357568 RAMA+ZFCW+BRIGHT ARID COCSUDRAFT_48713 1111 eukaryota>viridiplantae>chlorophyta Coccomyxa subellipsoidea C-169 hypothetical protein COCSUDRAFT_48713 [Coccomyxa subellipsoidea C-169]. 761967687 RAMA+ZFCW zf-CW MNEG_8869 296 eukaryota>viridiplantae>chlorophyta Monoraphidium neglectum hypothetical protein MNEG_8869 [Monoraphidium neglectum]. 545371504 RAMA+ZFCW zf-CW COCSUDRAFT_60779 229 eukaryota>viridiplantae>chlorophyta Coccomyxa subellipsoidea C-169 hypothetical protein COCSUDRAFT_60779 [Coccomyxa subellipsoidea C-169]. # 1; 676429688 RAMA - LOTGIDRAFT_237598 1826 eukaryota>metazoa>mollusca Lottia gigantea hypothetical protein LOTGIDRAFT_237598 [Lottia gigantea]. 692170605 RING+RAMA+BRCT+BRCT zf-RING_5 DI09_127p30 539 eukaryota>fungi>microsporidia Mitosporidium daphniae Rad18-like protein [Mitosporidium daphniae]. 281208107 RAMA+BRCT+BRCT Herpes_TAF50 PPL_04709 1110 eukaryota>amoebozoa>mycetozoa>dictyosteliida Polysphondylium pallidum PN500 hypothetical protein PPL_04709 [Polysphondylium pallidum PN500]. 66800521 RAMA+BRCT+BRCT DUF4175 DDB_G0293300 1217 eukaryota>amoebozoa>mycetozoa>dictyosteliida Dictyostelium discoideum AX4 hypothetical protein DDB_G0293300 [Dictyostelium discoideum AX4]. 330844042 RAMA+BRCT+BRCT DUF4175 DICPUDRAFT_158874 1093 eukaryota>amoebozoa>mycetozoa>dictyosteliida Dictyostelium purpureum hypothetical protein DICPUDRAFT_158874 [Dictyostelium purpureum]. 672817350 RAMA+PHD+PHD+PHD RAG2_PHD MVEG_11466 1018 eukaryota>fungi Mortierella verticillata NRRL 6337 hypothetical protein MVEG_11466 [Mortierella verticillata NRRL 6337]. 
733923653 RAMA+RRM DUF3584 LOC100540096 891 eukaryota>metazoa>chordata>vertebrata Meleagris gallopavo PREDICTED: scaffold attachment factor B1-like [Meleagris gallopavo]. 735859753 RAMA VAR1 SAMD00019534_002340 904 eukaryota>amoebozoa>mycetozoa>dictyosteliida Acytostelium subglobosum LB1 hypothetical protein SAMD00019534_002340 [Acytostelium subglobosum LB1]. 676383955 RAMA+PARPFIN+TUDOR+RING DUF3584 AURANDRAFT_62586 1445 eukaryota>stramenopiles Aureococcus anophagefferens hypothetical protein AURANDRAFT_62586 [Aureococcus anophagefferens]. 470238462 RAMA+BRCT+BRCT PTCB-BRCT DFA_12254 890 eukaryota>amoebozoa>mycetozoa>dictyosteliida Dictyostelium fasciculatum hypothetical protein DFA_12254 [Dictyostelium fasciculatum]. 513226956 RAMA+DUF4417 - MPND 629 eukaryota>metazoa>chordata>vertebrata Gallus gallus PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein-like [Gallus gallus]. 735848583 RAMA Rrn6 SAMD00019534_125030 487 eukaryota>amoebozoa>mycetozoa>dictyosteliida Acytostelium subglobosum LB1 hypothetical protein SAMD00019534_125030 [Acytostelium subglobosum LB1]. 612395248 RAMA Nucleoplasmin Bathy05g04310 487 eukaryota>viridiplantae>chlorophyta Bathycoccus prasinos predicted protein [Bathycoccus prasinos]. 642092905 RAMA - GSONMT00016346001 373 eukaryota>metazoa>chordata>vertebrata>actinopterygii Oncorhynchus mykiss unnamed protein product [Oncorhynchus mykiss]. 159478713 RAMA FAM222A CHLREDRAFT_150353 1245 eukaryota>viridiplantae>chlorophyta Chlamydomonas reinhardtii predicted protein, partial [Chlamydomonas reinhardtii]. 620987738 RAMA Mito_fiss_reg LOC100089729 371 eukaryota>metazoa>chordata>vertebrata Ornithorhynchus anatinus PREDICTED: MPN domain-containing protein-like [Ornithorhynchus anatinus]. 303277185 RAMA - MICPUCDRAFT_47146 361 eukaryota>viridiplantae>chlorophyta Micromonas pusilla CCMP1545 predicted protein [Micromonas pusilla CCMP1545]. 
339246505 RAMA - Tsp_04073 324 eukaryota>metazoa>nematoda Trichinella spiralis hypothetical protein Tsp_04073 [Trichinella spiralis]. 546679818 RAMA - D910_07564 308 eukaryota>metazoa>hexapoda Dendroctonus ponderosae hypothetical protein D910_07564 [Dendroctonus ponderosae]. 543287426 RAMA Adeno_terminal MPND 415 eukaryota>metazoa>chordata>vertebrata Geospiza fortis PREDICTED: LOW QUALITY PROTEIN: MPN domain-containing protein, partial [Geospiza fortis]. 762153126 RAMA - LOC105318954 886 eukaryota>metazoa>mollusca Crassostrea gigas PREDICTED: uncharacterized protein LOC105318954 [Crassostrea gigas]. 119616158 RAMA - hCG_1647033 186 eukaryota>metazoa>chordata>vertebrata Homo sapiens hCG1647033, partial [Homo sapiens]. 761962926 RAMA - MNEG_12234 74 eukaryota>viridiplantae>chlorophyta Monoraphidium neglectum hypothetical protein MNEG_12234, partial [Monoraphidium neglectum]. 633909077 RAMA - H632_c1317p0 248 eukaryota>viridiplantae>chlorophyta Helicosporidium sp. ATCC 50920 hypothetical protein H632_c1317p0 [Helicosporidium sp. ATCC 50920]. 675890068 RAMA - HELRODRAFT_165788 847 eukaryota>metazoa>annelida Helobdella robusta hypothetical protein HELRODRAFT_165788 [Helobdella robusta]. 671037731 RAMA - MPND 501 eukaryota>metazoa>chordata>vertebrata Ursus maritimus PREDICTED: MPN domain-containing protein [Ursus maritimus]. 545702290 RAMA Filament Gasu_54090 805 eukaryota>rhodophyta Galdieria sulphuraria hypothetical protein Gasu_54090 [Galdieria sulphuraria]. 255072729 RAMA - MICPUN_98973 247 eukaryota>viridiplantae>chlorophyta Micromonas sp. RCC299 predicted protein [Micromonas sp. RCC299]. 66823477 RAMA DUF4175 DDB_G0272516 465 eukaryota>amoebozoa>mycetozoa>dictyosteliida Dictyostelium discoideum AX4 hypothetical protein DDB_G0272516 [Dictyostelium discoideum AX4]. 612386108 RAMA Daxx Bathy16g00130 620 eukaryota>viridiplantae>chlorophyta Bathycoccus prasinos predicted protein [Bathycoccus prasinos]. 
488545531 RAMA - MPND 241 eukaryota>metazoa>chordata>vertebrata Dasypus novemcinctus PREDICTED: MPN domain-containing protein, partial [Dasypus novemcinctus]. 760437079 RAMA TT_ORF1 F751_2934 306 eukaryota>viridiplantae>chlorophyta Auxenochlorella protothecoides MPN domain-containing protein [Auxenochlorella protothecoides]. 552813497 RAMA - CHLNCDRAFT_139765 582 eukaryota>viridiplantae>chlorophyta Chlorella variabilis hypothetical protein CHLNCDRAFT_139765 [Chlorella variabilis]. 443691830 RAMA - CAPTEDRAFT_211270 576 eukaryota>metazoa>annelida Capitella teleta hypothetical protein CAPTEDRAFT_211270 [Capitella teleta]. 159471497 RAMA JAB CHLREDRAFT_188109 545 eukaryota>viridiplantae>chlorophyta Chlamydomonas reinhardtii predicted protein [Chlamydomonas reinhardtii]. 159476834 RAMA - CHLREDRAFT_187079 540 eukaryota>viridiplantae>chlorophyta Chlamydomonas reinhardtii hypothetical protein CHLREDRAFT_187079 [Chlamydomonas reinhardtii]. 584005556 RAMA - LOC102779856 243 eukaryota>metazoa>chordata>vertebrata>actinopterygii Neolamprologus brichardi PREDICTED: putative ankyrin repeat domain-containing protein 31-like [Neolamprologus brichardi]. 405977144 RAMA - CGI_10025066 531 eukaryota>metazoa>mollusca Crassostrea gigas hypothetical protein CGI_10025066 [Crassostrea gigas]. 701427628 RAMA - MPND 236 eukaryota>metazoa>chordata>vertebrata Chaetura pelagica PREDICTED: MPN domain-containing protein, partial [Chaetura pelagica]. 761967974 RAMA MAT1 MNEG_8657 1281 eukaryota>viridiplantae>chlorophyta Monoraphidium neglectum hypothetical protein MNEG_8657 [Monoraphidium neglectum]. 260823234 RAMA+DOUBLECORTIN+DOUBLECORTIN+STYKIN CCDC66 BRAFLDRAFT_71623 2268 eukaryota>metazoa>chordata Branchiostoma floridae hypothetical protein BRAFLDRAFT_71623 [Branchiostoma floridae]. 612396523 RAMA+N6-MTase+ZFCW DUF1421 Bathy04g03050 1310 eukaryota>viridiplantae>chlorophyta Bathycoccus prasinos predicted protein [Bathycoccus prasinos]. 
551676387 AT-hook+BRIGHT/ARID+RAMA+TAMMBD ARID GUITHDRAFT_132396 1403 eukaryota>cryptophyta Guillardia theta CCMP2712 hypothetical protein GUITHDRAFT_132396 [Guillardia theta CCMP2712]. 551621810 RAMA+CHROMO Chromo EMIHUDRAFT_224016 613 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_224016 [Emiliania huxleyi CCMP1516]. 551589812 RAMA+CHROMO+LRR+LRR+LRR+LRR LRR_4 EMIHUDRAFT_114853 1118 eukaryota>haptophyceae Emiliania huxleyi CCMP1516 hypothetical protein EMIHUDRAFT_114853 [Emiliania huxleyi CCMP1516]. 302672755 RAMA+BULBLECTIN B_lectin SCHCODRAFT_238794 296 eukaryota>fungi>basidiomycota Schizophyllum commune H4-8 hypothetical protein SCHCODRAFT_238794 [Schizophyllum commune H4-8]. 636755953 RAMA - GLAREA_06546 312 eukaryota>fungi>ascomycota Glarea lozoyensis ATCC 20868 hypothetical protein GLAREA_06546 [Glarea lozoyensis ATCC 20868]. 459177714 RAMA - LOC100181315 872 eukaryota>metazoa>chordata Ciona intestinalis PREDICTED: uncharacterized protein LOC100181315 [Ciona intestinalis].
--A limited number of prokaryotic operons are presented here.
GI Gene neighborhoods Architectures Pfam-archs Gene-name Len Taxonomy Species Genbank
# 26; 501352122 <-N6-MTase+RAMA* N6-MTase+RAMA N6_N4_Mtase - 395 bacteria>proteobacteria>alphaproteobacteria Beijerinckia indica restriction endonuclease subunit M [Beijerinckia indica]. <-501352115_?<-501352116_?<-501352117_?<-501352118_?<-501352119_?||754154301_?->501352121_?-><-501352122_N6-MTase+RAMA*||501352123_?->501352124_?->501352125_?-><-501352126_?<-754154302_?<-501352128_?||754154303_?-> 499204035 - N6-MTase+RAMA N6_N4_Mtase - 381 archaea>euryarchaeota Thermoplasma acidophilum DNA methyltransferase [Thermoplasma acidophilum]. 739196468 N6-MTase+RAMA*-> N6-MTase+RAMA SP+N6_N4_Mtase - 376 bacteria>proteobacteria>alphaproteobacteria Rhizobium leguminosarum restriction endonuclease subunit M [Rhizobium leguminosarum].
739196456_?->739196457_?-><-739196458_?||739196461_?-><-739196566_?||739196463_?->739196466_?->739196468_N6-MTase+RAMA*-><-739196470_?<-739196471_?<-739196474_?||739196567_?->739196477_?->739196569_?->739196480_?-> 501142320 <-N6-MTase+RAMA* N6-MTase+RAMA N6_N4_Mtase - 375 bacteria>chloroflexi Herpetosiphon aurantiacus restriction endonuclease subunit M [Herpetosiphon aurantiacus]. 501142313_?->501142314_?->501142315_?->752637577_?->501142317_?-><-501142318_?<-501142319_?<-501142320_N6-MTase+RAMA*<-501142321_?<-752637579_?<-501142323_?<-501142324_?||501142325_?-><-501142326_?||501142327_?-> 512724354 <-N6-MTase+RAMA* N6-MTase+RAMA N6_N4_Mtase - 373 bacteria Chthonomonas calidirosea restriction endonuclease subunit M [Chthonomonas calidirosea]. <-512724347_?||769177975_?-><-512724349_?<-769179139_?<-512724351_?<-769179140_?||512724353_?-><-512724354_N6-MTase+RAMA*||769179142_?->512724356_?->512724357_?-><-512724358_?<-512724359_?<-512724360_?||512724362_?-> 493197799 N6-MTase+RAMA*-> N6-MTase+RAMA N6_N4_Mtase - 372 bacteria>spirochaetes Treponema vincentii restriction endonuclease subunit M [Treponema vincentii]. <-493197792_?<-493197793_?<-748667390_?<-493197795_?||748667391_?->493197797_?->493197798_?->493197799_N6-MTase+RAMA*->748667392_?->493197801_?-><-493197802_?<-493197803_?<-493197804_?||493197805_?-><-493197806_? 501069455 REase->?->?->N6-MTase+RAMA*-> N6-MTase+RAMA N6_N4_Mtase - 370 bacteria>chloroflexi Roseiflexus castenholzii restriction endonuclease subunit M [Roseiflexus castenholzii]. 501069448_?-><-501069449_?<-501069450_?||501069451_?->501069452_REase->501069453_?->752683981_?->501069455_N6-MTase+RAMA*-><-501069456_?<-501069457_?<-501069458_?<-501069460_?||501069461_?->501069462_?->501069463_?-> 496651971 HNH-><-?<-?||?-><-?<-?<-?<-N6-MTase+RAMA*<-REase N6-MTase+RAMA N6_N4_Mtase - 369 bacteria>proteobacteria>epsilonproteobacteria Campylobacter sp. 10_1_50 restriction endonuclease subunit M [Campylobacter sp. 10_1_50]. 
736902613_HNH-><-496651959_?<-496651961_?||496651963_?-><-496651965_?<-496651967_?<-496651969_?<-496651971_N6-MTase+RAMA*<-496651973_REase<-496651975_?<-736902809_?<-496651979_?<-496651981_?<-496651983_?<-489029481_? 503325698 N6-MTase+RAMA*-> N6-MTase+RAMA N6_N4_Mtase - 368 bacteria>chloroflexi Anaerolinea thermophila restriction endonuclease subunit M [Anaerolinea thermophila]. 503325692_?-><-752816319_?<-752816320_?<-752815681_?<-752815682_?<-503325696_?<-752815683_?||503325698_N6-MTase+RAMA*->752816321_?->503325700_?->503325701_?->503325702_?-><-752815684_?||503325704_?->503325705_?-> 550983501 <-N6-MTase+RAMA* N6-MTase+RAMA N6_N4_Mtase - 366 bacteria>proteobacteria>alphaproteobacteria Thalassospira lucentensis restriction endonuclease subunit M [Thalassospira lucentensis]. <-655387197_?<-703179655_?<-550983496_?<-550983497_?<-703179657_?||655387198_?-><-550983500_?<-550983501_N6-MTase+RAMA*||550983502_?->703179660_?->550983504_?->550983505_?->550983506_?->550983507_?->550983508_?-> 696308956 N6-MTase+RAMA*-> N6-MTase+RAMA N6_N4_Mtase - 365 bacteria>fusobacteria Fusobacterium nucleatum DNA methyltransferase [Fusobacterium nucleatum]. 492647398_?->492647397_?->552907250_?->552907251_?->492647394_?->492647392_?->492647391_?->696308956_N6-MTase+RAMA*->552907253_?->552907254_?->696308958_?->552907257_?->552907258_?->492627342_?->495970163_?-> 493739841 N6-MTase+RAMA*-> N6-MTase+RAMA N6_N4_Mtase - 363 bacteria>tenericutes Ureaplasma parvum restriction endonuclease subunit M [Ureaplasma parvum]. 493739899_?->493739841_N6-MTase+RAMA*->493739954_?->755116483_?->755116485_?->755116486_?->493739937_?->493739927_?-><-493739876_? 497185310 <-N6-MTase+RAMA* N6-MTase+RAMA N6_N4_Mtase - 361 bacteria>proteobacteria>gammaproteobacteria Moraxella macacae DNA methylase N-4/N-6 domain-containing protein [Moraxella macacae]. 
<-750342964_?<-750342967_?||497185298_?->497185301_?->497185304_?->497185306_?-><-497185308_?<-497185310_N6-MTase+RAMA*<-497185312_?<-497185314_?||497185316_?->750342968_?-><-497185321_?<-497185605_?<-497185606_? 503763750 <-REase<-N6-MTase+RAMA*<-?<-?<-?<-?<-?<-?<-ParB N6-MTase+RAMA N6_N4_Mtase - 361 bacteria>bacteroidetes Capnocytophaga canimorsus DNA methyltransferase [Capnocytophaga canimorsus]. <-754503245_?<-503763744_?<-754502974_?<-503763746_?<-503763747_?<-503763748_?<-503763749_REase<-503763750_N6-MTase+RAMA*<-503763751_?<-503763754_?<-503763755_?<-503763756_?<-503763757_?<-503763758_?<-503763759_ParB 518849236 N6-MTase+RAMA*->?->?-><-?<-HNH<-?||?->McrB-> N6-MTase+RAMA N6_N4_Mtase - 360 bacteria>spirochaetes Brachyspira innocens DNA methyltransferase [Brachyspira innocens]. <-518849229_?||518849230_?-><-703420857_?<-518849232_?||703420859_?->518849234_?->518849235_?->518849236_N6-MTase+RAMA*->518849237_?->518849238_?-><-518849239_?<-518849240_HNH<-518849241_?||518849242_?->518849243_McrB-> 654345013 N6-MTase+RAMA*-> N6-MTase+RAMA N6_N4_Mtase+PCMT - 360 bacteria>cyanobacteria Mastigocoleus testarum restriction endonuclease subunit M [Mastigocoleus testarum]. 654345007_?-><-654345008_?||654345009_?-><-654345010_?||654345011_?->738308971_?->654345012_?->654345013_N6-MTase+RAMA*-><-654345014_?||654345015_?-><-738308930_?<-738308933_?||738308975_?->654345018_?->654345019_?-> 697093027 REase->N6-MTase+RAMA*-> N6-MTase+RAMA N6_N4_Mtase - 360 bacteria>tenericutes Mycoplasma collis restriction endonuclease subunit M [Mycoplasma collis]. 697093022_?->697093023_?->697093024_?->697093036_?->697093025_?->697093026_REase->697093027_N6-MTase+RAMA*-><-697093028_?<-697093029_?||697093030_?->697093037_?->697093031_?->738479282_?->697093032_?-> 737625856 N6-MTase+RAMA*-> N6-MTase+RAMA N6_N4_Mtase - 360 bacteria>proteobacteria>alphaproteobacteria Hyphomonas polymorpha restriction endonuclease subunit M [Hyphomonas polymorpha]. 
737625844_?-><-737625847_?||737626074_?->737625850_?-><-737625853_?||737626077_?-><-737626080_?||737625856_N6-MTase+RAMA*-><-737625859_?<-737626083_?<-737625862_?||737625865_?->737625868_?->737625870_?->737625873_?-> 748143394 <-N6-MTase+RAMA* N6-MTase+RAMA N6_N4_Mtase - 360 bacteria>cyanobacteria Scytonema millei restriction endonuclease subunit M [Scytonema millei]. <-748143391_?<-748143392_?<-748143393_?<-748143596_?<-748143597_?<-748143598_?||748143599_?-><-748143394_N6-MTase+RAMA*<-748143395_?<-748143396_?<-748143600_?||748143601_?-><-748143602_?<-748143397_?<-748143398_? 446268888 <-REase<-N6-MTase+RAMA* N6-MTase+RAMA N6_N4_Mtase - 359 bacteria>proteobacteria>epsilonproteobacteria Helicobacter pylori DNA methyltransferase [Helicobacter pylori]. <-447055814_?||487802840_?-><-446116267_?||446003551_?->446761496_?-><-658502684_?<-727092483_REase<-446268888_N6-MTase+RAMA*<-446833673_?<-446375435_?<-447064608_?<-446148357_?<-446875834_?<-727086548_?<-446836634_? 308225152 N6-MTase+RAMA*-> N6-MTase+RAMA N6_N4_Mtase OSCT_3182 357 bacteria>chloroflexi Oscillochloris trichoides DG-6 DNA methylase N-4/N-6 domain-containing protein [Oscillochloris trichoides DG-6]. 308225190_?-><-308225191_?<-308225192_?||308225193_?->308225194_?->308225150_?->308225151_?->308225152_N6-MTase+RAMA*->308225153_?-><-308225154_?<-308225155_?<-308225156_?||308225157_?->308225158_?->308225159_?-> 568205957 N6-MTase+RAMA*-> N6-MTase+RAMA N6_N4_Mtase - 357 bacteria>proteobacteria>alphaproteobacteria Magnetospirillum gryphiswaldense restriction endonuclease subunit M [Magnetospirillum gryphiswaldense]. 
<-753897375_?<-568205951_?<-753897377_?||753897379_?->753897381_?->568205955_?->753897383_?->568205957_N6-MTase+RAMA*-><-568205958_?<-568205959_?<-568205960_?||753897386_?-><-568205962_?||568205963_?->753897389_?-> 665867244 N6-MTase+RAMA*-> N6-MTase+RAMA N6_N4_Mtase - 353 bacteria>proteobacteria>alphaproteobacteria Agrobacterium tumefaciens restriction endonuclease subunit M, partial [Agrobacterium tumefaciens]. 665867244_N6-MTase+RAMA*->489604643_?->648671957_?-><-489604641_?<-523694408_?||523694409_?->489604638_?->489604637_?-> 38906136 N6-MTase+RAMA*-> N6-MTase+RAMA Methyltransf_26+N6_N4_Mtase - 350 bacteria>firmicutes Staphylococcus sp. L1 adenine DNA methyltransferase [Staphylococcus sp. L1]. 38906136_N6-MTase+RAMA*-> 737789152 N6-MTase+RAMA*->REase-> N6-MTase+RAMA N6_N4_Mtase - 349 bacteria>bacteroidetes Flexibacter roseolus hypothetical protein, partial [Flexibacter roseolus]. 737789149_?-><-652631573_?||737789152_N6-MTase+RAMA*->652631574_REase-><-652631575_?||737789146_?-> 354959883 N6-MTase+RAMA*-> N6-MTase+RAMA N6_N4_Mtase BJ6T_73150 344 bacteria>proteobacteria>alphaproteobacteria Bradyrhizobium japonicum USDA 6 adenine DNA methyltransferase [Bradyrhizobium japonicum USDA 6]. 354959876_?-><-354959877_?<-354959878_?||354959879_?->354959880_?-><-354959881_?<-354959882_?||354959883_N6-MTase+RAMA*->354959884_?->354959885_?-><-354959886_?<-354959887_?<-354959888_?<-354959889_?<-354959890_? # 11; 651265592 <-URI+RAMA* URI+RAMA GIY-YIG+DUF4357 - 305 bacteria>proteobacteria>alphaproteobacteria Acetobacter nitrogenifigens hypothetical protein [Acetobacter nitrogenifigens]. 651265585_?->651265586_?->651265587_?->651265588_?-><-737395337_?||651265590_?->651265591_?-><-651265592_URI+RAMA*<-651265593_?<-651265594_?<-651265595_?||737395338_?->651265596_?->737395339_?->737395340_?-> 772620269 URI+RAMA*-> URI+RAMA GIY-YIG+DUF4357 - 295 bacteria>proteobacteria>betaproteobacteria Comamonas aquatica excinuclease ABC subunit C [Comamonas aquatica]. 
772620263_?->772620264_?->772620265_?->772622099_?->772620266_?->772620267_?->772620268_?->772620269_URI+RAMA*->772620270_?->772620271_?->772620272_?-><-772620273_?||772620274_?->772620275_?-><-759660784_? 494162448 URI+RAMA*-> URI+RAMA GIY-YIG+DUF4357 - 294 bacteria>cyanobacteria Synechococcus sp. RS9917 excinuclease ABC subunit C [Synechococcus sp. RS9917]. <-494162439_?<-494162440_?||494162441_?->494162442_?->494162443_?->494162444_?->494162445_?->494162448_URI+RAMA*->494162449_?->494162450_?-><-740157707_?||494162452_?->494162454_?-><-494162455_?<-740157709_? 736707052 URI+RAMA*-> URI+RAMA GIY-YIG+DUF4357 - 293 bacteria>proteobacteria>betaproteobacteria Acidovorax sp. JHL-9 excinuclease ABC subunit C [Acidovorax sp. JHL-9]. <-651308342_?<-651308343_?<-651308344_?<-651308345_?<-736707050_?<-651308347_?||651308348_?->736707052_URI+RAMA*->651308350_?-><-651308351_?<-651308352_?||651308353_?->736707081_?->651308354_?->736707082_?-> 759571980 URI+RAMA*-> URI+RAMA DUF4357 - 287 bacteria>proteobacteria>betaproteobacteria Burkholderia pseudomallei hypothetical protein [Burkholderia pseudomallei]. 759571971_?->759571973_?-><-740943948_?||759571977_?->740943944_?->759572002_?->759571978_?->759571980_URI+RAMA*->759571982_?->759572005_?->759571985_?->759571988_?->759572008_?->759572010_?-><-759572013_? 763304606 URI+RAMA*-> URI+RAMA GIY-YIG+DUF4357 - 287 bacteria>firmicutes Salinibacillus aidingensis hypothetical protein [Salinibacillus aidingensis]. 763304595_?->763304597_?->763304599_?->763304601_?->763304630_?->763304603_?->763304604_?->763304606_URI+RAMA*->763304608_?->763304610_?->763304611_?->763304631_?->763304613_?-><-763304633_? 504865027 <-URI+RAMA*<-?||?-><-?<-URI+RAMA URI+RAMA GIY-YIG+DUF4357 - 285 archaea>euryarchaeota Methanolobus psychrophilus hypothetical protein [Methanolobus psychrophilus]. 
<-504865020_?<-504865021_?<-504865022_?||504865024_?->851281505_?->504865025_?-><-851281508_?<-504865027_URI+RAMA*<-851281511_?||504865029_?-><-851281508_?<-504865030_URI+RAMA||504865031_?->504865032_?-><-504865033_? 750016631 <-URI+RAMA* URI+RAMA GIY-YIG+DUF4357 - 285 bacteria>planctomycetes Blastopirellula marina hypothetical protein [Blastopirellula marina]. 750016625_?-><-750016626_?<-488730430_?<-750016628_?||488730433_?->750016629_?-><-488730435_?<-750016631_URI+RAMA*<-488730437_?<-488730438_?||750015383_?->488730440_?->488730441_?->750015384_?->488730444_?-> 695943706 <-URI+RAMA* URI+RAMA GIY-YIG+DUF4357 LI82_07720 283 archaea>euryarchaeota Methanococcoides methylutens hypothetical protein LI82_07720 [Methanococcoides methylutens]. <-695943703_?||695943704_?-><-695943705_?<-695943706_URI+RAMA*||695943707_?->695943708_?->695943709_?->695943710_?-><-695943711_?<-695943712_?<-695943713_? 737312527 <-URI+RAMA* URI+RAMA GIY-YIG+DUF4357 - 281 bacteria>firmicutes Brevibacillus thermoruber hypothetical protein [Brevibacillus thermoruber]. 737312520_?-><-737312521_?<-737312523_?<-737311946_?||517953542_?->737311949_?-><-737312525_?<-737312527_URI+RAMA*<-737312529_?<-737312530_?<-737312532_?<-737312534_?<-737312535_?<-737312537_?||737312884_?-> 501422597 URI+RAMA*->HNH->?->McrB->McrC-> URI+RAMA GIY-YIG+DUF4357 - 279 bacteria>firmicutes Natranaerobius thermophilus hypothetical protein [Natranaerobius thermophilus]. 752720267_?->501422593_?->501422594_?->501422595_?->501422520_?->752719768_?->501422596_?->501422597_URI+RAMA*->501422598_HNH->501422599_?->752720268_McrB->501422601_McrC->752720270_?->501422603_?->501422604_?-> # 5; 648575907 RAMA+CBS*-> RAMA+CBS SP+DUF4357+CBS - 467 bacteria>actinobacteria Micromonospora sp. CNB394 hypothetical protein [Micromonospora sp. CNB394]. 
<-517614212_?<-517614213_?<-517614214_?<-517614215_?<-517614216_?<-517614217_?||517614218_?->648575907_RAMA+CBS*-><-738393323_?||517614221_?->517614222_?-><-517614223_?||517614224_?-><-517614227_?<-738393325_? 759777256 RAMA+CBS*-> RAMA+CBS DUF4357+CBS - 460 bacteria>actinobacteria Streptomyces bingchenggensis hypothetical protein [Streptomyces bingchenggensis]. <-503942394_?<-503942395_?<-759781450_?<-503942397_?<-503942398_?<-759781453_?<-503942400_?||759777256_RAMA+CBS*-><-503942402_?||759781456_?->503942404_?->503942406_?->503942407_?->503942408_?-><-503942409_? 754222116 <-RAMA+CBS* RAMA+CBS DUF4357+CBS - 457 bacteria>actinobacteria Actinoplanes missouriensis hypothetical protein [Actinoplanes missouriensis]. 504253353_?->504253354_?->754222113_?-><-754220794_?||504253357_?-><-504253358_?<-754222114_?<-754222116_RAMA+CBS*||504253361_?->504253362_?->504253363_?->504253364_?->504253365_?-><-504253366_?<-754220795_? 703061045 <-RAMA+CBS* RAMA+CBS CBS - 456 bacteria>actinobacteria Catenuloplanes japonicus hypothetical protein [Catenuloplanes japonicus]. <-703061033_?<-703061035_?<-703061037_?||703061039_?->703061041_?->703061043_?->703061672_?-><-703061045_RAMA+CBS*<-703061047_?<-703061049_?<-703061052_?<-703061054_?<-703061678_?||703061681_?->703061056_?-> 739497435 RAMA+CBS*-> RAMA+CBS DUF4357+CBS - 453 bacteria>actinobacteria Amycolatopsis orientalis hypothetical protein [Amycolatopsis orientalis]. 739497424_?->739497426_?-><-739497428_?||739497482_?-><-739497430_?<-739497432_?||739497434_?->739497435_RAMA+CBS*->739497483_?-><-739497437_?||739497439_?->739497441_?-><-739497484_?<-739497443_?||739497445_?-> # 4; 640573949 <-URI+RAMA* URI+RAMA GIY-YIG+DUF4357 - 275 bacteria>bacteroidetes Porphyromonas macacae methionine sulfoxide reductase [Porphyromonas macacae]. 
738988659_?->640573943_?->640573944_?->640573945_?->640573946_?->640573947_?-><-640573948_?<-640573949_URI+RAMA*<-738988661_?<-640573951_?<-640573952_?<-640573953_?<-517171461_?<-640573954_?<-640573955_? 649528608 URI+RAMA*-> URI+RAMA GIY-YIG+DUF4357 M091_1691 247 bacteria>bacteroidetes Parabacteroides distasonis str. 3776 D15 i GIY-YIG catalytic domain protein [Parabacteroides distasonis str. 3776 D15 i]. <-649528276_?||649528560_?-><-649528583_?||649528351_?->649528575_?->649528432_?->649528598_?->649528608_URI+RAMA*->649528373_?-><-649528566_?||649528459_?->649528613_?-><-649528574_?||649528362_?->649528551_?-> 596132180 <-URI+RAMA* URI+RAMA GIY-YIG+DUF4357 M068_1150 245 bacteria>bacteroidetes Bacteroides fragilis str. J38-1 GIY-YIG catalytic domain protein [Bacteroides fragilis str. J38-1]. 596132234_?->596132210_?->596132254_?->596132135_?->596132046_?->596132118_?-><-596132198_?<-596132180_URI+RAMA*<-596132083_?<-596132050_?<-596132066_?<-596132205_?<-596132209_?<-596132185_?<-596132235_? 325482299 <-URI+RAMA* URI+RAMA DUF4357 HMPREF9303_0585 235 bacteria>bacteroidetes Prevotella denticola CRIS 18C-A hypothetical protein HMPREF9303_0585 [Prevotella denticola CRIS 18C-A]. 325482290_?->325482255_?->325482244_?->325482263_?->325482315_?-><-325482295_?<-325482292_?<-325482299_URI+RAMA*<-325482335_?<-325482243_?<-325482249_?<-325482334_?||325482258_?->325482287_?->325482242_?-> # 4; 296267968 <-RAMA* RAMA DUF2924 HMPREF0731_0014 158 bacteria>proteobacteria>alphaproteobacteria Roseomonas cervicalis ATCC 49957 hypothetical protein HMPREF0731_0014 [Roseomonas cervicalis ATCC 49957]. <-296267968_RAMA*<-296267969_? 489363811 S-resolvase<-RAMA* RAMA DUF2924 - 154 bacteria>proteobacteria>betaproteobacteria Ralstonia solanacearum hypothetical protein [Ralstonia solanacearum]. 
746510525_?->489357851_?->489357852_?->489357853_?->755831621_?-><-755831623_?<-489367078_S-resolvase<-489363811_RAMA*<-489367076_?<-489367075_?<-489367073_?<-746524612_?<-755654450_?<-755654448_?||755830982_?-> 757691291 RAMA*-> RAMA DUF2924 - 148 bacteria>proteobacteria>gammaproteobacteria Pseudomonas chloritidismutans hypothetical protein [Pseudomonas chloritidismutans]. <-757691263_?<-757691286_?<-757691265_?<-757691287_?<-757691289_?<-757691267_?||757691269_?->757691291_RAMA*->757691292_?-><-757691271_?<-757691272_?<-757691273_?||757691275_?->757691276_?->757691294_?-> 500033502 RAMA*->S-resolvase RAMA DUF2924 - 144 bacteria>proteobacteria>alphaproteobacteria Magnetococcus marinus hypothetical protein [Magnetococcus marinus]. <-753915544_?<-500033496_?<-500033497_?<-500033498_?<-500033499_?<-500033500_?||500033501_?->500033502_RAMA*->500033503_S-resolvase->753915546_?->753915551_?-><-500033506_?<-500033507_?<-500033508_?<-500033509_? # 3; 159894763 <-PARB+HNH+RAMA* PARB+HNH+RAMA DUF262+DUF1524 Haur_5215 690 bacteria>chloroflexi Herpetosiphon aurantiacus DSM 785 protein of unknown function DUF262 (plasmid) [Herpetosiphon aurantiacus DSM 785]. <-159894756_?<-159894757_?<-159894758_?||159894759_?-><-159894760_?<-159894761_?<-159894762_?<-159894763_PARB+HNH+RAMA*<-159894764_?<-159894765_?<-159894766_?||159894767_?->159894768_?->159894769_?->159894770_?-> 488784593 PARB+HNH+RAMA*-> PARB+HNH+RAMA SP+DUF262+DUF1524 - 689 bacteria>bacteroidetes Microscilla marina hypothetical protein [Microscilla marina]. <-488784585_?<-488784586_?<-488784587_?<-488784588_?<-770187064_?||488784590_?->770186989_?->488784593_PARB+HNH+RAMA*->488784595_?-><-488784597_?||488784598_?->488784601_?-><-488784602_?<-488784603_?||488784605_?-> 755032699 <-PARB+HNH+RAMA* PARB+HNH+RAMA DUF262+DUF1524 - 672 bacteria>firmicutes Geobacillus thermoglucosidasius hypothetical protein [Geobacillus thermoglucosidasius]. 
<-755032684_?<-755032686_?<-755032688_?<-755032690_?||755032692_?-><-755032695_?<-755032696_?<-755032699_PARB+HNH+RAMA*||755032700_?-><-503642854_?<-503642853_?<-755032703_? # 3; 763144244 DCM->HNH->SeqA+RAMA*-> SeqA+RAMA DUF4357 - 137 bacteria>proteobacteria>gammaproteobacteria Vibrio vulnificus hypothetical protein [Vibrio vulnificus]. 763144243_?->503337475_?->503337476_?->763144312_?->686222949_?->763144313_DCM->763144314_HNH->763144244_RAMA*->503337482_?->499393456_?->503337483_?->503337484_?->503337485_?-><-499462739_?<-763144245_? 775294809 RAMA*-> RAMA - Aam_125_009 134 bacteria>proteobacteria>alphaproteobacteria Acidocella aminolytica 101 = DSM 11237 hypothetical protein Aam_125_009 [Acidocella aminolytica 101 = DSM 11237]. <-775294802_?<-775294803_?||775294804_?->775294805_?-><-775294806_?||775294807_?->775294808_?->775294809_RAMA*->775294810_?->775294811_?->775294812_?->775294813_?-> 494322155 RAMA*-> RAMA DUF2924 - 126 bacteria>proteobacteria>betaproteobacteria Burkholderia sp. Ch1-1 DUF2924 domain-containing protein [Burkholderia sp. Ch1-1]. 737545229_?->494322149_?->494322150_?->494322151_?->494322152_?->494322153_?->494322154_?->494322155_RAMA*->737545230_?->494322157_?->737544494_?->737545231_?-><-494322159_?||494322160_?-><-737545232_? # 2; 402258132 <-PARB+HNH+RAMA* PARB+HNH+RAMA DUF262+DUF1524+DUF4357 B437_00700 693 bacteria>fusobacteria Fusobacterium hwasookii ChDC F128 hypothetical protein B437_00700 [Fusobacterium hwasookii ChDC F128]. <-402258125_?<-402258126_?<-402258127_?||402258128_?->402258129_?->402258130_?-><-402258131_?<-402258132_PARB+HNH+RAMA*<-402258133_?<-402258134_?<-402258135_?<-402258136_?||402258137_?->402258138_?->402258139_?-> 489467259 <-PARB+HNH+RAMA* PARB+HNH+RAMA DUF262+DUF1524+DUF4357 - 674 bacteria>firmicutes Clostridium botulinum hypothetical protein [Clostridium botulinum]. 
489465522_?-><-489464004_?<-489464013_?<-489464728_?||737813847_?-><-489465217_?<-489469295_?<-489467259_PARB+HNH+RAMA*<-489466793_?<-489464181_?<-489465116_?<-489469379_?<-489466191_?<-489464120_?<-489469029_? # 2; 655526129 <-HSDRN+DUF4268+RAMA*<-HNH HSDRN+DUF4268+RAMA HSDR_N_2+DUF4357+DUF4268 - 509 bacteria>bacteroidetes Prevotella sp. P6B4 hypothetical protein [Prevotella sp. P6B4]. 655526122_?->655526123_?->655526124_?->655526125_?->655526126_?->655526127_?->655526128_?-><-655526129_HSDRN+DUF4268+RAMA*<-655526130_HNH<-739035040_?||655526131_?->697058486_?->655526132_?->655526133_?-> 697058363 HNH->HSDRN+DUF4268+RAMA*-> HSDRN+DUF4268+RAMA HSDR_N_2+DUF4357+DUF4268 - 509 bacteria>bacteroidetes Prevotella sp. P6B1 hypothetical protein [Prevotella sp. P6B1]. <-697058485_?<-697058359_?<-697058360_?<-697058486_?<-697058361_?||697058487_?->697058362_HNH->697058363_HSDRN+DUF4268+RAMA*-><-655526128_?<-697058364_?<-697058365_?||697058366_?-><-697058367_?<-697058368_?<-655526118_? # 2; 500693945 - REase+RAMA PDDEXK_4 - 358 archaea>euryarchaeota Methanococcus maripaludis hypothetical protein [Methanococcus maripaludis]. 651957888 <-REase+RAMA* REase+RAMA DUF91 - 333 bacteria>firmicutes Bacillus megaterium hypothetical protein [Bacillus megaterium]. <-651957878_?||651957879_?->651957880_?->651957881_?-><-651957882_?||651957884_?-><-651957886_?<-651957888_REase+RAMA*||651957890_?-><-651957892_?<-651957894_? # 2; 740127358 URI+RAMA*-> URI+RAMA GIY-YIG+DUF4357 - 277 bacteria>synergistetes Synergistes jonesii hypothetical protein [Synergistes jonesii]. <-740127346_?||740127347_?->740127351_?->740127353_?-><-740127355_?<-740127722_?||740127356_?->740127358_URI+RAMA*->740127360_?->740127366_?->740127368_?->740127371_?->740127725_?-><-740127728_?<-740127373_? 517958401 URI+RAMA*-> URI+RAMA GIY-YIG+DUF4357 - 276 bacteria>actinobacteria Enorma massiliensis hypothetical protein [Enorma massiliensis]. 
517958393_?->517958394_?-><-737113754_?<-517958396_?||517958398_?->737113756_?->737113031_?->517958401_URI+RAMA*->517958402_?->517958403_?->517958404_?->737113758_?-><-737113760_?<-517958407_?<-517958409_? # 2; 495798143 T5orf172+RAMA*-> T5orf172+RAMA DUF4357 - 236 bacteria>synergistetes Jonquetella anthropi hypothetical protein [Jonquetella anthropi]. <-495798139_?<-495798140_?<-495798141_?<-495798142_?<-495796199_?<-495796200_?<-495796202_?||495798143_T5orf172+RAMA*->495798146_?-><-495798147_?<-495798148_?||495798149_?-><-495798150_?||495798151_?->737819285_?-> 490555808 T5orf172+RAMA*-> T5orf172+RAMA T5orf172+DUF4357 - 223 bacteria>tenericutes Mycoplasma bovigenitalium protein, helicase [Mycoplasma bovigenitalium]. <-490555796_?||750242421_?->490555798_?->490555801_?->490555803_?->490555805_?->490555807_?->490555808_T5orf172+RAMA*->750242422_?-> # 2; 521961820 RAMA*->S-resolvase RAMA DUF2924 - 165 bacteria>planctomycetes Zavarzinella formosa hypothetical protein [Zavarzinella formosa]. 750609086_?-><-657929650_?||750609087_?->521961815_?->750609088_?->521961818_?->657929652_?->521961820_RAMA*->521961821_S-resolvase->521961822_?->521961823_?->521961824_?->521961825_?->521961826_?->521961827_?-> 655234456 S-resolvase<-RAMA* RAMA DUF2924 - 155 bacteria>actinobacteria Nocardioides sp. JGI 0001009-J09 hypothetical protein [Nocardioides sp. JGI 0001009-J09]. <-655234449_?<-655234450_?<-655234451_?<-655234452_?<-655234453_?<-655234454_?<-655234455_S-resolvase<-655234456_RAMA*||655234457_?->655234458_?->655234459_?->655234460_?-> # 2; 499350070 <-RAMA* RAMA DUF4357 - 137 bacteria>actinobacteria Streptomyces coelicolor hypothetical protein [Streptomyces coelicolor]. 
21234283_?-><-21234284_?||21234285_?->21234286_?-><-21234287_?||21234288_?-><-21234289_?<-499350070_RAMA*<-21234291_?<-21234292_?||21234293_?->21234294_?->21234295_?->21234296_?->21234297_?-> 763031682 RAMA*-> RAMA DUF4357 - 133 bacteria>actinobacteria Kitasatospora griseola hypothetical protein [Kitasatospora griseola]. 763031679_?->763032620_?->763032621_?->763031680_?-><-763031681_?<-763032622_?<-763032623_?||763031682_RAMA*->763031683_?->763031684_?->763031685_?-><-763032624_?||763031686_?->763031687_?-><-763031688_? # 1; 326658949 <-N6-MTase+RAMA* N6-MTase+RAMA HsdM_N+N6_Mtase+DUF4357+CBS SACT1_4469 867 bacteria>actinobacteria Streptomyces griseus XylebKG-1 N-6 DNA methylase [Streptomyces griseus XylebKG-1]. <-326658942_?||326658943_?->326658944_?->326658945_?->326658946_?->326658947_?->326658948_?-><-326658949_N6-MTase+RAMA*||326658950_?-><-326658951_?||326658952_?->326658953_?->326658954_?->326658955_?->326658956_?-> 545335724 RAMA*-> RAMA - - 731 bacteria>actinobacteria Actinomyces johnsonii hypothetical protein [Actinomyces johnsonii]. 545335723_?->545335724_RAMA*->736449647_?->545335726_?->545333444_?-><-545335727_?<-496526014_?<-545335728_?||545335729_?-> 649278169 RAMA*-> RAMA DUF1707+MCPVI+DUF605 - 678 bacteria>actinobacteria Promicromonospora sukumoe hypothetical protein [Promicromonospora sukumoe]. 518862026_?-><-518862027_?||518862028_?-><-518862029_?<-518862030_?||518862031_?-><-649278167_?||649278169_RAMA*-><-649278171_?<-703011700_?<-518862035_?<-518862036_?<-703011703_?<-518862038_?||518862039_?-> 516723468 ParB->?->ParB->?-><-PARB+HNH+RAMA* PARB+HNH+RAMA DUF262+DUF1524 - 706 bacteria>proteobacteria>alphaproteobacteria Martelella mediterranea hypothetical protein [Martelella mediterranea]. 
<-516723460_?<-516723461_?||516723462_?->516723463_ParB->516723464_?->516723465_ParB->516723466_?-><-516723468_PARB+HNH+RAMA*||516723471_?-><-648481907_?||503898728_?->648481908_?->516723474_?->516723475_?->516723476_?-> 737135649 PARB+HNH+RAMA*-> PARB+HNH+RAMA DUF262+DUF1524 - 702 bacteria>actinobacteria Corynebacterium freneyi hypothetical protein [Corynebacterium freneyi]. <-737135688_?<-737135647_?||737135648_?->737135649_PARB+HNH+RAMA*-><-737135689_?<-737135690_?<-737135691_?<-737135651_?<-737135653_?<-737135654_?||737135656_?-> 742488091 PARB+HNH+RAMA*-> PARB+HNH+RAMA DUF262+DUF1524 - 712 bacteria>proteobacteria>alphaproteobacteria Bradyrhizobium japonicum hypothetical protein, partial [Bradyrhizobium japonicum]. 742488083_?->742488084_?-><-742488085_?<-742488086_?<-742488087_?<-742488089_?||742488090_?->742488091_PARB+HNH+RAMA*-><-654689942_?<-654689943_?||654689944_?->654689945_?->654689946_?->654689947_?->742488092_?-> 548076194 PARB+HNH+RAMA*-> PARB+HNH+RAMA DUF262+DUF1524 - 676 bacteria>actinobacteria Cryptobacterium sp. CAG:338 hypothetical protein [Cryptobacterium sp. CAG:338]. 548076163_?->548076167_?-><-548076172_?||548076176_?->548076179_?->548076184_?->548076189_?->548076194_PARB+HNH+RAMA*-><-548076199_?<-548076204_?||548076210_?-><-548076215_?||548076219_?->548076222_?->548076225_?-> 664177934 <-PARB+HNH+RAMA* PARB+HNH+RAMA DUF262+DUF1524 - 641 bacteria>actinobacteria Streptomyces griseus hypothetical protein [Streptomyces griseus]. <-664177917_?<-664177919_?<-664177921_?<-664177924_?||664177926_?-><-664177929_?||664177931_?-><-664177934_PARB+HNH+RAMA*||664177937_?-><-664177940_?<-664177943_?<-664177946_?<-497744080_?<-664177949_?<-664177952_? 739317412 <-PARB+HNH+RAMA* PARB+HNH+RAMA DUF262+DUF4357 - 843 bacteria>actinobacteria Rhodococcus fascians hypothetical protein [Rhodococcus fascians]. 
694052417_?->694052418_?->694052332_?->694052333_?->694052334_?->694052335_?->694052419_?-><-739317412_PARB+HNH+RAMA*||694052421_?->694052422_?-><-694052423_?<-694052424_?||694052336_?-><-694052337_?<-694052338_? 739980329 <-McrC<-RAMA+EVE+McrB* RAMA+EVE+McrB DUF4357+AAA_5 - 599 bacteria>actinobacteria Streptomyces sp. NRRL B-5680 ATPase AAA, partial [Streptomyces sp. NRRL B-5680]. <-663270903_?<-663270906_?<-663271111_?<-663271115_?||663270908_?-><-663270911_?<-663270915_McrC<-739980329_AAA++RAMA*<-663270921_?<-663270926_?||663270932_?-><-663270935_?<-663270938_?<-663270958_?<-739980313_? 334108481 InPase-><-?||?->?->?-><-?||?-><-RAMA* RAMA - Isova_2668 578 bacteria>actinobacteria Isoptericola variabilis 225 hypothetical protein Isova_2668 [Isoptericola variabilis 225]. 334108474_InPase-><-334108475_?||334108476_?->334108477_?->334108478_?-><-334108479_?||334108480_?-><-334108481_RAMA*||334108482_?-><-334108483_?||334108484_?->334108485_?-><-334108486_?<-334108487_?||334108488_?-> 452756494 <-X+RAMA* X+RAMA DUF1571+DUF4357 G418_29712 558 bacteria>actinobacteria Rhodococcus qingshengii BKS 20-40 hypothetical protein G418_29712 [Rhodococcus qingshengii BKS 20-40]. 452756487_?->452756488_?-><-452756489_?||452756490_?->452756491_?->452756492_?-><-452756493_?<-452756494_X+RAMA*<-452756495_?<-452756496_?||452756497_?->452756498_?-><-452756499_?<-452756500_?||452756501_?-> 304326663 <-X+RAMA* X+RAMA - HMPREF0574_0913 554 bacteria>actinobacteria Mobiluncus curtisii subsp. curtisii ATCC 35241 hypothetical protein HMPREF0574_0913 [Mobiluncus curtisii subsp. curtisii ATCC 35241]. <-304326656_?<-304326657_?||304326658_?-><-304326659_?||304326660_?->304326661_?->304326662_?-><-304326663_X+RAMA*<-304326664_?<-304326665_?<-304326666_?||304326667_?->304326668_?->304326669_?->304326670_?-> 503535640 X+RAMA*-><-?<-?<-?<-?<-?||?-><-InPase X+RAMA Dicty_REP - 532 bacteria>actinobacteria Cellulomonas fimi hypothetical protein [Cellulomonas fimi]. 
<-503535633_?<-503535634_?||753798085_?->503535636_?->503535637_?->753797675_?->503535639_?->503535640_X+RAMA*-><-503535641_?<-503535642_?<-503535643_?<-503535644_?<-753798086_?||503535646_?-><-503535647_InPase 502643352 <-RAMA* RAMA PAT1 - 521 bacteria>actinobacteria Xylanimonas cellulosilytica hypothetical protein [Xylanimonas cellulosilytica]. <-502643345_?||502643346_?->502643347_?->502643348_?-><-502643349_?||502643350_?->502643351_?-><-502643352_RAMA*<-502643353_?||502643354_?->502643355_?->502643356_?-><-502643357_?||502643358_?-><-502643359_? 365257398 X+RAMA*-><-?||?->?->?->?-><-InPase X+RAMA Med3 HMPREF0045_01501 490 bacteria>actinobacteria Actinomyces graevenitzii C83 hypothetical protein HMPREF0045_01501 [Actinomyces graevenitzii C83]. 365257391_?->365257392_?->365257393_?->365257394_?-><-365257395_?||365257396_?->365257397_?->365257398_X+RAMA*-><-365257399_?||365257400_?->365257401_?->365257402_?->365257403_?-><-365257404_InPase<-365257405_? 515761987 X+RAMA*->?->?-><-?<-?<-?||?-><-InPase X+RAMA - - 460 bacteria>actinobacteria Actinomyces massiliensis hypothetical protein [Actinomyces massiliensis]. 515761982_?->515761983_?->496008053_?->515761985_?->515761986_?->657869467_?->657869468_?->515761987_X+RAMA*->515761988_?->515761989_?-><-496004859_?<-496004863_?<-515761991_?||515761992_?-><-496005322_InPase 585093370 HSDRN+HSDR-REase+RAMA*-> HSDRN+HSDR-REase+RAMA - KUTG_05615 435 bacteria>actinobacteria Kutzneria sp. 744 type I restriction enzyme R protein [Kutzneria sp. 744]. 585093363_?->585093364_?-><-585093365_?<-585093366_?<-585093367_?||585093368_?->585093369_?->585093370_HSDRN+HSDR-REase+RAMA*-><-585093371_?<-585093372_?<-585093373_?<-585093374_?<-585093375_?||585093376_?->585093377_?-> 759995468 RAMA*-> RAMA - - 420 bacteria>actinobacteria Nocardia vulneris hypothetical protein [Nocardia vulneris]. 
<-759995508_?<-759995453_?||759995456_?->759995459_?->759995462_?->759995465_?->759995513_?->759995468_RAMA*->759995471_?-><-759995474_?<-759995477_?<-759995480_?<-759995483_?<-759995486_?<-759995489_? 269787547 <-HSDRN+HSDR-REase+RAMA* HSDRN+HSDR-REase+RAMA HSDR_N_2 Sthe_2269 394 bacteria>chloroflexi Sphaerobacter thermophilus DSM 20745 protein of unknown function DUF450 [Sphaerobacter thermophilus DSM 20745]. 269787540_?->269787541_?->269787542_?->269787543_?->269787544_?->269787545_?-><-269787546_?<-269787547_HSDRN+HSDR-REase+RAMA*<-269787548_?||269787549_?-><-269787550_?||269787551_?->269787552_?->269787553_?-><-269787554_? 485098347 <-HNH+RAMA*<-?<-N6-MTase HNH+RAMA HNH SFUL_6650 394 bacteria>actinobacteria Streptomyces fulvissimus DSM 40593 hypothetical protein SFUL_6650 [Streptomyces fulvissimus DSM 40593]. 485098340_?->485098341_?-><-485098342_?||485098343_?-><-485098344_?<-485098345_?||485098346_?-><-485098347_HNH+RAMA*<-485098348_?<-485098349_N6-MTase<-485098350_?<-485098351_?<-485098352_?<-485098353_?<-485098354_? 601042837 X+RAMA*-> X+RAMA - N866_19315 375 bacteria>actinobacteria Actinotalea ferrariae CF5-4 oxidoreductase [Actinotalea ferrariae CF5-4]. <-601042830_?<-601042831_?||601042832_?->601042833_?->601042834_?-><-601042835_?||601042836_?->601042837_X+RAMA*-><-601042838_?<-601042839_?<-601042840_?<-601042841_? 655991010 HSDRN+HSDR-REase+RAMA*-> HSDRN+HSDR-REase+RAMA HSDR_N - 355 bacteria>proteobacteria>alphaproteobacteria Salinarimonas rosea hypothetical protein [Salinarimonas rosea]. <-655991005_?<-759845134_?<-655991006_?<-759845135_?<-759845067_?||655991008_?-><-655991009_?||655991010_HSDRN+HSDR-REase+RAMA*-><-655991011_?||759845068_?->759845136_?->655991012_?->759845137_?->655991013_?-><-655991014_? 343968291 URI+RAMA*-> URI+RAMA GIY-YIG+DUF4357 l13_07900 324 bacteria>proteobacteria>betaproteobacteria Neisseria weaveri ATCC 51223 hypothetical protein l13_07900 [Neisseria weaveri ATCC 51223]. 
343968284_?->343968285_?-><-343968286_?||343968287_?->343968288_?->343968289_?->343968290_?->343968291_URI+RAMA*->343968292_?->343968293_?-><-343968294_?<-343968295_?||343968296_?->343968297_?->343968298_?-> 494634130 <-X+RAMA*<-?<-N6-MTase X+RAMA DUF4357 - 306 bacteria>firmicutes Megasphaera sp. UPII 135-E methionine sulfoxide reductase [Megasphaera sp. UPII 135-E]. <-494634068_?||494634072_?->738304214_?-><-494634127_?<-494634108_?||738304246_?-><-738304216_?<-494634130_X+RAMA*<-494634144_?<-494634151_N6-MTase<-494634114_?<-494634162_?<-738304218_?<-738304248_?<-494634096_? 703488257 <-RAMA* RAMA DUF4357 - 298 bacteria>actinobacteria Saccharothrix sp. NRRL B-16314 hypothetical protein [Saccharothrix sp. NRRL B-16314]. 703488342_?->703488252_?->703488253_?->703488254_?->703488255_?->703488343_?-><-703488256_?<-703488257_RAMA*||703488344_?->703488345_?->703488346_?-><-703488258_?<-703488259_?<-703488347_?<-703488260_? 522152436 RAMA*-> RAMA - - 285 bacteria>actinobacteria Amycolatopsis benzoatilytica hypothetical protein [Amycolatopsis benzoatilytica]. <-654457843_?<-522152430_?||522152431_?->522152432_?->522152433_?->654457844_?->522152435_?->522152436_RAMA*->522152437_?->522152438_?->522152439_?->522152440_?->522152441_?-><-522152442_?||736166120_?-> 219621053 Bifidobacterium - SP BLA_0918 280 bacteria>actinobacteria Bifidobacterium animalis subsp. lactis AD011 conserved hypothetical protein [Bifidobacterium animalis subsp. lactis AD011]. subsp. 502483234 URI+RAMA*-> URI+RAMA DUF4357 - 277 bacteria>actinobacteria Cryptobacterium curtum hypothetical protein [Cryptobacterium curtum]. 502483227_?->502483228_?->502483229_?->502483230_?->502483231_?->752554187_?->502483233_?->502483234_URI+RAMA*->502483235_?->502483236_?->502483237_?->502483239_?->502483240_?-><-752554188_?||752554189_?-> 380878131 <-URI+RAMA* URI+RAMA DUF4357 Thi970DRAFT_03846 275 bacteria>proteobacteria>gammaproteobacteria Thiorhodovibrio sp. 
970 LOW QUALITY PROTEIN: hypothetical protein Thi970DRAFT_03846 [Thiorhodovibrio sp. 970]. 380878124_?-><-380878125_?<-380878126_?||380878127_?->380878128_?->380878129_?-><-380878130_?<-380878131_URI+RAMA*<-380878132_?||380878133_?-><-380878134_?<-380878135_?||380878136_?->380878137_?->380878138_?-> 503259250 <-URI+RAMA*<-?<-REase URI+RAMA GIY-YIG - 273 bacteria>actinobacteria Intrasporangium calvum excinuclease ABC subunit C [Intrasporangium calvum]. 503259243_?->503259244_?->503259245_?->503259246_?->503259247_?->752639935_?->752639937_?-><-503259250_URI+RAMA*<-503259251_?<-503259252_REase||752638883_?-><-503259255_?<-752639938_?||752638885_?->752639940_?-> 353541191 RAMA*-> RAMA - FJSC11DRAFT_3600 167 bacteria>cyanobacteria Fischerella sp. JSC-11 hypothetical protein FJSC11DRAFT_3600 [Fischerella sp. JSC-11]. 353541184_?-><-353541185_?<-353541186_?<-353541187_?<-353541188_?<-353541189_?||353541190_?->353541191_RAMA*->353541192_?->353541193_?->353541194_?->353541195_?->353541196_?->353541197_?->353541198_?-> 516723200 <-RAMA* RAMA - - 156 bacteria>proteobacteria>alphaproteobacteria Martelella mediterranea hypothetical protein [Martelella mediterranea]. 516723193_?-><-516723194_?||516723195_?->516723196_?->516723197_?->703369724_?->516723199_?-><-516723200_RAMA*<-516723201_?||703369699_?-><-703369731_?<-516723204_?<-648481875_?<-516723207_?<-516723208_? 736354164 <-RAMA* RAMA SeqA - 153 bacteria>firmicutes Dehalobacter sp. FTH1 hypothetical protein [Dehalobacter sp. FTH1]. <-521977782_?<-521977783_?||521977784_?-><-736355130_?||521977786_?->648464136_?->521977788_?-><-736354164_RAMA*<-736355132_?||648464137_?->736355134_?->521977793_?-><-736355137_?<-736355139_?||521977797_?->Back to Contents
%* ppp1r14a_Xenopus_laevis_147898606 -------------DVEKW-ID-EQMEE-LYLGREVD-------MPDE-VNIDDLLDL----ETDEDRRRSLQVILK-----SCTNN-----TEVFIRELLLRLK-GLQKQTLLKKNGLEVSSEE------------------------------------------------------------ gigyf2_Xenopus_laevis_148223517 ------GANKCQDDFTQW-CE-KTMHA-INTAHSLD-------VPTF-VSF--LREV----ESPYEVHDYVCAYLG-----DTPEA-----KD-FSKQFIER-R-TKQKTSQHRPQ----QDVA-W---VTCQTSQANSQPITLEAVQCAGRKKKKQ--------------------------- AT5G42950_Arabidopsis_thaliana_15239132 ----AVTKLTEANGFRDW-CKSECLRL-LGSEDTSV-------LEFC-LKL-----------SRSEAETLLIENLG-----SRDPD-----HK-FIDKFLNY-K-DLLPSEVVEIA--F-QSKG---SGVGT---------------------------------------------------- NEMVEDRAFT_v1g247227_Nematostella_vectensis_156360915 ----A---AASADDFTQW-CE-TTLRS-M-KATGVD-------IPTF-IMF--LKEV----ESPYEIHDYVKSYVG-----DTKEA-----RD-FAKEFIEK-R-KKDKHRSAPT-----PSPP------------------------------------------------------------ NEMVEDRAFT_v1g180626_Nematostella_vectensis_156399329 -----------------W-CV-DELSK-I-TDCGEE-------ITDY------ILHM----DNIEDVKEYLGGFLG-----QENPK-----QIEFLNILVQR-L-NEINPEFERAG--T------W----------VRKEKLEKETSPTKGN-------------------------------- GIGYF2_Homo_sapiens_156766045 ------GVNKAQDGFTQW-CE-QMLHA-LNTANNLD-------VPTF-VSF--LKEV----ESPYEVHDYIRAYLG-----DTSEA-----KE-FAKQFLER-R-AKQKAN------------------------------------------------------------------------- CHLREDRAFT_189453_Chlamydomonas_reinhardtii_159468295 -------ANAAPAPVPSW-CR-GKMVE-FFGNDDLT-------LVAF-L----YSSC----TSRSEVADYCQEYMR-----GKPNV-----ST-FVAEFLKR-K-DADVAAR------------------------------------------------------------------------ MONBRDRAFT_33655_Monosiga_brevicollis_MX1_167533317 MQVGT-SLAQVQPSFLAW-CK-KELVK-I-VNTEVD-------PETF-VSF--LLDI----GSTSEVIDYLSEMSS-----KPDQV-----RK-FAHEFMKN-R-VSAMSGTVAKG----IVKP--------------KPPMDDS--------------------------------------- PHYPADRAFT_167456_Physcomitrella_patens_168039262 
-------------ALRQW-CE-AQLKK-LSADEDVT-------NVDF-SIVDFCISL----PSISEASDYLSQYLGTLPGVTQQHI-----QA-FRKEFVRR-K-EQLPSDGNASS----SDNE-Y------------SDALFGSNGKKVDR-------------------------------- CC1G_01669_Coprinopsis_cinerea_okayama7#130_169844537 ----P---VSPSHEFLKW-LS-ESLKG-LNSSVNVE-------EIIS-MLL--SFSL----DPDPTTIEIISDTIY-ASS-TTLDG-----RR-FAAEFVSR-R-KADAANRKGPN----AAKG-------------PTKPISIAEVVKA---------------------------------- trip4_Xenopus_(Silurana)_tropicalis_187607944 ----------MAADLLGW-CV-EELEKRFGLGVSED-------VVKY------ILSI----DKEEEIDEYINDLVQ-----APEDT-----KSLFTRELKLR-W-HRIRQPPASRT----------------------TSAFQRKDG------------------------------------- SCRG_02797_Saccharomyces_cerevisiae_RM11-1a_190408675 M-----NNVSPRQEFIKW-CK-SQMKL----NSGIT-------NNNV-LEL--LLSL----PTGPESKELIQETIY-ANS-DVMDG-----RR-FATEFIKR-R-VACEKQGDDPL--S------W------------------------------NEALALSGNDDDGWE--FQVVSKKKGR- LOC100158827_Acyrthosiphon_pisum_193643347 ---------MSKQEFTQWICD-KLTAL-LQFEVQND-------MAEY------IISM----QAERDIDEYCNSLLD-I---KSQVH-----KQ-FLTDLKKK-H-RIFNQKPTSSV--Q-NTRP-------------PKKNTKSSENEDPQK-------------------------------- TRIADDRAFT_57541_Trichoplax_adhaerens_196007084 --S-N-TAYNPENALIQW-CE-RRLAG---IETTID-------IPTF-ARF--ISSL----GSPAEVRDYLKMYLG-----STEEV-----KN-FFNDYVRK-R-RECESQQFPKA--S-PGNM---KETTT---------------------------------------------------- PPP1R14B_Homo_sapiens_20162550 -------------NLEEW-IL-EQLTR-LYDCQEEE-------IPELEIDVDELLDM----ESDDARAARVKELLV-----DCYKP-----TEAFISGLLDK-I-RGMQKLSTPQK--K----------------------------------------------------------------- PHATR_43835_Phaeodactylum_tricornutum_CCAP_1055/1_219113419 -------GAKMSPSFEKW-CK-DQVYK-LNGTDDLT-------LVAF------CMTL----QDPGEIRQYLSAYLG-----STPQV-----NS-FATEFINR-K-A------------------------------------------------------------------------------ PHATRDRAFT_19937_Phaeodactylum_tricornutum_CCAP_1055/1_219118084 
-------PSTNKTETIQW-CS-DALHD-LLGFADTA-------LASY------LVSV----AKKATQSSEIVQILV--DG-DVRDVTPERMER-FAEQLLSH-A-RPTPKQSHGGP----ASRQ---AKA------------------------------------------------------ PHATRDRAFT_46184_Phaeodactylum_tricornutum_CCAP_1055/1_219119804 -------------------LN-SKFAQ-I-LGFEDG-------VDDV-VDH--LLTI----DSKEDLSDYLSQLLG---S-LSVEG-----KK-FVDDIEKF-K-RGVPIEPMILV--V-PDKA--------------EKSEQTHDLP------------------------------------ AT1G32490_Arabidopsis_thaliana_22329903 ---------MASNDLKTW-VS-DKLMM-LLGYSQAA-------VVNY-L----IAMA----KKTKSPTELVGELVD-YGF-SSSGD-T---RS-FAEEIFAR---------------------------------------------------------------------------------- Dmel_CG11710_Drosophila_melanogaster_24643521 --------------MENF-LR-GTLSK---CLDCVI-------TDQM-LAA--ILNI----KDDYEFDNYFGNLLS---E-DNEEH-----RM-FLVNC----R-RMLLSGKQPRN----NGKY-L------------SPPLAPTSPNCPKQ-------------------------------- PPL_07241_Polysphondylium_pallidum_PN500_281206218 MAQ-V-NEPLTWESLERW-TT-EKLQK-MLGFPPDE-------IIRY------IFAA----ESNSDIDNYLVDLLG-----NTKKT-----RT-FIEQLTKK-I-NSLPRRIVKP--------------------------------------------------------------------- PPL_04211_Polysphondylium_pallidum_PN500_281208347 ----V-QAPQPKADFIKW-CH-QQLKP-LTNMDVAT-------VTEL------LCSL----KTENEIRECAKECLG-----YSSEV-----TN-FINDYLMA-R-GDEPGLQFEAS----------------------SP-------------------------------------------- NAEGRDRAFT_57247_Naegleria_gruberi_strain_NEG-M_290995819 --F---GQADVSQEFKDW-FK-KGLKK-LNKSVDPS-------FMYF------LLSL----NSEKETIDYMSEYIG-----NSSAA-----QK-FAQEFIAN-K-SFENPNQGANK----KK-------------------------------------------------------------- PITG_01508_Phytophthora_infestans_T30-4_301120876 A-F---GSNNVSSEFMTW-AL-RHLKA-IDSNADVT-------LLEY------CATL----EDPGEVREYLAAYLG-----STPRV-----SA-FATEFIQR-K-KTQHSGKKSTG--N-QDAQ--------------QRASETGSSNKRGK--R----------------------------- SELMODRAFT_438614_Selaginella_moellendorffii_302760897 
--S-T-GPNAEAKAFRQW-CE-SQMKK-LTGNDDMT-------LLEF------CLSL----PSSAEAGEYLTQYLG-----STANV-----QA-FKSELLQR-K-ELLPVEALRSV---------F-------TV---SDAVISEDWKQASR------GQGQ---------------------- SELMODRAFT_442631_Selaginella_moellendorffii_302786242 -------------SFRSW-CE-TQIKE-LTGSSDMR-------QLDY------CVSL----PSVLEAERSLTQYLG-----ESSDA-----QA-FKGEFLRC-R-ELMTPQMLQVF----NSGP--------------KSDTKVEG-------------------------------------- VOLCADRAFT_92322_Volvox_carteri_f_nagariensis_302840495 ---------AAEREVRTW-VQ-DQLHR-LLGFADAN-------VAAF------LISI----ARKHTSADSLFTDLK-RSC-NLPNT-SDV-QS-FAAELLRR---------------------------------------------------------------------------------- VOLCADRAFT_108378_Volvox_carteri_f_nagariensis_302854783 ------GEIQMSAEFRNW-CR-SKMVE-FFGNDDLS-------LVHF-L----LT-V----NSRSEVADYCQVYMR-----GKPNV-----ST-FVADFLKR-K-DAELARQ------------------------------------------------------------------------ EAI_14891_Harpegnathos_saltator_307207345 ------QNTAKTDEFTQW-CT-KALNG-L--QASVD-------IPTF-VGF--LRDI----ESAYEVKEYVRVYLG-----DTKQS-----TE-FAKQFLEK-R-SKWRSAQRPQA----QADD-L-CKP--------APAVNPNA-------------------------------------- GSOID_T00013439001_Oikopleura_dioica_313227523 ------------DPLLSW-AR-VELEL-IPNSNTVD-------VPTF-VSF--LREV----EHDYEVEDYSRQFFG-----NGKEV-----LN-FAHKFMEK-R-KQIRNEGPAKK----------------------SKGKKKNKNKATAD---------------------LLGFTPAAGDQ GSOID_T00030811001_Oikopleura_dioica_313240030 --------------MRQW-IA-HQMEL-LLGFSDAG-------LISA------IENQ----KSESELRKYCGELLG--DQ-----------KS-FVDELVKR-K-FGNSASQNNNQ----GGNV---SSGNS---------------------------------------------------- TRIP4_Homo_sapiens_32189376 MAV---AGAVSGEPLVHW-CT-QQLRKTFGLDVSEE-------IIQY------VLSI----ESAEEIREYVTDLLQ-----GNEGK-K---GQ-FIEELITK-WQKNDQELISDPL----QQCF--------------KKDEILDGQKSGDHL------------------------------- DICPUDRAFT_150853_Dictyostelium_purpureum_330797588 
-------PGQARPDFIKW-CH-QQLKA-LTNNDVTV-------ITEL------LCSL----KTESEIRECAKECLG-----YTSDV-----DN-FINDYLLA-R-SDEPGLTFESS----------------------SPVITIPIKKQPKSQTA------------------STTASKQKKKK LOC100637876_Amphimedon_queenslandica_340373604 ----F-TGASPDEAFTDW-CK-RELDK---YNSDVD-------AVTF-VSL--LLEV----ESTYEVHDYIKSFLG-----ETDEV-----HS-FAKEFLER-R-QKIRNYKQTKT----QQQS--------------KKPSAGS--------------------------------------- RTG_02889_Rhodotorula_glutinis_ATCC_204091_342319142 ----------MPTSLETW-VS-DNLLV-LLGASDST-------TTAY------FITL----AQSSPSASALVQTLT--QN-GLSDS-AQT-RR-FAADLFDR-A-PRKTSKRAEQN----AADA--------------RRKAEKEKKQL----------------------------------- GIGYF2_Gallus_gallus_356460922 ------GVNKAQDGFTQW-CE-QMLHA-LNTANNLD-------VPTF-VSF--LKEV----ESPYEVHDYIRAYLG-----DTPEA-----KE-FAKQFLER-R-AKQKASQQRQQ----QQQQDS---------------------------------------------------------- dZ221I3.1-001_Danio_rerio_37606133 --------------------------------------------------Y--ILSI----DNADEIVEYVGDLLQ-----GTEGN-K---QE-FVDELVQR-W-QKCQTQTSEGL----GGVL--------------RKEAVMEELDTAPK------------------------DTQKKSKR RO3G_00766_Rhizopus_delemar_RA_99-880_384483882 ----------PSEDFRRW-CR-KALRG---LNSGVN-------EDEI-LDM--LLMF----PVDSSTAEIIEDVIY---A-NSVRI-DG--KR-FAQEFMRR-R-KADLAGRSD---------------------------------------------------------------------- LOC100889908_Strongylocentrotus_purpuratus_390357703 ----F-QSNPPSSEFGQW-CE-MELKR---MRPPVD-------VPTF-VSF--LQDI----DSPYEVHEYVKMYLG-----DTPES-----SN-FARQFLER-R-SRQRDQQRQQK--E-VESA-W--------L---GNKSSLP--------------------------------------- LOC581973_Strongylocentrotus_purpuratus_390357928 --------MAVSPSLVTW-CS-EELSL---LLASET-------TDEF-VSY--LLAI----RDDQELREYLSQLLD-----SSSKK-N---AE-FVEELMRR-R-NSGPALPQNFT--V------Y------------RKSTTQEESKK----------------------------------- CGI_10008333_Crassostrea_gigas_405951638 
--------AAASVSFEDW-LG-ERLTS---LNPEVD-------TEVF-VTY--ITGILETETDEEDTRESILGILG-EVL-EEEKE-----TV-MCDEIVQR-W-NQEHSKQAKAE----TDTS--------------TLATQLSEILENQK-------------------------------- LOC101238576_Hydra_vulgaris_449668186 ----F-QAPTPSSDFINW-VE-RNLPA---VKKAID-------VPTV-VSF--LQDI----ESPYEIHDYMRSYLG-----DCNEQ-----KE-FVREFLER-R-KKTWQQQKQRS----PTVP------------------------------------------------------------ CELE_C18H9.3_Caenorhabditis_elegans_453231778 ----------ATDELQQW-FV-KRFQQ---FSTQVD-------SSTL-FDC--IMSL----ENPNEVEDIVMSYLD-----ESKTV-----KE-FVREFIKR-R-IAMRAAGGRPD----------------------ADDLTSARTAAAAP-------------------------------- LOC100186532_Ciona_intestinalis_459185799 -------MASGTMSLSQW-VN-KELVK-LLGIAETD-------VEDL-SSY--LVTI----DNPEELTTFVTELLS-ENG-TESGL-DGPKKQ-FIRDLLGR-W-ERVR--------------------------------------------------------------------------- DFA_02695_Dictyostelium_fasciculatum_470247412 ------------SDFIKW-CH-QQLKP-LTNMDVAT-------VTEL------LCSL----KKESEIRECARECLG-----FTTEV-----NN-FINDYLMA-R-SDEPTLAYESS----------------------SPFITVPKGKTPNV------------------------KNAPKGKK CAOG_06042_Capsaspora_owczarzaki_ATCC_30864_470297466 ------------PDFMEW-CQ-HKLRT---FNAGID-------AESF-VSI--LNAF----DSKQDVTDYAHEFLG-----RSSEV-----AA-FAREFCER-R-RPFEVVAGKRT--A-HAAP---------AP---APAAAPTQGKKKAQ-------------------------------- ACA1_074550_Acanthamoeba_castellanii_str_Neff_470519016 -----------------W-VM-KRLKH-LLGFDEVD-------GL---VDN--LMKL----QSAEEIEKYIKDILG-----TERKA-V---QK-FTEDLIQK---RKDASPLFRTV----PIRK-L------------STAGPSGQGPSEPN-------------------------------- GSTEN:00026621:G:001_Tetraodon_nigroviridis_47209798 -------------DVEKW-ID-DSLDQ-LYSGQEDD-------MPEE-VNIDDLIDL----PSDEERVRKLRELLQ-----NCNNN-----TESFVTELVAR---------------------------------------------------------------------------------- GSTEN:00018164:G:001_Tetraodon_nigroviridis_47230723 
------------DALLKW-CV-DQLHYKFGLEASED-------IVQY------ILSI----EKAEEIEEYVGDLLQ-----GTDRK-K---EP-FIEELLIR-W-EKSRKQTTDNN--L------F-LFTEL-VP---SSEITKDAQKKSKR-------------------------------- TRIUR3_01899_Triticum_urartu_474124787 -------MATSASTSGEW-LE-GALQE-LRGRTGSALELDDGLISGL-VS---FCEL----APPPDAADYLANIVG---A-EAAQD--------LIQEYLQR-R-GYIDPSKGAGS--S-QSSN---LQPYL------KPSADAATAQTKKQ-------------------------------- TRIP4_Gallus_gallus_513200543 MA--------APGVLLDW-CV-RRLRGDFGLDVGEE-------VVRY------ILSI----TSEDEIREYVVDLLQ-----GTEGR-K---GR-FVEELLSR-W-QQSSQSPAEPL----PAYR--------------KKDETSESPRAGDQ-------------------------------- CAOG_06042_Capsaspora_owczarzaki_ATCC_30864_514484474 ------------PDFMEW-CQ-HKLRT---FNAGID-------AESF-VSI--LNAF----DSKQDVTDYAHEFLG-----RSSEV-----AA-FAREFCER-R-RPFEVVAGKRT--A-HAAP---------AP---APAAAPTQGKKKAQ-------------------------------- PTSG_00883_Salpingoeca_rosetta_514701569 -----------RKEFMGW-CK-KEIGA-LTSDVDAE----T--LVKV------LLEI----PQPEDVIDFVTEQLG-----ERARK--------FSKAFLQK-R-ATAFEGDAGQV----------------------LPTVDDDSWQP----------------------------------- SPOG_00659_Schizosaccharomyces_cryophilus_OY26_528316225 ---------MPSSALEQW-TK-DNVLK-LLPLDEES-AV-L--VAQT------ALAA----ENADDAKTHWISLLG-----ESQET-----IE-FVSEFNRR-R-FAASFSQKDAI----------------------SKKLESNRPSSYSA-------------------------------- Gasu_19130_Galdieria_sulphuraria_545710223 ----S-FGANIPEDLKKW-CE-EQFSE-LVSSQDVT-------LAEF------LASL----NTREEIREYAIIYLG-----NSKKT-----ED-FVEEFVRR---LQFEFESQKVT----PGSS------------G-SQAFKSGRRRKK---------------------------------- GUITHDRAFT_122423_Guillardia_theta_CCMP2712_551630891 ------------DSFSKW-CF-KELEK-LTGSDDTT-------LGEF------LMSL----HSSSDVQEYVEEYLG-----PKGRS--------FAEEFVLR-K-QMDSVEVVSSR--G-GSKE--------------EANLQAKKKKK----------------------------------- EAH_00020370_Eimeria_acervulina_557125805 
-------PKRAAAQLQEW----NCLLA--ECELPLD-VP----ILEY------LATM----ENPLEVEEFLLESFP-----SHKNL-----RL-FAENFVMT-N-DKYNKRPQDDK----GGPQ--------------ASAASWEAIQGKGR-------------------------------- PVAR5_6530_Byssochlamys_spectabilis_No_5_557723410 ---------MPATDLVAW-AA-PRLSQ-LLPLDDES-------LTQI-ITY--SASL-----SKDACAEHLKNLLG-----DSPAA-----LE-FISSFNSR-R-GGGETSTQSGI----ASPG--------------AGARDGTERQTNED-----------------------RNVQKKNKR LOC102564933_Alligator_mississippiensis_564240896 -------------DTEKW-ID-GCLEE-LYRGREAE-------MPDE-VNIDELLEL----DTDEERARKLQGILR-----SCGNS-----TEDFVRELLLKLR-GLQKQQALQQPS---PDEQ------------------------------------------------------------ L345_07652_Ophiophagus_hannah_565314478 ----------MAAALVSW-CT-GELRGTFGLDVSEE-------IIQY------ILSI----DDEEEIREYVTDLIQ-----GRDGQ-K---KY-FIDELVAR-W-KQSRSTTSDLL----LLYQ--------------KKDDILDTPRPGDQ-------------------------------- AND_002481_Anopheles_darlingi_568257448 -------------SLQGW-IK-EELSK-CLCFEIPD----A--MISY------ICNI----REGCEIDEYFRTLLD-F---KNPEH-----VK-FLGELKRR-M-GTRPGANNQQA--S-------------------KQTAPAPSGKQKKQ-------------------------------- LOC102654890_Apis_mellifera_571571334 --------------MEDW-IC-ENLSQ-ILDFPVTN----E--IVQY------MIQI----QNERDLDDYMRSFLD-Y---TNGKH-----RQ-FITDFKKQ-Q-VKAISALIKDQ--A----------------------------------------------------------------- BATDEDRAFT_36451_Batrachochytrium_dendrobatidis_JAM81_575473160 ------QATNGSAALVQW-CR-VALRG-VQRTSTTV----N--VDEF-ITM--LNSI-NV-KESATITMICDDTLG-----GSTAI-DP--RK-FADEYIRR-R-QAEASGTHWSA----QSSA---ES------------------------------------------------------- BATDEDRAFT_88748_Batrachochytrium_dendrobatidis_JAM81_575480014 ----------MSTKLEDW-AT-NEVIR-LLGHSLPR-NE----VHQL-ITY--SLTL----DTKDEAANYFQDLLG-----TTEES-----LE-FISEFLTK-R-FPPLQTVGAWS----STAT--------------ASRKEKQASIQREQ-------------------------------- WALSEDRAFT_69827_Wallemia_sebi_CBS_63366_588260673 
-------PGAPSEEFIKW-CK-GALVG----LNGTT-------SDEL-LPI--LLSF----DIEKPDLELIQDMIY-ASS-STMDG-----RR-FAGEFAKK-R-K--EDSKGVSL--S------S------------------------------AADIVKQQQAPKTLQETFKVVQKKKKR- H779_YJM993P00176_Saccharomyces_cerevisiae_YJM993_628229990 ------ASVSKRQEFLRW-CR-SQLKL----NTGVQ-------PDNV-LEM--LLSL----PPGSESKEIIADTIY-SYS-STMDG-----RR-FATDFIKK-R-LECEEEINDPL--S------W------------------------------SEVLAMPEGSSEDWE--FQVVGKKKGKR _Danio_rerio_62960123 -------------DVEKW-ID-EALDK-LYEGKVED-------MPEE-VNIDDLLDL----PSDEARTHRLQALLQ-----SCSSN-----TEAFIAELLQK---------------------------------------------------------------------------------- gigyf1_Anolis_carolinensis_637369140 --------PPPQDGFTQW-CE-QMLHA-LNTSSNLD-------VPTA-VAF--LKEV----ESPYEVHDCIRSYLG-----DTLEA-----KE-FAKQFLER-R-AKQRANQQRQQ----QQEASW---------------------------------------------------------- DDB_G0279309_Dictyostelium_discoideum_AX4_66815539 -------PGQARPDFIKW-CH-QQLKA-LTSNDVSV-------ITEL------LCSM----KTESEIRECAQECLG-----YSSEV-----NI-FLNEYLLA-R-SDEPGLAFESS----------------------SPVITIPAKKTNKS--S------------------QTNPSKTKKKK DDB_G0269884_Dictyostelium_discoideum_AX4_66826047 --------PMSYEEIEKW-TI-EKMDK-MLGVDSKE-------MAKY------VLSM----DTNSEIENYLADVLG-----NTKKV-----QT-FIEQLIKK---------------------------------------------------------------------------------- BM_Bm1959a_Brugia_malayi_671413957 ----------MADFLEQW-VN-DELYT-LVGCSDRT-------AVQY------ILAL--A-RKSIDAEDLLGRLRS--TD-TMEDT-PAV-RK-FISELIAR---VPHAAAKREKV----IQPS--------------AAELRAKEI------------------------------------- BM_Bm2316_Brugia_malayi_671418067 ------SVTSSGNALTSW-MI-NRVKQ-LNPQVDAD----V--FAAF------IEGV----DNPNEVEDYIIGYLG-----ESRLV-----KE-LIREFLER-R-SQARHKKEPVD----KDDL--------------THPAQAADA------------------------------------- LOTGIDRAFT_162501_Lottia_gigantea_676463843 ---------MAAPTTEEW-IC-QELAK-FGIETTPE-------NASY------ILSM----DNNQDLEDYMNDLLD---K-SDPKV-----RI-FVQELLRR---------------------------------------------------------------------------------- 
OT_ostta09g03850_Ostreococcus_tauri_693498597 ----F-PPLNNKQALRAW-CK-AQMSQ-LNNSDDMT-------LVDF------LLGL----PSAGEVQEYVALYLG-----KTPQA-----NA-FATELIRQ-K-RADPS-------------------------------------------------------------------------- LOC100184186_Ciona_intestinalis_699243562 ---------AQTDPFVSW-CD-TEIKK-LPSAVNLD-------IPTF-VAF--LRDV----ESPHEVKDYVASYLG-----ESKPA-----RD-FAEAFLQQ-R-------------------------------------------------------------------------------- AFUA_6G02200_Aspergillus_fumigatus_Af293_70984701 ---------MSNSNLVAW-AV-PRLAQ-LLPLDEES-------LTQI-ITY--SAGL-----PKEEGAEHLKNLLG-----DSPAA-----FE-FIASFSAR-R-DQTQAQTRSTV----PSPV--------------RGGEEQSAA------------------------------------- CELE_F55C10.5_Caenorhabditis_elegans_71995966 --------------VENW-IE-TEVTK-LFNGNETN----N--VDID-LDV--IQDI----EDVTGKRKFAFEQLQ-KAH-CPCSM-DKI-IM-FLDELIIQ-L-NTL---------------------------------------------------------------------------- GE21DRAFT_1337754_Neurospora_crassa_725976398 --------------------Q-QQLSR-LLPLPDED-------LKQV-LDY--ASTL-----SKTEAIDHFTNLLG-----DSPAV-----ID-FISTFNAR-R-ADPKAPPAPSS--A-ARTP--------------SAPSSAQNS------------------------------------- RMATCC62417_11189_Rhizopus_microsporus_727141261 -----------MSTLDTW-AQ-DKLSV-FLGFDPET----I--RSQV-LPY--LMST----QTPEAFGERLMEMVG-----LSEDA-----LK-FIEEFTER-R-FHPERQQNTTT--V-VASG---SN------------------------------------------------------- RMATCC62417_06585_Rhizopus_microsporus_727147058 ----------PSEDFRRW-CR-KALRG---LNSGVN-------EDEI-LEM--LLSF----PVDGSSAEIIEDVIY---A-NSLRI-DG--KR-FAQEFMRR-R-KADIAGRTD---------------------------------------------------------------------- NCU09657_Neurospora_crassa_OR74A_85091072 ------AKNVAMEEFKKW-LH-RELSR---GLNGVN----D--IETF-AST--LLEL------PLDVSILSECVYG--FS-TTMDG-----RH-FAEEFVRR-R-KLADKGIVEKD----SNTG--------------AMSSSNGGWSEVAK--K-G--------------------------- GIGYF1_Homo_sapiens_92087055 
--------PRPQDGFTQW-CE-QMLHT-LSATGSLD-------VPMA-VAI--LKEV----ESPYDVHDYIRSCLG-----DTLEA-----KE-FAKQFLER-R-AKQKASQQRQQ----QQEA-WLSSASLQTA------------------------------------------------- consensus/100% ..............................................................................................h...h..................................................................................... consensus/95% .................W......h...............................h...........p.h...h...................F..ph..................................................................................... consensus/90% .................W.h....h............................h..h........p..p.h...l...................Fh.ph..p.................................................................................. consensus/85% ..............h.pW.h....h..........p..........h......h..h.....p..p..p.h..hl.......p...........Fhpphh.p.................................................................................. consensus/80% ..............h.pW.h..ppl.......s..s.........ph......l.ph.....s..-..chh..hl.......s...........Fhpphh.+.b................................................................................ consensus/75% ..............h.pW.hp.pplp..h...ss.s.......h.ph......l.sl.....s..-h.chh.phLs.....ps.p......pp.Fhpchhb+.b................................................................................ consensus/70% ..............h.pW.hp.ppLp..h...ss.s.......hsph......lhsl....ps..-hp-hl.phLs.....ss.ps.....pp.Fhp-hlb+.b................................................................................Back to Contents
>gi|Lgig1000010006|ref|jgi|Lotgi1|156824|fgenesh2_pg.C_sca_12000135 MIPDTIIYIPNFISQEEEQKLIDHVYSAPKPKWTHLSNRRLQNWGGLPHPKGMVAEDIPQWLDLYCDKIGKLDLFEGKKPNHVLVNEYSPGQGIMPHEDGPLFYPTVTTISLGSSTVLDFYTHINQGKAEDKSEPCSENMSKKFEDRHVCSVYLEPRSLVIVKDDMYTKYLHGIKEQTEDMVDDRICNIKQCSDINIDNIKTRQTRISLTIRLVPKVLKTKLFFGKK >gi|Fcyl1000050013|ref|jgi|Fracy1|223976|fgenesh2_pm.2_#_126 ; gi|Fcyl1000106017|ref|jgi|Fracy1|180489|e_gw1.2.1691.1 ; gi|Fcyl1000088756|ref|jgi|Fracy1|163097|gw1.2.1691.1 MNDESMKKKSRRYEKGHWDAVINLYKELFNRIRQQLAEHHLTDYYDDQESIHWLPCHAIDLKKDGELNAHVDSVRFSGDLVAGLSLLSPSIMRLIPCDDNDDDDNKNSENSTKDEEPYYVDMFLPPRSLYVLTGVGRYKYSHQLLPDGSIFHKTDTDIVVRRDHRLSVIFRDSKQPSS >gi|Ttra1000010051|ref|AMSG_11934T0 | AMSG_11934 | Thecamonas trahens ATCC 50062 hypothetical protein (935 aa) MTFRYTVPLFLDLILPLFIMSSIATSAPRSSSSSAASSASSASTSPAKSYNQRKVHNGKLYSGMRVGSTHRWNYSDAALEVSEMKAPAGDSSLGVLTPMQSQLAYEAVKSRLNAAPIGSGAAVNTIYHWFLVTELAFGDASDDGKVTSATLRGPKFKLAHKRPHWRKFSAEYSGNTPVPRKRIECLESLLGDAPPREAVTSAAKAAALPSYQASELLDGVAALKLGTCDTSWIETKVGPDEWEVAVNYGLTAAPRALAGARFLVVADQFATKLNANQYATQLAGAMYYLGDGSLATPAANALKIKIVSALTSALKSELPALKPAQPSPPPPKPLTAFWQAVGTPMPVMPPPKKVSRKRPREPEEKLVHPDVAVGPIVRPAELTKRAKTTSAGADADILARLKHELSPSAGILYAPGAVPADQADALFDAMATTVPWGAKRWRGSVLPRLVWHYQEGVVPALDMLLCSISSAFGVSAIKGAFCNLYRNGEDHTPYHADEYGADVLSISLGATRKFHFKPKGVTGAAATARRITYDLAHGDVLIFDESVNARYLHTVPKMKAVTAARINVTVFAVRDAPPPPPPALAHPSALEPTRPRSPPPPFVDDDALAAAAAAATSPLPTHETNMTANPALSDHPAAAPANNTTLGQPGFVSGYGSAHGNATGPVVGSYVAPSGPATVQVQEAEAFENVVVRVPRAEATIVRRGPPSVRTQVVPGPTFVKVVPEYVAGPVRTIKEVIPGPVTYRDEKVEVPGPVRERIERVEVPGPVVERVVHDYVPGAVVEKEVKVEIPGPVTYKDVKVQVPGPVIRRPVPKQVQVPGPVRVVKQRVPVPQPHIREVTVKRKVAVPKPVNVEIPGPTREILVHKRDNLLEAENARLRAEIAFLQSIPSAPHFEVLARREIKQQQQQFGGYPQQFGGYPQQQTYGSAYPGYGF >gi|Falb1000000067|ref|H696_00080T0 | H696_00080 | Fonticula alba ATCC 38817 (V2) hypothetical protein (377 aa) 
MLRAPIGLAPRTLGLALSSATRRSVSSSVAVSAHNREALAPVEVDGLAMFDVGAALLAEGCHPGAFLRGGALVRATLPAIRDLFLSNTPDATGRRSPSPPLVADGRWLEDHFGPTRALGALPAGSTAGGGISGLVGWDVALDGLLAAGELLRLQQASYSRSHFDSVIHHYRETIVRALGRRLDRGDADVAPGPDVSQRDRQLEALARIERIMHQTIVRSFAPGDSLAHMAEGNFLPLHTLDLRADGHIQPHVDNLSASGRLVAGVSLLSGRVVRFEQMYTDARPRESVKPSERRVLDVLLPPGSFYMQRDKLRYDYTHAILPVTPGQETVWPEGAGQRVVPLEPARRIVFLVRDRLNATGPGPGVGAGGKVLEGTY >gi|Sarc1000010122|ref|SARC_09892T0 | SARC_09892 | Sphaeroforma arctica JP610 hypothetical protein (129 aa) MQVWEFTIDLFAAAHNAHTAKFYTKEQNALLQKWADEINAWANPPWELVPDVLDKVIEKATSITICVPHYPNASWFPKLMSLLEQDPMIVKNMNNTFLQGGTTARGKTPWGVTLVAKIGTKSPYLTKK >gi|Ccor1000000123|ref|jgi|Conco1|67245|fgenesh1_pg.3_#_43 MILNKNQSKKRLQSYQTFHKLNKISDYASDRNSLFSAEPTKYLVLTNIGYGGVGGIKPQELNTILNNLEINGFELICKNGKPFSYLIFNNIQNSIESYKKLNLIELKELNKLIYCEYLKFNPIKLSNTNDKQDNINGLVLINDFLTIEEELELVKNIEDDVTNNWSIVQNRFVKHYGFKFDYNTNSFGSSNNEMPIWSTKLLQKLYKITSDSEVINMDQLTVSKYPKGTGIPPHVDAHTPFGHTILSISLLSSTQMEFSNPETKLQYSTMLNPRSALVMSGESRYGWEHCIKERKFDLNEKGELVDRGERISLTYRRTNPTLDCNCQFGYLCNRK >gi|Psoj1000010133|ref|137293 MDPLEAILASMAAATGNASPPKSNAAAAAPATTKSAGKKRERESESETPPVSSSTPPSGIWNPQQQQPLDAATLRKLEDAARARSVNPAMEIARQQTVRALCNKIRRASEDLGIGKLPNSAYETWQFTSQLKVKELDPLIPHAGSDYSGLFEELRKAGATKSGATKKCKELTRESERLLRKFGQQDFVAGKKKRVHVAAAEDGMRQLTYGNSTVKLSAAHFAKLREMYARKQGLGGDGSSMAPKDQRSFESALFCLLLRYDSLDGGGFQAALNEECFDVLLKEFDCKMECFASPLNCRYSRFCSAFLDTDCAFGSVGSFFDFSPRSGCFEANPPFIPKVIKRMADHMTALLNAADGPLAFIVIIPAWQDTEGWQQLNSSRYNQTHLLIPQKQHGYCEGKQQIRKTRWRIASFDTSVFFWQNSKACNKWPVTEKKLDSLKSAFKSKQADERDALGLRKSGKRVRSAKD >gi|Sarc1000000137|ref|SARC_00134T0 | SARC_00134 | Sphaeroforma arctica JP610 hypothetical protein (556 aa) 
MEGKKQFHGNKRPQNTPLWDSTVKTSYKNALGLSRVKSSSDKLAVMPLNEMERWLAVKKLRQSVRQACGVPALLAYERWCARSSLPAAFIKDDDDDDGESGAKEDVGSGAMATVVSYLVPNSDPEKGLQKDLMRLGGLSEEDAQEAAAKCVALGVRLNAKLNDTTVEGSGKGSETAVQGDDDMENEGVITDAATSTDTNGSDVVANKDTEPNNTPAKKKKADVKNKLRSIGVRVEGDDDDTHSTHQAQVEERGALLAYSLRTKKKPFFSLSRPHAAKLRSLYARTRGGKWVDKEGSDNDRFVDAVFCVLARYDALGGAGYQAALNEASFDVLKDKMRVDCECFASPLNCRYGQFCSAFPDTDSPFGSLGSFFDFYPSKGSFEMNPPFVPEVLCAAAEHANALLSLTKEPLSFVVVVPAWKEVRMWQVLSNSAYNKHEPLILTASNHGYCDGQQHQRRPSERYRVSSYDTAVFFLQNDAGAKKWPVSEAIRNELVESMHKAVGSAKTVQELEGRYRPNKEGDDPNMEGGDAGDGSRKRKGGRVFLLQKKRKANEDQ >gi|Smin1000020134|ref|symbB.v1.2.017755.t1|scaffold1389.1|size122275|4 MERRILLEHTATVKSCASLVQWKQCLDILQQLKEEEQSPDVILLTSIVTACGRATEGSHAVDVFEELKRYRIEATVISYSAVINACSKCSDWWRALSYFDDAWDTMGDSTSKEAVSFLTTASISACGRAVQWLIALEILEEVSKRSMESTSIFAQNAAISSNHQAKPLPEWIQALELFGHANDSDVVTYTSVISACQKVARWQEAIELFVTMEKQMRPDLVCFGAVISALEKVGQWEKAFEFFRMLESQDVSLLPNLIIFNALMSACEKAGQWQRALHLLDLLKSTTTDVTAYDVVTYNALISACEKGQQWRKALEVLEEARSLLKIDVISGNAVLSACAAASRWVQSLQFLEVLLCNKLLLDLIGRTAIISSCAKVRGWQKALQFLGSCAATEVTYNAALSSMEGTVRWKESLEVVAAMRQCKMSPNILNYGIAIGACDEAMASPFHSTKWKLFIAAGVGAVCGFCLARRSPRGDGDEEEKNSILLGYCQHHPTKRGCGALHAASTMKERTGGRKMMPKEAPNGRREVICEDALEWIEKQGHFPSGSMVFTSLPDMSEVVEFAPRFEDWEDFFMKAVRHILTALPYGSVAAFYQTDVRLPTEGQVSKAFLVLKAAEAVPEARGFGGGCVKV >gi|Pram1000000142|ref|84939 MLSRPLRSFAPLRRLLSSSAIAANEPTSLWQDVYNLDATHCHDPLVSEGDLQVVLDVITEDEEQVVADECSRILRRRRYEEDHWDNAIVKFKEMERSRWSTETQRILQKVREAAILPKELKYFPAVHVIELAEDGYIKPHVDSIKFSGRVVAGINLLSPSIMRFKEEHGDSIIDAYLQRRSMYMMTGRVRYHYTHEILPGAQVFKGQVPVNRTHRISIMLRDEFLEEHVTKYHTPYVKLDVVAQ >gi|Lgig1000020166|ref|jgi|Lotgi1|228010|estExt_fgenesh2_pg.C_sca_10395 MYEYNEDITENDQFKPTYKYYKAKIPEPDFSNVLDFQAISESEKPDERVTNFQLQIPTGNINHINGFRPVHDWKAYSINDNPGFIFVLNPFKDGYQKYWVHRCLNDYPDAKNKANLDIQLDSEKRAKLWKNFVETDPDSIHKNDEIMGLRWTTLGYHYVWNNKEYHKERRTDMPEDLEELTKFVAQTIGYPNYCPEAGIVNYYHLDSTLGGHTDHAEFEQGAPLISFSFGQDCIFLLGGLTKETKPTAMFLRSGDICVMWKQSRLAYHGIPKILAPKLTHYPLSLNSEYVSESASDEAQYGNIHEINKKIEKTLQNLNWKPYKLYMSCSRINLNTRQVEGENKFFPEK >gi|Sarc1000000227|ref|SARC_00221T0 | 
SARC_00221 | Sphaeroforma arctica JP610 hypothetical protein (158 aa) MPTLSTQPLRTPMMMTCLDFVLTLVIFLTKMPRTSPPPPPLSTPLLPIDTPTNKSPPNFPTQDIQPPTKMTYDEYDRNNGHRSRKTTLRMALKDNVTLTHPPPPGVTTEDWMLNKNDWASAMTHLDFTPTIDRCASQTNSQLPRFAGPEGGEVNDFR >gi|Crev1000000227|ref|jgi|Coere1|78980|fgenesh1_kg.1_#_172_#_isotig04968 MLNTVRNSCRIPHMVNTTVQRRHFIEAGRKSRVTKPSCGDIPQPKEPINLASPASAKPLIKYSDAFIKHGYSKGDIYLHPEFIDTEEHDLLVRSCNKKLRRLANNYEQGHFDKRIHNYRECTVSAWLPNKLGVAGHIAKAMGHHIDEDPIDLPDRAPTKKSSGWSSIGKHDGQIRHILERVWGLFPPHLAWLPPHILDLHEDGEILPHIDNPEYSGFVVGGLCLLGSAVSTFKHANDPSIRVDVLLPPGSLYFMTNHIRYQFTHEITANPEQRAWGGQPIPKAHRISLMFRDAKEPVGGWGSALTTGCATTTMASAKSDHGI >gi|Hrob1000010247|ref|jgi|Helro1|123161 GLKPVNTWKGYTIANHPGLILLTDVMTSNLQKHLVKRCLNDLVKQPNKTNLDSYTKEEDIDNIFINFSNLLKKLRWITMGYHHNWDTKLYAEENKSEMPQDISELAECIAKVVGFTEFTAEAGIVNVYHKNSSLAGHVDNSEYDFSSPIISLSLGCTCIFLIGGPDKTSEPTALYLRSGDVVVMSGEARLCYHAVPCIITSSYTCLNDKSTTDCHEKCFDSNENNSKINSESYTMDNDFDNDWHLYENFLKESRININTRQV >gi|Bnat1000020289|ref|jgi|Bigna1|25145|gw1.85.31.1 RFNFCLLNLYRNGSDYMGWHCDDEREMEGPIASVSLGETRDFVFKEKANRTSKHFLELESGSLLVMNEETQKLFLHSLPRRAKVQNCRINLTFR >gi|Fcyl1000040299|ref|jgi|Fracy1|258165|fgenesh2_pg.113_#_6 MKAKRYGSDTKLSNVISFLRRFGWLHVTVFAVVTVSAFSGKTGTVGHRRNKKSPSVQQIIALDNENNEYLINDNNKQLWKEGLRNLVSCLASSNWKASPKALETAVEGVKAASVISRLVSSDYLISSNNDNNNGKVWWEPLVEKLHEEADDQLVRMIQPHQLSGIKFSIDCIQLSSSTKQDASDLLSSHRQQYLLPQSLQIAYDNLNLPFSVRPGFLNGNDDDDDDDVDNKHNNNLFTVASFVKQVKFQIETIQTATNRTVAERRQTAWEGDEHVENFEYSEKSMRRLPWSDVVANVRDRLYNETSHYYDGCLLNFYPDGDSAMRYHIDPDQGVLWDYETAVVSIGATRRFSFRESSSGNGSNKPHVFVLMNGDVTEMFNDCQERFQHTVQKSSVKGESASRFEP >gi|Fcyl1000110306|ref|jgi|Fracy1|184778|e_gw1.5.156.1 ; gi|Fcyl1000066930|ref|jgi|Fracy1|139687|gw1.5.156.1 
MSTSKNVNAAAKTKTKTKKNDASSKTTTKISVFKKACIRHKRRGNDPNRIDDLLVGSGGCADDSIIDVKWIHYLQQRSRRKRRKHPQQQHSTKKKDNDDDGNCDSDNDSKNTTSDNGGRFNKDLVFLIDHHKRSTSTTSTSTTAEDATDTDTTTPTPPRCYGFHEYPGVYIYPNALSEEVQLQLSYEAVTKYCEHSSTNSNSSSSSSPNASSPNAYRTNTDLLPPKTNEQINDTTNNNISETMWNLWKQEQEQSDSSLSSTTTTTTTNYYRRFSKLSWSTMGYHYDWNKRQYHPDQKLIVIPSLVTKISKYFASASLLYNNQNNIDNYSLTNAPLIPPPTGTGTGTGENNISFIPSASIVNYYTEKSNFGGHRDDLEHINAMDKPIVSISTGLPAIFILGGYTIKDEYNYEDNKNENDNDGNENENENDENHPQPHPVRAILIRPGDVLIMGGPSRLNYHAVARIVPYEAIIKYDNTLFGSDDENENNENNNGNNNENNTNDNAITDEKKYLKRYLKDHRININVRQVYPDQK >gi|Bcir1000010321|ref|jgi|Bacci1|200768|e_gw1.291.36.1 MHPSHISKSLSVLPTIKRTLSTVSATPTERRPSVTQSEASSVQSDDTYVIDNRYTAQELLDIAEEHLFGRNGKAIDRAYAIACLQESAINMRCAGAQAVLGFCFEFGIGVPIHFEAAEQYYLMSIKTVLTGLDLQGEKSQTLSIDNVSLSSATLLGITRLAFLRKYGRPGVHINRIEAEYWESKIQQRGLEAIAWIQRAAIYDQCSASQYCLGVCYHDGIAVPKNEYKAFKWYRLSAEQGNCRGQGILGYCYGEGFGVEKSEATAIDWYRRAAAQGETVAIYNIGYCYEDGIGVEKDAVEAVKWYKLAAEKGNAFAQNSLGYCYEDGIGLQLNKNFAAYWYRRSAEQGYPWAQCNLGYCHQNGIGTEKDTIAGAYWYSRAARQGHARAQHNLGFCYQNGIGVNKNFKLAFEWYSKSAHQGNVFAFHSLGYCYQNGLGIEINHTEAVKWYLRSAEHKHAPAQLSLGFCYRNGMGVEKDEAKAAKWFELAAHQGNPLAQNSLGFCYEEGLGVSKNVKLAVHWYIKAAKQNNPWAQCNLGFCYASGIGVMEDTTKAVYWYRKAAQQNHARAQDKLGVHLQAGIGCRQNMGLAVRYFRLAANGGQVSAQYHLALCYEKGLGVEMNLQEALVWYERAASSGCRNSYEHLRQLLLRYCLENSSAVETLQRSDAHGETRNCSHRLGWISGFAAPAA >gi|Sarc1000000340|ref|SARC_00333T0 | SARC_00333 | Sphaeroforma arctica JP610 hypothetical protein (291 aa) MLHSRFYQEAERRFGKFTVDRFASAHNTHTDKYYTKIHNALQQKWAQEVNAWANPPWKLIPDILDEIIEEATSITICAPYYPNATWFPKFLSLLEEDPMLVEKTNNTFLKEGKKACGKTPWGVTLVVKIGVKCPYLDSTREQRAIHVAASEPVHSGDAPQATEYETRVNLIEQYHKLRIYTLEEPIHKLEAMDMVGLISNNMYTSSLTDAYHVSKIRYTKKEFHPLQSITATFPGDRVQVDTIGRATIGFQQRAHICVSMARCTLIRMSYYSDDGQMSGYYSTHASTNHV >gi|Sarc1000000358|ref|SARC_00350T0 | SARC_00350 | Sphaeroforma arctica JP610 hypothetical protein (335 aa) 
MKDKKAYEYLAKSGAEISSTPTRAVCLVNAGNQQGIAVDDFEAACSLLGKIAYLLNFPGKTYCVVIYEGLECAIDAHKALAGRACTLLKRTYSKEASIPLLVEYLIEDSIQLPVPATVQDARAVPGLVVVENFITAEEESTLKQCLDRYKWEPLSRRRVQHHGIRFNYSTNRHADDSMPGFPQEVQDIIIGKLRRMKSLNPVDEVSPVEPVPVPDQLTVNEYKPGAGISAHVDTHSPFRGAIISVSLCGRVVMEFKHRDGRAMSVLVPARSLVAFTGEVRYDWTHAITETRFDYVDNSFRERQFRISLTFRETKSTPCDCNYPHLCDSVGSTQL >gi|Crev1000000444|ref|jgi|Coere1|13168|estExt_Genemark1.C_20086 MQQQTAFRRAEKKHKAQHQLPNLSDVIDVSAIDASDGAAVRRLCLTHDMCQPSVLPFQPFCQDRPPAYTLRDHPGLIVIPNPFTAEAQRWIARKCLCDCTRPPNHTNLDPFFDLPSQSLFSLASGPTNNKPAGALTRVEAQHMVASRVQSDSIDPSVSKPKGKIYMQSAPAADLLERLRWCTLGQQYNWTTKEYDLGTSVFDCELNALMRSIAEAITNPAYAACNRSEDWPPINKYEGSEFVSQAGIINYYHERASMAGHVDKTEESMDAPLISLSIGLSCIYLIGGPTRDTEPTPLLLRSGDILAMCGDSRLAFHGVPRVLSDTAPECLTLPNAGDNDSVAARYPNWHNFATYLSTHRINCNARKCM >gi|Lhya1000010458|ref|jgi|Lichy1|232292|estExt_fgenesh1_pm.C_2000009 MIRKTGRITDPDPALYDTRILGGRGVQAYVYFMWICLQRCDPHNGELCLITPSQWLVLEFARHLRAWIWEHCELLQLFQLEPYKVWPRVQTDSLIFRLRMRGTRPPNLNTHTLFLRHTARRATLENILAAYATFNPHQQPPSSSTDIAYKYTPTHDRSRIQNSPNASFAFLSPSTSLTGELAHLTHSLSRLCDGPGAPLVFHRGPNTHPVYALVVRTQWARDYFGPHCCSRWLRPAFYWSGKAAGTHDPESIFWHLRDTQRLARKETSPAEAYAPFYAPDANYSLLLVDKEGADALESTLDKDDARLYEYLQAARLALQPTREERKVTWCHYNQCGTDVAVKIVHPINCGYFTKSQPRQRFFVDRHQLCVTNQCIILLWITQLEHGTILST >gi|Vcar1000000459|ref|jgi|Volca1|78621|estExt_Genewise1Plus.C_10079 MDIEEPTRNPPTVDCSHCVLRSVFLRLQKASGRVFAIDASCNVRSDNSLCPIFACPDSFTSHNLSGQHIWCNAPADRAIPWLNNYSTFKQRTPDTTSAVILVPKCAHLEKEFQTRGWTLLKEFTKNSSIFSEPKPGGGRTRSPNCSGEFQAWLDPCQE >gi|Uram1000000474|ref|jgi|Umbra1|218622|fgenesh1_kg.2_#_546_#_combest_scaffold_2_37280 
MCEACFELASLDLIQDELPLVLDEEADHTYAASIEDYSHKPYLTREALSSTKFDDMKSNAVRLASYAPVEHQLFSLSFDSTYFDIPGRAPRWASHSGTDYHGTWLPQTVRRAILRHTRKDDRILSNFLGRGTDAIECFLLQRRCCGVDINPAAVSLSQRSCSFETPPGLTTAEHRPIIVQADSRKLTGALFADESYDHVLSHPPYKDCVAYSLHIEGDLSRYTNPLDFQEQYDKCVRESWRLLKMDRRLTLGIGDNREHCFYIPVGFQLIRLYINNGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIATFRKVPRNSNDKMDSFDASNTKSQMRITYTCREIPRSPIARKSVVMGTVWLFKPSSRHSFAQLCTSRMVERFGKDESNWEHVELEIISDNDDSSLKPSLANIATDSMVVEDEGLSISSYEIERQKRIDENRLALLQLVSSLSTPFIHINDDLTYLFCMAGVDIRPQ >gi|Lhya1000000544|ref|jgi|Lichy1|202942|estExt_Genemark1.C_30112 MSDIDWEDLFGVSDHDNDDSNSEQHVDNAYSIPGLKLEKNALTHEQQMQLVYAIAEADYFQGGKLDQAMCFGDLGPFEWVESWIRNEYPNLIPSRILKRACLFDQAIINLYHKGQGIKSHVDLMRFDDGIVIISLLSSCVMVMKPVDSETLQRRGTIPILLRPGDVLALSGPARYDWEHGIEERMADEVNGEWIERGTRISVTLRKLLVK >gi|Sarc1000000559|ref|SARC_00546T0 | SARC_00546 | Sphaeroforma arctica JP610 hypothetical protein (181 aa) MSQPHNPSGITLIDRILQQRVSFAWPSELPDPDNTLPMPADCDDNSDNEQRFTLPVAGCVPASSEKRTQRLHPAWVRRVMEKWHGVAIDLFANRRNAQIPYYASDDPCGGTWGGDAFTIPLDEDFGWACPPRVSGFWFLHYLQSCDKASAAMVVPVWSGATWFPLFLSDVGGCASSSTAY >gi|Fcyl1000020581|ref|jgi|Fracy1|238447|fgenesh2_pg.5_#_951 MSTSKNVNAAAKTKTKTKKNDASSKTTTKISVFKKACIRHKRRGNDPNRIDDLLVGSGGCADDSIIDVKWIHYLQQRSRRKRRKHPQQQHSTKKKDNDDDGNCDSDNDSKNTTSDNGGRFNKDLVFLIDHHKRSTSTTSTSTTAEDATDTDTTTPTPPRCYGFHEYPGVYIYPNALSEEVQLQLSYEAVTKYCEHSSTNSNSSSSSSPNASSPNAYRTNTDLLPPKTNEQINDTTNNNISETMWNLWKQEQEQSDSSLSSSSSPPPTKKKKTATTTTTPTTTATPIVVASTTTTTTTNYYRRFSKLSWSTMGYHYDWNKRQYHPDQKLIVIPSLVTKISKYFASASLLYNNQNNIDNYSLTNAPLIPPPTGTGTGTGENNISFIPSASIVNYYTEKSNFGGHRDDLEHINAMDKPIVSISTGLPAIFILGGYTIKDEYNYEDNKNENDNDGNENENENDENHPQPHPVRAILIRPGDVLIMGGPSRLNYHAVARIVPYEAIIKYDNTLFGSDDENENNENNNGNNNENNTNDNAITNSDEKKYLKRYLKDHRININVLQFNIVEAGDMLIQEALFGWTKDRAATGSN >gi|Fcyl1000110682|ref|jgi|Fracy1|185154|e_gw1.6.1329.1 ; gi|Fcyl1000087436|ref|jgi|Fracy1|161704|gw1.6.1329.1 
MVVSWELKNDNISNSTNKNINSGELFLDYATVTKKSKAKQQGEYEKGEASRPDCTSTTDHVYVPGLVVVENFLTEAEEELLVAILTGPQAPWAPQQSNMSQTGSVKRRVQHYGYVFDYRTADVLRRDVEEESGRLDPDANCPPLPSIPVEKGMEKQTNDNDLLMIFSDLNQMTLNQYKTGEGIGSHVDTPSAFGDGLISISLSSGIVMEFQKVTVGNDDDGNKVSPKNIKKLVYLPRRSLVLMSGACRYEWEHHIVTRRTDTHNGVVIPRGLRVSLTLR >gi|Fcyl1000030685|ref|jgi|Fracy1|248551|fgenesh2_pg.24_#_31 MKEDAVTFNYVINDKSKGLHVKVFRRYQNQIQSDYWWNTILDNIDWYRVKYKSDRFQKNCETPCWTTFFGGRKEYTPYQDIPDWLQPLVNQVSSDLKVPTTKAFNAILVRLYFDGNDEIAWHTDGRTFLGNTPTIASLSFGSKANFQMRRMTNVWPSVNRNVVNNYWHHRVPKEKGRRPRININFRYINPGKDAERGQKTYYKYMVHGDDDKKSLKSYSYKSILAMRGGIMNFISSSSLRPNANNNIKKILTEFNDAATATAATNIGGVDGNNSNDCANKKRRVDDKRKNNDNNNNNNNNNGKPSSSSSSSNHNHNHNHKTGNNSYDVPTAGIVQKYNNGNDEVDVINDSMNSTDDATTTTTTTTSTSCSSSTTTQQYYYLSASENNNIDKSAFMALPDDIRKELINEWWKKKYISQRQGYAKTNTNTSTSTSTSTNTNMNTKLVVPNQPKVQQQDNTYNKKQKGESKEKIRSSAVAAARTDTLHSYFSTTKKK >gi|Bcir1000010688|ref|jgi|Bacci1|327341|estExt_fgenesh1_pm.C_3310003 MEDIPTHPKYEWSSTQEKRVFNMLDEKTKEYRKQKEDPFSWQLPCSVKFQQDNPNTGETTTNKQKTSKFGNFMGKLLLKSSTNASKIKKSVPKPLLVEKKDSTEMPLTGYFRFMKRRTHHSNTSEDNGFPKINFKFSTARVLGNENTQIVGLNTQTNIKPKYRLINGKRTRWDVVSPVANNSKDEDSSLDLKNKEVVLPQDQSQEPTSSTEKSDMREKCIDFDFQYKEDFNDDVVDDGSSLGSLQSDDDVLGPWIELGMFDSNVSETKKSDHFSSFYSVDNQQNLKSLDLPEIQCLDCKLRTAYDKKPLDISNLCLSCQNKWSDTLSNVFRKFEFAVLTQTCKEVTEKPKSKKVKVTNDNKKASVEKNDVLHKEKTGGEKLLPQKLKKSPPAIGIPPTKSNYTRKTNKKMATGSSKKNDIPKKTNATAKPIVVATPETNDDDDPPFGFCSNPRGLVYKQVVEVLNINGHWYRGTLELMDKRKVKVKYIDWDDQEEWVIIGSKRLRTIQLEDKESDQQTDQQMKSKGESTADEVSKNNPIYFRKIAAAVKSKEPDDYVSSTLDKDPTQIFNDNEVFMTRRLAQELVDEHGFMPNSFGYRRNRAVAVTFYTSSKQRKQKEESVGYLREMHKNQVRVWYPDLHQSEWLLVGSRRLRILTEEEEESILFDSSIDLDRQEVPKIQEIAQIENKIDEIPIINPPPKRSRGRPKKTLPTEVVEIATEEDTNNVYEPEQVPQSTIILEKKVTEEHGQGDEAKVSNFLTTGAFATRRAMRQLTDQSGFVPNPYGYTNNQAVEVLNTRSGKKKFWEFGRLVEMKPGKVRVHYEGWSDLYDEWIMVGSRRIRVAQEQIPQKEDNDEVIAAPVPKTNDLLMTELNPEIRDEVKRNKKHKILSAKDYQELGLLVNIEELAAKELRKKKLHEKKTEEMGTTVKVKAVSKTKSKKSEIGGDKYEDEHDDEDIDEGDLDNDYQDTVVKKRLKSASKFKRKVKKSKTKIAKQTPCEHHSPSPPPANDTQVISLRLAQARASNSQSFVANVYGYDYMQHVTVLHLDKKFYEGRLVSMRKNKIKVHYCGWLDAFDEYITCGSRRLQVIENDHE
VVCIEPNFKERYESMKSTGEPSLPEITPVNRIVRKRITLDDVCEEDSEGQREYHKEPSGEGEDEEELVEMDAWKVYCNQCNIVIKQFRYYCTYCETPSAGCDYHSFELCLRCFDQNFPFWHDHPRSSFAIQAVIDKEVGPMPIKGELVTVWEEDVLEESVNITNEDEEKNGEENIEPMFESKIDSVDASKVFSGDASITTDQGYKYLKRWKRRKVCAFCNDDDDTSNELGQFIGPFIIATFNKNGVEKKRSFWAHDSCARYSPEVFCTPEGKWYNVTLALRRGRGMRCYGCKEKGATIGCFESKCSKSFHLPCSQKPASYFKNGVIFWCQTHEAYYNKKDTYVNIFNCDGCSKKLEEETWFTCVQCATSYFSTFDLCVDCYEKFPADHRHGEEDFEETSLAILKEMEAQKATEAAREKEELRAANARKKKKSLFPRRRRKLPDGSTPVSCCYCGTYEAETWRKGYDGGVIMCNTCFELALLIDNDGDTNVTDMPLVVDNDGLQQRYVSSIEDYSHKPYFTREALSSTKFSDASTGRRLESYEPQPNQYFSLTFDSSYFDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKYTVKDERVLSNFLGRGTDAIECFLLQRRCCGIDINPAAVSLSQRNCCFEIPPGLTSAEYRPIVAQADARQLTGSLFGDESFHHVLSHPPYKDCVAYSTHIDGDLSRYTHIDDFKVEYNKVVKESWRLLKMSRRLTLGIGDNREHCFYIPVGFHLIRLYIDQGFELEELVIKRQRYCSAFGLGTYLCVQFDFLVFTHEFIGTFKKIPLENIDRMLIKNEEEEDSAERASHVRLTSMQRGVPSSAILRKSVVMGTVWVFRPTESFRFSQLCTSRMVERFGKDDNNWEHIELDFSFQDQPRCEQIESCHAETEKDQSIDEEESPLSEYEQQRLRRIEENNKTLLKLGLISELSEESNDVIHYENMMDKAPLEDGKLVLMITAHQTLAPCQINLYRKTIVQLAKDATKKLAHHGMLIIGTQDIRNNTSGKLWPMTMLVLEDIERAVDQSTLKLKEMVVTVPDGYSKNRKQNMDEQPDTEHNEEEIDIETVDDYVPIVHAVYLVFQRL >gi|Caps1000010719|ref|jgi|Capca1|156565|estExt_Genewise1.C_3640053 MTKAERRAFKKQMRSQHTLLRHENIISLLHQHLLIANGGLGNSVSRDMLEKVFKPCGSILDIVMVPGKPYSFVTFSDLSEAQSAVQSLQGTELPSSAASSEVPPVKLYLSFVKSVPGAEVASNILPAGLTLIQDFVSQEEEIELLKCIDWDYMDPQLKEDSKISLKHRRVKHFGFEFLYSTNNVDPDHPLDMGIPPECSPILQRMLSQQIILNLPDQLTVNQYQPGQGIPPHVDTHSAFEEELVSLSLGSQVVMDFKAPGGCHYPVFLPQRSLVVMRGESRYQLTHAIAPRKSDVVPLACLTKDNQMKLTLMARMERTSFTFRKVRNPPVCECDFPEQCDYQKLKKSNRPLTKPISTSPKELETEHVHQVYEEIATHFSDTRHTPWPRVAQFLNGLDPGSVVVDVGCGNGKYLGINPQLVMFGCDRSSGLTAISHERGHQVWVSDVLATPLRDGSVDAAICIAVIHHLSTQERRFQALVEMKRILRVGGKALIYVWARDQKRGNVASNYLKESSDLPRDEEIKSAVDAKDLKAELPTELSVHVNRTEFQQQDLLVPWKLKGKEEKQVFHRYYHVFEESELEELCQRMQDVEVNDVYYDQGNWCVIFTRIH >gi|Vcar1000010806|ref|jgi|Volca1|99194|fgenesh4_pg.C_scaffold_83000041 
MNGAGGTNDLGGSLDVAMASGSSGNPAGEAGAAAPPLAPVVPVPVAPVGQPVIPIAGAPAAGAIPNPAQPVAVPAAVAVAQLPDAQMVRNIPAPKLPVATPVEPDRIHAFVADVRDYFVLVGWQANIPAQKLFISGALEGFFKEWHITWTKSVPDYTPDQLLDAFLIRSAPEMYSRTHVARTTFYSATFKQELNELHLPLLLLWRHPLQYTLDLSRRRPSLPLDSRPTRVRAVVLEAVAGLVDVVVAQQAAWPDAMVQGVAPPRFVRRPDGNCLWSKYWQGLGRSVHESEVASIATTMGREFTLDACASDCGLSAVCNAFSCTARPFLDTNIAGHTVWMAPNAADLPAYVTHYRACKPLAPQSTAACILVPSGTEPSLLKGMKLVRRYPVGTSLFYVPDVQGSRALLPPITEVMEVWYDGPDSTEEIPACTAIGNAVPHLAVKISGSTFMAMLDSGATHSFVSEALVRLLHLHVLPSTFTYVRLADGGMSPIVGQPMLKVLSPLLGPYID >gi|Hrob1000010803|ref|jgi|Helro1|76748 MKQGVAVFENFASVEEENSILSEVEPYLKRLKYEDSHWDDAIRGFRETEKKTWTAKNRPLIERIQKTSFTPSCSILPHVHVLDLLPSGVIKAHVDSVKFCGDTITGLCLLTSCVMRFVNVDDQSKYADVLLPRYSLYYMKNAARYKFTHEVLPNDLSKFKGLQVQRSRRIVVLLRNKPVIKGDGDVDGGESDDVGSSYGGYSKS >gi|Vcar1000010818|ref|jgi|Volca1|77763|estExt_Genewise1.C_830117 MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFLDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA >gi|Caps1000000823|ref|jgi|Capca1|163497|estExt_Genewise1.C_120131 MGLLRFLSIPRHLGSLPIRLFTQSSLRSCPTLQTSNGLKPCETCITASDRATEEDIRGSFFVYKDFISEEEEQSLFDEVEPYLKRLHYEQDHWDDAIHGYRETERQQWTKKNRGIIQRVRDLAFPPGVPQLSYVHVLDLQKTGCVKAHIDSIKFCGSTIAGLSLLSNSVMRLVHDKTKSRVADIYAERRALYIMTGSSRYDYTHEILGESESFFEGKAVERDRRISIVCRNKP >gi|Fcyl1000110820|ref|jgi|Fracy1|185292|e_gw1.6.1356.1 ; gi|Fcyl1000110923|ref|jgi|Fracy1|185395|e_gw1.6.1355.1 ; gi|Fcyl1000088004|ref|jgi|Fracy1|162301|gw1.6.1355.1 ; gi|Fcyl1000088005|ref|jgi|Fracy1|162303|gw1.6.1356.1 MVVSWELKNDNISNSTNKNINSGELFLDYATVTKKSKAKQQGEYEKGEASRPDCTSTTDHVYVPGLVVVENFLTEAEEELLVAILTGPQAPWAPQQSNMSQTGSVKRRVQHYGYVFDYRTADVLRRDVEEESGRLDPDANCPPLPSIPKMIFSDLNQMTLNQYKTGEGIGSHVDTPSAFGDGLISISLSSGIVMEFQKVTVGNDDDGNKVSPKNIKKLVYLPRRSLVLMSGACRYEWEHHIVTRRTDTHNGVVIPRGLRVSLTLRTAL >gi|Vcar1000010860|ref|jgi|Volca1|99520|fgenesh4_pg.C_scaffold_89000033 
MASPARNPQHAQSAEADHEHYQLCQPQRGTWQGAAPRGDDYKKLVLEKLDQMSRRMDVIEAHVQAQAAAPPPPSTPIIPAEEGLNVQPTAAELLAITAPVQPAQASDQAPQPVPPSIVAPAAPAANCAPPAACEPAAAQAGHDSGIVGCKDVQSVSVQNANHRSLKNFRQPPCFTGVPSTSVTKPEIWFDTTVDYMTRSGSNPIMGMRYYVMGKAAEWYHDLFTHKGSMLTIQGMCDDFVLLFSDECATDANGNNWFAFDTLINFAQDSVTSHDLNGQHIWCSTPADRVIPWLNNYSTFKQRTPDTTRAVILVPKCIHLEKEFQTRGWTLLKEFTKNSRIFSEPKPGGGHTSTKAVTLVDSGAWHVFVSAALVQKLGLQPLTSHVAICVLGDGANSVRLQGQAVMPDKPDANWGTASFTDGTWFKGNKIIVPNDPQLRKDILHAYHDTPLAGHHGGLITSLLQTPSGNTAICVFVDKFSKMVSLVPTTDQLTTIGFAKMLVNHIICKHGRITALLTDCDPPFTAQAMRNTAKQLGVKQLMSSNSTLFFIAGTAAVRQGAEIGTQSWGP >gi|Bcir1000010874|ref|jgi|Bacci1|251514|estExt_Genewise1Plus.C_890072 MANKNQYNVLVDMEDAAYTQPATIESDGLEFQDFSGSSGMNNKSSYSQPAPPPPSNTTSFFDAPQPTNNNSNRGPIWSLDYYSRFFDVDTSQVIERCLKSMYPVGDFAADTLNNQPDLYGPFWISTTVVFSVFVCSSLAGSLAAYIAGQPHVYDFRTLSVAVFVIYMYGFFCPAAVWASTKYFGCQPSLLEIVNYYGYGLSIWIPVSLLCIIPNDIARWVFVGVAFTVTAVFLVKNLYIVISRADAKISRIILLAMLGAHVIFALILKQQKIWEKQKSENEKRKKQRKTYVNQEAFRSAERNFKSRNPPPDFSKVVDVTKEDQEHIIKVPLTQDLESLSRLFGDNQHSKTCQDAYVLKNVPGLIVIPNAMTAKAQRHLIKQCLSVYSLHPNISNLDTHYNIPESGLWSLFEKQKENTLKPEDALVYKKENKSLQQGGYSSDNDKDSDDSNSSAPPDELIKKLRWVTLGYQYDWLSKTYHPDKKYAFPQDIAELSKRVVKAIEGIGYTSEETSWRNEYKGSDFIAEAGLLNYYQYKDTLMGHVDRSEVNTEAPLVSLSLGNACIYLIGGPTKETVPIPLYLRSGDIIVMTGPCRKAYHGVPLIIEGTLPDYLDSQDGDADWAIFGEYMRTTRINLNIRQVNTSC >gi|Fcyl1000120875|ref|jgi|Fracy1|195347|e_gw1.24.318.1 ; gi|Fcyl1000088089|ref|jgi|Fracy1|162391|gw1.24.318.1 MNNNNKNNIEEINDKSKGLHVKVFRRYQNQIQSDYWWNTILDNIDWYRVKYKSDRFQKNCETPCWTTFFGGRKEYTPYQDIPDWLQPLVNQVSSDLKVPTTKAFNAILVRLYFDGNDEIAWHTDGRTFLGNTPTIASLSFGSKANFQMR >gi|Vcar1000000870|ref|jgi|Volca1|102807|estExt_fgenesh4_pg.C_10481 MASPGNTMRTQSAKADHERYQLRQPTAAAAEPARVTGPTAALQPQRGTRQDSFTSHNLSGQHIWCNAPADRAIPWLNNFSTFKQRAPDTTSAVILVPKCAHLEKEFQTRGWTLLKEFTKNSNIFSEPKPGGGRTRSPNCSGEFQAWLDPCQEKSKCSALEPLTENTAPLLPCNFTSTKAVALVDSGASHVFISATLVQKRGLQPLTSHVATCMLGDGANSARLQGQVHTSLRIHGFRCKIVAQVIPNYPPHSRRPEDGFAFKRGGM >gi|Vcar1000010947|ref|jgi|Volca1|106299|estExt_fgenesh4_pg.C_410059 
MLPPVANWSQFLSAFVIVIVIVIALLVIFVFVILQWLSSPQKPTATAPVHGLRNFRATRLSPPPSAPQWPNSLRPHGRPPNSPIGDLYAPLPTIAQPRPPHLNLLHYGPFTVDACCDDFGINAHVVPFFSPSRSFLSAQVDGECVWMVPPTSSPSAFISQYLESKTTNPRTSAIIVLPDRPTAPWAPLIRHMTVVRRFPAGARIVCRRDPSDASRSLYPLSSAA >gi|Lgig1000000949|ref|jgi|Lotgi1|74562|gw1.8.349.1 SDLRLKLKTAHPSKNSQRRRHSNENYDTMYSDSYEKISRYDRHKYQDYHLDELKKIHSGIQQRRLFTSAECAEVEKKIDDVVAKANRNEYKDNTVDRAPLRNKYFFGEGYTYGSQMERKGPGMERLYAKGEVDDVPKWIEKLVIKPLYDANIIPKDFINSAVINDYLPGGCIVSHIDPPHIFDRPIVSVSFFSDSALSFGCKFSFRPIRVSKPVLNLPIARGCVTLLSGYAADDITHCIRPQDVVSRRAVIILRR >gi|Psoj1000010940|ref|138234 MSSKAERIQAKRHQHAADEHELDLFLNAPQRAPLPCNPNAAPTAENGAPEASGANDQVPPRGSTDTGALLPGLVILKGFLSPQEQQELVDDSRRMGMGEGGFYKPTYASGAKCRLHQMCLGRHWNVKTEKYEQRRSNHDNAPVPPLPESWKKCAQRSLEAAREIDPQVMGTCKHMTPDICVVNFYKKAGRNGMHVDKDESDEAMSMGSPVISFSIGCAAEFAYIDHYPEPHEAVPIVRLGSGDALVFGGPARKVVHALTRVYNNTQPKWLRMRSGRLNLTFREYKPSELAC >gi|Fcyl1000020973|ref|jgi|Fracy1|238839|fgenesh2_pg.6_#_241 ; gi|Fcyl1000111282|ref|jgi|Fracy1|185754|e_gw1.6.1166.1 ; gi|Fcyl1000111321|ref|jgi|Fracy1|185793|e_gw1.6.1362.1 MPRHETTTTVATGLGSKVAKKQKVKAGKSNIWNPDVCYVSEKKECTTSSGKLVENFPDPTCSAEFMRMTSTNWLRKQLHKLRKSNKSAEDVEFMKPAVMAVERWFMRYSLDHPHRLAAGEDPVLPMPGADLEEIDPHFVDDLVKTGKDTQEAAVAVVKDFYALTRKAALDILKKEQALTKEEIVVVIHHRHSCDVMLAKNKEKLLKINPAHYEKLKTIYIAVQESSSLSLTSGKKKKKTKTTTTTTFDLAEFHRRLFCILARYHGIQGHGFQAACTEHVFDVLHGRFGVNMECFASPLNSRYASHCSAFPDTDACFGSLGSFFQFHPKHGSFEANPPFEPYVMLAMVNHMEVLFKKATGALSFIVVVPGWKESEAFQRLKASKWNKKSLPIAQKDHGFCDGAAHQRRDRYRISPYDTFFFFLQNDTAASKWPLTEIAINELRSAMACGVPSPSMQARHDKAGRGTDDIARGVYKGKKRKATGEGVMSRKLEETRLKKAGVKMAGGAKKYKRNT consensus/100% not found. 
>gi|Lhya1000001028|ref|jgi|Lichy1|233553|estExt_fgenesh1_pg.C_70030 MRNIDYGTQIPGLIVIEDFVTEQEEVALVTEVDNRTWCGLGVSPNPELKRRTQQYGHLFSYRYRKVLEKYGPLPEFTHAVVARIKENKLMPKEPDHLLVNEYNAGQGIMPHTDAPALFGPAILSLSLLSACVMKFTHVENGNSIDILLPRRSMLVMTGDARYLYKHSISKDLVESSSEGVTVHRDRRVSFTFREIIAWEVPAENPSCSCTKSCSNNK >gi|Bnat1000001029|ref|jgi|Bigna1|88467|estExt_fgenesh1_pg.C_320094 MEAEIIALTFALVVTFVALTLGWQFILGTPGTEEHVLGLCYEQGEGVEKDLKEAVWWYRKAAEKGLAKSQYQLGDCYDKGNGVEKDLKEAVRWYHKAAEQGHSAAQHQLGYCYKHGKGVEKDLKEAVKWYLKAVKWYRKAAEQGCAAAQNNLGDYYEHGKGVKKDLKEAVKWYRKAAEQGDAEAQNNLGHCYYVGEGVEEDLKEAVKWHRKAAEQGHAEAQNNLGHCYHVGEGVEKDLKEAVKWYRKAAEQGHAEAQNNLGHCYEHGKGVEKDLKEAVKWYRKAAEQGDAEAQNNLGHCYYVGEGVEEDLKEAVKWHRKAAEQGHAEAQNNLGHCYYVGEGVEEDLKEAVKWHRKAAEQGHAEAQNNLGHCYHVGEGVEKDLKEAVKWYRKAAEQGHAEAQNNLGHCYEHGKGVEKDLKEAVKWYRKAAEQGDAEAQNNLGCRYYVGKGVEGDWKEAVKWYRKAAEQGHAASQIELGWCYKYGEGVEKDLKEAVKWYREAAEQGNAEAQHNLGACYEHGNGVEKDWKEAVKWYRKAADQGHVEAQNNLGWCYKYGNGVEEDLKEAAKWYRKAAEQGHAASQTELGWCYKHGKGVEKDLKEAAKWYREAAEWYRENAERGCAEAQNNLGDCYRHGRGVEKDWKEAVKWYRKAADQGHVTAQKNLICRKSRATQTTMRW >gi|Smar1000001056|ref|SMAR012323-PA pep:novel scaffold:Smar1:JH432129:50962:52819:1 gene:SMAR012323 transcript:SMAR012323-RA MSLVETTAMTAAILTMQEQGIIKVVSPVFGQVLSPVFPVPKPDGSAPDPEASKIDAFAHFWSDFSYAFSPFSVISKVPWKVHQDKATILLLMPLWMTQPWFPRLLESLIATPVWFPAKDLLRLEHSPQEKPRLNDQLVLLGCKISGNPMLPKVFRKTLQSSSWTGGGQTLSPRTQREFEVEKLSGFTTDYCD >gi|Sarc1000001093|ref|SARC_01068T0 | SARC_01068 | Sphaeroforma arctica JP610 hypothetical protein (320 aa) MATTDNTYTFDTVVPKIRNHAINTSAVLRGPSLYGVASTRFQPRSNGTDTLKRPPIDARATRATQPAPSSSTTLPSYYTVQSLTLRATPQLDTALATTFSTALINSGADISIVTSTDHLTDLQSGTYTIELAGGSTTTAYTRGTILGLGPTLVLPTAAQPILSVRNLQCNGYTVTFPEINTPNTPGESHITYGTDTIQIVTEAAGRYQLNAFQAESLHLHHARFGHQSTRTTKAMALANNLPIKPPLTYCNDYQLAPDLFEDHVQKPFGPIDIDLFSSKHNKQVPTYCTEDFTDKDTHYHDAYKQNWHKPNKKLYGNQP >gi|Mcir1000011152|ref|jgi|Mucci2|115786|Genemark1.11549_g 
MQTATVNHQQQFKSKRQFKIWEKQQKENAAKRLNRALYANQSPFRYAERVYKSRVLSDADAREIVDFSNLSNNTTQVQSNIVQVPLKHDLGTLSNAFSADMSHNKSNYALIMKNVPGLIVIPDAFSPHAQRQLVKHCLRDYAKPPHTSNLDGHYHVSKDGIWPLYEQEKKGSLVPGDANYYVPIKTVPQDENDMYPTAIGDDDNNDAFSVASSMYSDISGKSAPPRAASSPTQMLKKQRWVTLGYQYHWGTKKYNLDDPIPIPSEIADLMKAVVTATEDIGCQDAEVPWKNQYKGADFRPEAGIINFYQLQSTLMGHVDQSEINMEAPLISMSLGHSCIYLIGGNTRDTKPIPLRLNSGDMLVMTAIARRAYHGVPRILENTLPEYMLPESVDDEDWKPFGDYMKTTRINLNIRQVFPK >gi|Lhya1000001164|ref|jgi|Lichy1|190365|gm1.1132_g MSNKNQYNVLVDMDEGGYTQPATIESDGLEFQDFSSNTGHHGSAPPPPPPPAASSTTNFFDTQESRRSGANKAIWSLDYYSQFFDVDTSQVIERCLKTLYPVGDFASDTLNNQPDLYGPFWIATTVVFAMFVCSSLAGSLAAYIADVPHQSDFRLLSYAVGVVYSYGFLCPALVWLATKYFGCQPSLLEIVNYYGYGLTVWIPVSILCVMPFDIARWVFVGVAALLTGYFLVKNLYIVISKTDAKTSRILLLAILEMQPSLANKHELMSSRRQRKILERQQLKAAQLKQDTKTYVNQSPFRYVERNFKSRVPPPDFSKVIDLHQHPNHDRVVPVGLACDLSCPLFEQKKPSRAYILHDIPGKHELISFMSNCSHLFNRQLIRECLSLYTRPPNTSNLDTHYAIPEQGLWNLCEKEHHGQLDPSFVVPRKTMEEWQLQTNDSEKHDPPPVESMPLLRPFELMHRMRWVTLGYQYHWPTKTYHFDKRYPMPALVDQLTSSIAYAVDGVGQEGVWKNTYRGQDFKAEAGVVNYYQYRDALMGHVDRSELNMDAPLISVSLGNSCIYVIGGTTQDTEPVPLALHSGDIVVMTKPCRKFFHGVPRIIEDTLPEYLSSPLSNTEEQDDDWELYSEFLKTSRININVRQVFPP >gi|Smar1000011252|ref|SMAR003465-PA pep:novel scaffold:Smar1:AFFK01018421:2875:3482:-1 gene:SMAR003465 transcript:SMAR003465-RA MDISVIHIPGKLNKIADFLSRDFTSSDGEWSLDTYTLNNLFSIFVTPEVDWFATRLNYKLPKFCAWGPDPMAWKLNTTNNPEDEQRPSRPADYRSPNLVCATAREPDRSTSTLCSKGQIVLGAQARGQAPITQQNDPAWMQDLREPLQAGGFSTHAADLYTASWRKGTVTSYTSGIKVAELS >gi|Vcar1000001335|ref|jgi|Volca1|89254|fgenesh4_pg.C_scaffold_12000015 MVPFRRTATPTPTAAAPHDPVNVARAPPPAEQVSHSVPVHSAVQQSAEPDTPPPPFNVPRLTQITSKPCRLYAAASMMGVHRVDSDWMLSCTIFLDLYSQYGLFTVDACCDDFGINAHIIPFFSPSCSFLSAQVDGLWFAYSLWMFWAVR >gi|Vcar1000001334|ref|jgi|Volca1|104096|estExt_fgenesh4_pg.C_120014 
MAVQYMHEAITSAVRAAMAELPPAPWSTAPQPHTGPYAPLPTIAQPRPPYSNLLHVSRDFPPFDPKDIRADIGGWDITMRHNLDMAGVAEDSPEAAKITLSVLRGTMGDSLRRLNADPATRFTSYHAVLQAVTPLAPDLIDLTLTISTGRDAANRIRDDPTPPRPRDHRRNDPMDTSAALVRLPQGTRTINVPAQLRHQRQDAGRCLQCGSEFHDKLSCPDLIHLTAPTLAAMTSTAAPTQTAAARPVPVNVAFKPPPAEQVSHSLPVHFAVQRSAESDTPPPPPFNVSRLTQITTSTTDAHRVDSDWMLTRAIFLDLDSQYGPFTVDACCDGINAHVVPFFSPSRSFLSAQVDGECVWMVPPTSSPSAFISQYLESKTTNPRTSAIIILPDRPTAPWAPLIRHMTVMRRFPAGAWIVCRRDPSDASRAPGARPLSGEPPTV >gi|Smin1000031348|ref|symbB.v1.2.027722.t1|scaffold2860.1|size68653|1 MAISSDLAAIKVLCKNQRPTSESINHDLSPFLDELKSSPRFGTSLLNKLARRKKLRQLSLVLDALLLNRCDVNVFHYGVSISAFEKAEKWDRALLLLRQMDVVRVEPNTVTYSACISAMEKCGLWRQAVDLLAEMEYRRVEKNVITISAGISACSKAGHWWLALDVLDKMCKDQIDPDTVAFNACISSCSSQWQVALHLFQQMSGFKLQPDVISYSGAIAACEEGLQWETTLQLFTDLQSKEVHLDDFSYNALISSFSRGSAWQLCLHVLQLKQLASCASTASIASETNHSNEFEFATNDTNDTNDTTDICKLLVPSVPSVNFVKLVEGLDIFAVSATVSASLELQWAVGYPFLPAKEEQLTHGFYKYIAGMQALCARELLELVPHAQNIMDMFCGSGTVLVEALRTGKRAIGCDVSPLALLVATHHSDAARIDLYELFEVARELVASMEARNEGWHYLKSRISNLRSKNLRDALHFILLVSLSRVQDVTYLHSSSKVIKSSVPDHGLPPCMFLGVAQLYVARVRSLRARALESECEIYRCDARVLRLEPVDAIVTGPPYPGVYDYHSPANMCADLLGENILYDFCAPGYSIRGSKAPTNVEMAHEKSSTYAAGREIGQNRLWLEDSDFAEIWQSEQEAWLTSAFENLREGGTATLMIGDGDLHSAGDGGFDNLEPTIIAAEKVGFATIATATIRGKSKHPKQPKGMKRTEHVVHLKKPKL >gi|Bnat1000011351|ref|jgi|Bigna1|87291|estExt_fgenesh1_pg.C_180193 MIQSDLEAKRPSSNPSTQDGGGKGGGTIVPLRLGDETALLKNQPTNVERKERLDSTKSIPAGKLGAGDTTLHVGFLSKEEANSALSAFQNGEVIFQQWYHMPDKRSPAGPLRKLRRIKAALCNPEKDGRIPLYRFPVNDQKRYGETIPMPPALEAIRKRAAKVTGFEYNHAVVLLYRNGDDCIGFHKDKTLDLDDKAPIISISLGAERPYVLRDNIRAPKIEQEFIFPHGSLLALGPETNANFYHSVRQLKKEEETGDVRISITFRKVATFRSEDGKIITGKGAGRGTNLWPEAINGAHRLDTKLDESLKQAESVEDDAKARTVREAHHQAHEKRMAERKAAKKQKQRSLDKTKIEEKLSGVNRDDGESLKDLKIRLMGLVRKDIKNFALPGVNIDDKGIEQELSSLVGKFLGAKVASSKNAEA >gi|Uram1000001377|ref|jgi|Umbra1|222849|fgenesh1_kg.5_#_213_#_combest_scaffold_5_101108 
MSAKRSNSQILHHFGRVFARLTVKDSSTPDLFKSNSSPSTHHMPSDPLASHLFQTANNHLFGTNGFAKDPKVAVTYLRQAASQGHAQAEGVLGFCYEFGLGNIATDFRQAEALYIRAARKSDGLAMARLAFLRKYGRPGVKIDRGEAELWTERLCNLGVESIQWIREAASQHNCPEAQYVLGVCYHDGVGPKANEAEAFRWYKTSAEQGNARGQGILGYCYGEGYGVKKDDVEAIRWYRLAADQGETVAIYNLGYCYEDGIGVERNVTEAVKWYKLAAEQGNAFAQNSLGYCYEDGIGINRDSQKAATWYQKSADQGYPWAQCNLGFCYQNGIGVEKNERLGAYWYRQAANQGHARAQHNLGFCYQNGIGVPKSATDAVHWYTKSAERGNSFAYHSLGYCYQNGVGVPVDGKRAVYWYYLSAKEDHAPAQLSLGYCYRNGIGVEKNETEAFKWFYKSAAQGNALAQNSLGFCYEEGIGTKKAPKLAVSWYSKSAKQGNSWAQCNLGFCYANGIGVNKNYQKAVFWYQQAAAQNHARAMDKLGMHLQSGQGIEQDVKLAFEMFKRSASLQHVAAQFHLANCFENGLGCDVDLAEATMWYERAALAGCRTSHERLRRLLMRACLDSAGGTGNDEDSPLGEGAYYSTFAYGHCAPAA >gi|Fcyl1000021378|ref|jgi|Fracy1|239244|fgenesh2_pg.6_#_646 MTMLTDDNGINERRFRSVLPHEPLPIDGDSKGYAHLQGAFEPKRNKLTNNSNDKKSSTTTTTSSSSSLSYWKSPEALEEVLLLALRDDFERLKLPYPVLSVHVTDPNNLSKVRLEYNSPHEAIQIQYAFRDQRISPHDIIILTDEQQHVLFGSRPCQATMITDKPLPTDRCFWPRSNPPQFRRLLHDRGDDEKERSETRFVYVTGLIDNNITAELSDWWNNPYYVYQAMRQVFGTDVEIFLPKKINKRQQRIQSCQLGFRSAEEAQDAVQKFQGMVVSWELKNDNISNSTNKNINSGELFLDYATVTKKSKAKQQGEYEKGEASRPDCTSTTDHVYVPGLVVVENFLTEAEEELLVAILTGPQAPWAPQQSNMSQTGSVKRRVQHYGYVFDYRTADVLRRDVEEESGRLDPDANCPPLPSIPVEKGMEKQTNDNDLLVKEGKGWELLAQIIEKTRQHEFDVCSNKSNNSLNNNENADLIDPQPTKKMIFSDLNQMTLNQYKTGEGIGSHVDTPSAFGDGLISISLSSGIVMEFQKVTVGNDDDGNKVSPKNIKKLVYLPRRSLVLMSGACRYEWEHHIVTRRTDTHNGVVIPRGLRVSLTLRTALSSKGIPLPRFESNMFPPVWGINDEKRNGSTMDSNVLVTPNTERDHVHAVYDAIATQWHHTRGKRGVLWPSASLFVKDLPEGSIVADCGCGDGKYFPAVWEAGSYVIGTDISLPLLKTALLNDSASLSDGGKVPDTRRVSPHRESLQKRPAVAVADCMSIPFRNNSCDAAICIAVMHHLSTIDRRIRCIEELARIVKINGKIMIQAWAMGCHSRRRFAAPDVFVPFNAQPKYLDKVSENSKTTMQDFKAGTMNDSNTGVAATSVVPTINGPHASKSAAEIYSDAYDNADFDEQKGLVVFQRYCHMYREGELEDIVSQIPSVKLIDGGFETGNYFVILEVVAN >gi|Vcar1000011421|ref|jgi|Volca1|45994|gw1.116.24.1 MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSCFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA 
>gi|Uram1000001443|ref|jgi|Umbra1|223189|fgenesh1_kg.5_#_553_#_combest_scaffold_5_102347 MHELNTDQLEELFGSDNSDFDDSFSDDQTRMDARLGSDRNRSPARMLTSEKFGRIPGLRLLRQGLSHSVQTKLLDTIISAKYFNGATNNQAMHFGDLPQQFQDIGQWAKRDTDLLESIDREPLFDQAILNLYRPGEGIRSHVDLQKFDDGILIVSLLSSCVMIMTPASEAQKHATSYSEEGNLDDGIPILLRPGDVLAMIGPARWDWAHCIPARLVDDVNGDIIQRGSRVSITLRKLNCTTADAIELGHRQ >gi|Caps1000011463|ref|jgi|Capca1|150134|estExt_Genewise1.C_2310033 MICNGGNCIYINNMTSESRGSSKPCACKGIRSCLLCERDSPSTPRSVSVTSSWSADTHDVKDRKEATKQIYVYCHRCRRAHIAPWTATPSLKDVINHAELNCSPSDSDLPLQGIHLFENFISADEEKELCDRINCTAWVVSQSGRRKQDYGPKVNFKRKKVKLASFSGLPSYSEPFIQRMLQLPQLSDFTPVELCNLEYSRERGSAIDAHFDDFWLWGERLLTINMVSDTVYTMTNEGLPRTEILIHFPRFAFIVMMGEARYEWKHAIQRTHVAQRRMATTIRELTPEFLPGGKQEDVGREILDLALTYKGRAVGSAS >gi|Fcyl1000031479|ref|jgi|Fracy1|249345|fgenesh2_pg.26_#_236 MAPPTIATFFSMAWSLMVAPECLSLATISVSQKKRVEVVEPGLVILRNFIDDEACQRIAAMAKDFGDEFYTVNKEGEKILNTGESRGRIYDAATRFPRDLIQLSNDAVSTSRAADTSMPAMQCTHVLLNLYTTSEGLVWHRDIYENDGKSDHPVVNLSIGATCVFGFKHLDTDEERTVELRSGDILLFGGPCRLIKHAVLEIKLDDAPEWMSYDPSRFSFTFRDSPEVLGREEEFKYFRVKEDLVGQDNFKVPTSSTDRKAFHGLPSYTTQQHVSMAS >gi|Ttra1000001477|ref|AMSG_01559T0 | AMSG_01559 | Thecamonas trahens ATCC 50062 2OG-Fe(II) oxygenase family Oxidoreductase (253 aa) MADSNEMAPGRARREVKECVRYEVDVPAETIAASRRTVAALPSAVDRPRLAGVTDAGVPGFLYLPHYLSAEEQSAVLAAIEGDTSVDWSDAFETRLQKHWGWAFVYECGQIVGGEAAPPAPALLLEVLAERFVADGLVATPPNQVLANKYMPGNGIGYHVDRIDLFGDVIIGVNLVVPTKFTLKSVAEPEERVSVVMAPGSVYVLSGEARFGWRHGITRKARLYKRFRRAPELLPDEPWRVSLTFRDVLEAT >gi|Pbla1000001570|ref|jgi|Phybl1|23346|e_gw1.22.32.1 
MDSNVIEQEIMSLLCKKAIEEVHEPGFRSRIFTIPKKTGDLRPVLNLRPLNQYIPKQSFKMESIKHVCQLIQRGDYLTSVDLQDAFLHILIVKSSRKYLQFSWKGHIYQFRVLPFGLSLSPLVFTKMVRPVLRWARRQGIRLSAYLDDLIVIARSPTLSLQHTQRVVDKLQSLGFLIKATKSHLTPSTSIEHLGFVINSKDMTLIIPRSKLRDIRREASRLLHNPTITLRQLSSFIGKAQATTLAVLPARLQTRQLISIRNQALYRGLQWTSPIHLSSMARQELQWWIDQLKAWNGHSFLPEVPQVEVYTDASETGWGIVYDNTVLSGTWTTDQQTEHINYLELMTIAFATKLPRLQGKALRIYCDNMTTIAYVNHFGGTRSAKLMNLATDMWKQCLVTGTRVRLAYIPSPLNPADPPSRSLIQQLEWSICPTFFRHLDSLWGPHHIDCFASSLNTQLPTYMTWKWDPQAFAMDALSVPWTTWPRLYLCPPWNLLLHILQKLQREKVPATLITPNWSSALWYPLLLQLSSRPPIPIPRHLVLPAPGCAGHVLLKNPHWNMIAWDINYAG >gi|Sarc1000011591|ref|SARC_11325T0 | SARC_11325 | Sphaeroforma arctica JP610 hypothetical protein (77 aa) MEQVRSQGLNTKSRVTWWYPVVCDAKHRTNEYKAATFVFDKAERAWRPLTHDAFALPGNQQLPKYFSPSTDEGALT >gi|Ccor1000001613|ref|jgi|Conco1|3535|gm1.1775_g MSYNKEWRIDYEYGYIQEPRALSSKKEANNMQVGNYDTQSYPRNRDTRSPYKFASSSSDLINYSVSSHAGQSHQQFTPITPCFPLRSPNEEISIEKNPRLEINYLLSHQRLNETLSSPNIQHSDSMASLSSREAIKDDTARRASWPTKSQAQSMQNPVRAQGLKRPLEEEPELSTSDEYPIAVIRQIQEEIERIILDTHSNRSMTACAHINTSSHSPNTCQNCNENQFFQTQDLWKFLNNEKTENYSLVVSFYTVCRIWCEARMAAELSPREMASGILERYSIVKLDRNVMDDSRCEGWDWFYNKIKFQNSCQISYDTIFKHFVHLVRTNEELLNMNWIDNIYTQLFHQTNSSLKFFEGESQIQRDRGQFFTPIEVVDFMWNIAIPSWDDHIKHQIMSARIQDLVFDPCMGLGTSILYYIDRVIKQLKSNHCKVIWNNPMSLEVIYNKLRSGIFGIELDLVPFLVCLCRIEAHLFPIYIQISRLNRDLTEINNSPVRLFWNDTLQLLPREEFIKRPTDTIYRRWSEAQINKLRNFKYSFSIVLTNPPYLLRHQFSFPDPKLYSQAILGGKGHQAYLYFIWLGLHCISHEYGQLVMITPSQWLTCEFAQRFRDWMWGRIWMHRLFLFDPFKVFRKVQTNSLIFTLSWRDKDTQQKEHQIGFFKCKDKKLSLKSMLEHYHYWNCQLNNKCGMDQVLDPNSSCSIQLKLTTQLETKLTESPANPFYSLIPHNNLSHLMHQLAANLPTLCDSNGKTVEWDSSSPLIWNRGPNTNPVYGLVVRTEWAKQNFGEMIYSKYMVPCFYWNGCSNNPLRDEGVSGEGTKEIQFWNKRDPMRLSRKESSPAESYVVHQQYFKHYSLIMVDKLTAKDIIQKNNHAQYKIANPSICVFYQFLKEVREELQPQYADREIAYSNIQKCGNNMGQKIVTPINYGYFTTTQPRQRFFLEEESTSVTNQCIYFTIKQTDLYDKPILDADYYLALLNSAIIQYYMNIHCQYDQQGRLRLFRSNMAHIPFQYPESSAYIQILQLSRLMKSLKSVVYNLQTIFNPSVTGPSFLLEPLRRGLIDLSEPNRWQIVEDCCNEISQRNNINSFEWVQSRLRFVYRMVNFVQLKIDLLMFEIYQLPFEFVEELFQELDLMTQYENYLQEVDAINIKGGYNGWVNEVEIVLAEFDVWIKSIY 
>gi|Spun1000001767|ref|SPPG_01847T0 | SPPG_01847 | Spizellomyces punctatus DAOM BR117 hypothetical protein (259 aa) MGSMSCSHMVGRLRTFPFLSLQATRACKRSHVTTPSPKRNLGQLHPSNPLLDLSNHTSNLPPPSSLTIIPNFLSPAQQALITKSADKKLRRLCGREYWKGHFDGVIEKYRECSVGAWSRSGGDEPEIVGIVDEIKKVVEEVVEADGGRVGKWLDPHILDLGEGGEIRGHVDNVQASGTIITGLSLLSSAVIIFRQVKDPTQSFSALVEPGSLYIQRDALRFDYTHEIPDDPALRKFRGNTIPRGRRISIMMRNEREVV >gi|Fcyl1000111862|ref|jgi|Fracy1|186334|e_gw1.6.1306.1 ; gi|Fcyl1000084765|ref|jgi|Fracy1|158880|gw1.6.1166.1 ; gi|Fcyl1000088082|ref|jgi|Fracy1|162384|gw1.6.1362.1 ; gi|Fcyl1000086782|ref|jgi|Fracy1|161015|gw1.6.1306.1 MLAKNKEKLLKINPAHYEKLKTIYIATTTTTTFDLAEFHRRLFCILARYHGIQGHGFQAACTEHVFDVLHGRFGVNMECFASPLNSRYASHCSAFPDTDACFGSLGSFFQFHPKHGSFEANPPFEPYVMLAMVNHMEVLFKKATGALSFIVVVPGWKESEAFQRLKASKWNKKSLPIAQKDHGFCDGAAHQRRDRYRISPYDTFFFFLQNDTAASKWPLTEIAINELRSAMACGVPSPSMQARHDKAGRGTDDIARGVYKGKKRKATGEGVMSRKLEETRLKKAGVKMAGGAKKYKRNT >gi|Lhya1000011955|ref|jgi|Lichy1|139241|e_gw1.1115.1.1 KINELKISFPDISTDLLLEILLSCNGSVKQSKHLLYESIPNKKRKLNSDNINNTIYQSTLKDIFHFKSKEIEKKINSNIITLYDKNDVEKALSPYVTFHKNFLPQELSNSILKYILYENTIQDAFTNKEFYIFDKKCKSSHLTSIFSSNEDFITGKNKLFYYAKKSNNIKKYNNDLLIAQLLVEDIVNKEILKYSNKQIYPFLDKNQFSGEVAFINKYENEYQHLDWHSDILTYIGPHCVIASLSLGVKREFKFKKKCDEKNNLIYSIPLPHNTLCIMHAGCQEIFKHCITKSNLPINSHPISGKTRINITYRSYRKDFINNIPKCKCGIDMALKICYKNIKNRGRYFWSCEGTYQNNSCYDFYWADFNDKKLITKDYNKCSIWFENDNQKTINFN >gi|Fcyl1000122023|ref|jgi|Fracy1|196495|e_gw1.28.330.1 ; gi|Fcyl1000088062|ref|jgi|Fracy1|162364|gw1.28.330.1 MVPTKQPFNAILVRLYFDGNDEIAWHTDGRTFLGTTPTIASLSFGSKANFQMRRMTNNNNNIKSSGIDYNTPQHDFIVGDGDMLVMLDETQKYWHHRVPKEKGRRPRININFRYINPGKD >gi|Fcyl1000032066|ref|jgi|Fracy1|249932|fgenesh2_pg.28_#_266 
MSVPAACSSSSSSSSSSSSSDMNNNNNKNNIEEINDKSKGLHVKVFRRYQDQIQSDYWWNTILNNINWYRVKYKSGRFQKNCETPCWTTFFGGRKEYTPYQDIPDWLQPLVNQVSSDLMVPTKQPFNAILVRLYFDGNDEIAWHTDGRTFLGTTPTIASLSFGSKANFQMRRMTNVWPSVNRNVVNNNVNNNNSSSRSSSSNKNNNNIKSSGIDYNTPQHDFIVGDGDMLVMLDETQKYWHHRVPKEKGRRPRININFRYINPGKDAERGQKTYYKYMVHGDDDEKSLKSYSYKSILAMRGGIMNFISSSGKPSSSSSSSSNHNHKTGSNNYDVPTAGMVQKYKNGNDEDDVINDGMNSTDDATTTTTTTTSTSTSSSTTTQQYYYLSASENNNIDKSAFMALPDDIRKELIHECKKGKAKRKYGHQQ >gi|Sarc1000002122|ref|SARC_02073T0 | SARC_02073 | Sphaeroforma arctica JP610 hypothetical protein (178 aa) MDTRRIKVFLGYRYDYGKTQTGPRQLLYDDVDTMSKLSLVALVREVLVRVLRNDMQGYLKLPGRNFFNQAVMVCYLKFNSGLGTHQDSKSLFQRPIISIRLLADAALSFNRVGRECRRTPQSFSVPQSVGTVTLLERFAADYATHCVLKKDLKVPSLSIVFRGVQQSAVEAMRSSQV >gi|Vcar1000012130|ref|jgi|Volca1|97056|fgenesh4_pg.C_scaffold_57000068 MNISTEPDTPPPPFNVSRLNQIATSTTRAQRVDSDWMLARTVFLDLDSQYSPFTVDACCDDFGINAHVVPFFSPSRSFLSAQVDGECVWMVPPTSTPSTFISQYLESKSTNPRTSAVIVLPDRPTAPWTPLIRHMTVVRRFPAGARIVCHRDPSDASST >gi|Mver1000012212|ref|MVEG_12216T0 | MVEG_12216 | Mortierella verticillata NRRL 6337 hypothetical protein (214 aa) MIFFGSDSDISQIRSTYPIRKPTSDNGWGIVIGNRSWSGLWLMPQRHLHINTKELLMVFMAADLQECQGQMLNIICDNMTSIAYINHFGGTHSPELMHWATKLWDRCLKTGTRLKMTYISSSFNPANAPSHQMIAQLEWSIDPSFFLWMDKKWGPHKVDLFASEQNHQTTCFMTWKPCKMAMAWDALQQPWMTLGRVYCCPPWNLIPAVLQKV >gi|Wseb1000002244|ref|jgi|Walse1|68555|estExt_Genemark1.C_70130 MRGGDFFYMNEFLKQNEANELYNQALELEFYRPTLKIYGKDVIQSRQVAVYAIEEKRAHMKYSNHDAKVNHPFPQLVNQIAGRLKEVTGVDFTHCMLNYYQDGSVYIGKHNDNFNNQVIATVSLGAERTIHLSPQTTKAALKVYPETDVPGREKSTLKLTNGSLFVMQGSTQRYWKHEIKKEPKVKTGRISLTYRQIVD >gi|Psoj1000002243|ref|128295 
MTAGLAVLVALSVLLFVGAIVMLLFFCRRIQVQRERESLVFIQEPADVYREELNDSFMEQALWRCGVCKFLNHPERKLCDLCQTLKGAERDVAGNKKHTSSGSRGGSFSRGSLSSQRMGRIGETEAFSSSQADDAIGGSFGRPSFEFSRKQKPQNKLSKLQLAASRRQQWKRMPTTDGLHRWVRQEDKSARGYNARNTQRDSIESDPGGRMSLNRYLDATERDSNLSVGYIRVRDSLGRLVLNESDVVATDFHYRINILDGTGSSELIMNLEGVHNLPFPDKIRWFSTEIHRLWLPWESGHAELVVRRDHLLQDSFELVAAMKPYQLRQRWRVVFDGEPALDAGGVMREWFTLLFAELFDPAFGLFVSTVGDERSYWINASSDLLIGEEDHLAYFEFAGRLVGKAILEEHLMPVHLALPFLKHVLGVPISFSDLQFLDDEIYNSALMVKKIDDIEPLCLDFTATRIVDGKPEIVELVEGGANIDVTRENRARYLDALFKYHVLGSVSEQLLSFLTALYDVVPEGLLKLFDYQELELLMCGVPSIDVEDWKKHTDFKFFTHNFPTELELNNIEWFWEVVEDMKNEDRVRLLQFATGTSRVPAQGFKGLISSDGRVRRFNVAFAGANQSFLFPKAHTCFNRLDLPIYNSKEVLSEYVKLIVQMDITGFTIDRMREDKQEQECHRDENGFADFKYPKGWCQPTKPPAWVPPVAKQAKRPQSSDGGPSKRAKLSPPPEQEFSLATVVANVKKAAIEHSHLPAEVASRLFEEDATISYLTTDHKSWVYHVPQWYKHVFEHVTPEELWEAMADDDDTESQVTWATLFEQAWEAHPKQHDTIMMFGKPAKLPRFQQLCGEMGSYRYSGKTFEAQKKYPRGLEHAVLHMQRMVEDPATHRTRLTGGLVNWYENGDHYIGPHADDERDMMACSPIVALSLGATRHFVFTKKTSKSAPQGDEAVARLELQIGDGDLMIMGGTTQRTHKHAVPKMARCREPRISITLRCFH >gi|Lhya1000002295|ref|jgi|Lichy1|204403|estExt_Genemark1.C_180068 MQPPSHLTSRRQRKIWEQQQRDNAVKRQKKDIYENQTPFRHAERHFKLTRQLDFSNVVDFDLPPDAKDKRIVSVSLHNALPESLFGKATDTAYILKGVNGLIYIPNPFSPDKQRHYVKQCLSTYALPPNKSNLDPHYKIPAQGLWHLYTKEQKEEEEEEEEAKIVSINNHEQQVLPSDIMHKLRWITLGYQYDWTTKKYNDEAYLIADDLSELTKAIVAAIEGIGQQNEWKNTYSSDKFKAEAGVINYYQLKDTLMGHVDQSERNMDAPLISLSFGHACIYLLGGTTRDTEPIPIHLKSGDLLIMTGQCRKAYHGVPRIIENSLPEYLSPCIKDDDWKLYGSYMKTSRINLNIRQVE >gi|Mver1000002291|ref|MVEG_02285T0 | MVEG_02285 | Mortierella verticillata NRRL 6337 hypothetical protein (480 aa) 
MAQTSLPHNGTAEDWDFKMATLFSIFESTPEHILHQALTNAHGDLEQAIPIVLSGQQSASNTTANSHHPSRPKKKQRLVQPRLAAFLSSPSSSSSQSSSPSSCSLPTTTLAPSLSSSSNLPSLNDRLRWKDSIDDSTRPRERIKPLVLYNPEDVAKHCPCTLIHNVLDRDLASRLLQVMLVESETWNRNRWWLFERIVESPHKTSYYTEDSHDLAEVSGWTYNGKKQDPPRKFLKEMDEAKLVVRKIVNELRSARELHPYEEQGEWKCNVAAVNHYAHSKESVGWHADKLTYLGPRPTIGSLTLGATRFFRIRKVVPDVKNPDTAGQMISIALPHNSLCIMWPPMQEEWKHEIPAQATVTPHPIRHVDQRKRYGTARINITYRLSRPGFAPKDTPVSGSQEGKTCGEFHWVDMETKLGLVKLEEEGKDPSQDTIQDASSNSLKRRPSDILKDDDDPLDLASELLDDVSPIQAPEEGEAE >gi|Vcar1000012306|ref|jgi|Volca1|46001|gw1.61.90.1 MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA >gi|Sarc1000002310|ref|SARC_02258T0 | SARC_02258 | Sphaeroforma arctica JP610 hypothetical protein (167 aa) MSYPALFTKHITEAFDTVDIDLFADAKTAQAPIYCSLEPNAPLHDAFEQSWKQDRKYLYGNIPFHSKTTWRESLLKLERKNELITMIFPVIPGQRNYKQILLQLNSHVEIIRTNEHTFLQNDTNVTGPLSKWRYICLARVGNTKYSILPTDDLMSAEADVVAHPHA >gi|Vcar1000012324|ref|jgi|Volca1|91734|fgenesh4_pg.C_scaffold_22000129 MASLFLKRNILLKKNTGRDAANRIRDDPTPPRPRDHRRNDPMDTSAALLRHQRQDAGRCLQCGSEFHDKLSCPDLIHLTAPTLAAMTSTAAPTQTAAARPVPVNVAFKPPPAEQIASKPARRVGCMRARVAATSTTEVHRVDSDWMLSCTVFLDLDSQYGPFTVDACCDDFGINAHVVPFFSPSRSFLSAQVDGECVRMVPRTSTPSTFISQYLESKTTNPRTSAIIILPDRPTAPWAPLIRHMTVVRRFPAGARIVCRRDPSDASRYVNDANKVVCRLPAYNEEV >gi|Lgig1000002422|ref|jgi|Lotgi1|103779|e_gw1.2.1002.1 MDLFVVKTKRKRADSQSESSGSKKFKSYEKSECNVIDFKKFRLENLSCDYGILYSKREADNLLKECEKILCYNEGKLAKICLFGKWHNIPRKQVAHGDEGLSYKFSGNSIPARPWLPLLENIRDDITRQTGYKFNFVLINRYKDGSDYMGEHKDDEKDLEADHPIASLSLGQARDFIFKHQDSRGKNSTRRDIPPLKLVLQHGGLLMMNYPTNSYWYHSLPCRKKLFNVRINMTFRKMVVK >gi|Uram1000002459|ref|jgi|Umbra1|228021|fgenesh1_kg.10_#_156_#_combest_scaffold_10_5503 
METLQPPAHLKSRRQKEMWKRQLLQNVKERQKAQESQTPFRIAERSLQSKVATPDRPPVVDFTNLENNPPEVRAKLIRVELSHDLREICPLFGRTDDDWSKRRSIAYIHSDIEGLIFIPNPFTESAQRQLIYNCLQKYTQAPNSSSLDAHYEVPAEGVWNLYTKSRRCASNEDDSQLLISRKSRSNNGQKIPESADPYDAPSTSGQQSTSIEKDPDIADIKSEPSLHKPSELLHPSELIRKLRWVSLGYQYNWSEKTYFFDRPIPVPEEAAKLSIAIAKAVEGIGYRQDDGFRWQNNYKGDNYAPEAGIVNYYQLKDTLMAHVDRSELNMEAPLISMSLGHKCIYLIGEQTRDIKPHAILLQSGDVMAMTGASRSAFHGVPKILDDGPSFLQPGTIDENIPDWDVYGEYVSKARINLNIRQVNVDSSST >gi|Sarc1000002473|ref|SARC_02416T0 | SARC_02416 | Sphaeroforma arctica JP610 hypothetical protein (686 aa) MPELQLAGYQFYKEPCVSARAYRHVAPSIELVRAQKVQELTEALHAAVDSQYGGPSTQGRKRVHESSHYNDGSGSAVPILALERWMFNAKDLERDALRDISGKTGDEGGEKNQDYEKSVGRTEETDALLPDIINVAEKVEPVLVRDLVRGSHTQESAAEIATHVAELSHEFAVGINRLCKLGLSADLEPSSSMAPSCPQCGKIIEVRVRVTMHKHTIDVNTVRIRQRPDMKQKGTQVKYVQGVKEEGLEGGDGTNVSVGPPVCAVCVADGRTAPTRLLKLNHEHYAKLRALWDSAQAAESVADHAGNADGDGMGNSKANKKRNKNKKRNENRRKKKKAKGERDGNNNGTGAEHEEKTPTDTIKDIKNIEGKQGKKHSDTKPSSKTSAERSAEKDTSTNEAKKGDTAITGAIKRSNSSTVKRVTSDKHGKPIELDDKQHKNTEELFHNDLYSMLLRYNALSGMGFQAACSEHVFAALKSLFRTDFECFSSPLNTHHPLYCSAYIDTDFHFGSRGNFFDFYPASGSFQANPPFVTGVMERMAKHIETLLQKANTNSQPLSFIVVVPGWVNEVSYQTMLASPFMEGDGPLLISKDDHGFCDGAQHQRQDRYRNSPFDTAIFFLRTDAARNVYGERTFESDAILRKAFAEALPSDAAVARRARDARGFADLDRMGNGSSKKRKGFFKRY >gi|Ttra1000002481|ref|AMSG_02643T0 | AMSG_02643 | Thecamonas trahens ATCC 50062 2OG-Fe(II) oxygenase (292 aa) MASSPSKPSNVSAAAASPDLARAEAAIAAARGFSVHVLHDTADTFAYVAVGRGLLPLELADAAWAEASVPAHEAAQIVDGSLATSSGPMAWSQDEVTVGGRVVREPRLTAFLASAADVVYTYSGKANLGRAFPPAVSAVRAWLADALERGDDEYWNACLANCYIDGSQAVGWHSDAEDDLVPGSPIVTVSLGATRGLHFRSRAASDLFVAQNDARHARKHRPLTAAEAEVLATPTPTDLVLDLAHGDVLIMGGMTQSLFRHRLVRDADCHLPRIVLTFRAVVPSALAPPPL >gi|Crev1000002507|ref|jgi|Coere1|86268|fgenesh1_pg.10_#_86 
MSGPDRCHGADGPLSPLAAAFDYDSSGSLSSLDSFLDALSDCADDPLRPDSSLLPERQFNRSASATDAKSSQTVPLKQQEHKKEKQNLPSRNKRERTADVSSPKPRQTVNLQRQSMQTLVSSNNMITTGAFLTRTTLRKLGVADIVADPEASNAGLAVGSRIKVLNLDKHWYTAVVLAIDSGKALAHYPGWEHCYNEWVLIESRRLLYRGKADLGVSGSTKAMEQLEALYSGVEEPVLIGFDLAQAINDAFGISTGCNLKNGGADEAAANVFVNSVDNIDSGTLARKQSGVKESKEHRSRGRPAGTRNRRRAGHIKAKSSRKQRPTIQNDKAKKTLVGDQEESAAAECPSTPGVPAELRVPSVRLVRAAENPYARCHRSEDIFCGDSDENEATARESCLTAVRDVHNGGNEEPSGKKARIADDNDTATASSGNAIWHLTRGDYVTTGAFSTRRTIKALAHSGSTGGIMQDHHGYYPGQRVEIMNANQSWYQGRVIAYANKKFLIHYGGWDHANNEWIVAGSRRMRPASDINDMAVTETEEMARKACVVLVDEYNTYIDGVERKNAEKADAKRKARELRKTPVNVRVAKLAQSMSADSMGDDEAEEMTHACLPENEEEDDEDVDPISVEAGYTPVPQLLRVKDYVQLFRKGMQIAARDRNKLWWRATIAEIKTFRLRIHYTGFGSSWDEWVEMNTQRIMFEESAESSRCDAEMAGDPLHVSGENSSALSKEPNCMGQVISSSSHGKDAGQTDYGTVEESQETKGIPVPRRLGRPPGPETKSTPLSLRLALKALMSDREMFEQCHPEELDVFHLPKEHMSMRDYSTFLKVGDRVRIRDRDKQWYDCTIIDLRHGRIRICFNGHSDEFNQWIPVNSDRIRILRETIDGDKRLEKMEKESQIAQRRKQEKLRAQRRKRSQASIASLVRLAESLEYIVDCEDTFVSGHTQVGPQDGITELDAATEVQSRLEGSTEDDASGGDDKPLLQLMMESDIDNVDGMPLLTRILLAEHFKRQRFGALIRHGSMVAMQDSATWFVYCNQCNIVISTFRYYCLSCERPSDGYDYESYDLCLMCFSRQFPSDHPHSQASFARAAVGDAESIVKFTADALSRCRDHERLAAASAHMLDLFSGLIAVYEPDAFDTSYKPRTPGTSLWSKLAVGLHGTTTSTLDTSAVVGKIIGNTRRSRITSIINPDSEMLSSCNGHDADASSDKEDKDETDRQLCKADVDDLPPRCAFCSEDDQSQRDLLGTFAAEQPFVLSMVRDDGTVRRRRFWAHTACAKYSPEVLVTEAGQWFNVAAALRRARTIKCAECKRRGATIGCFHDRCQKSFHVACAGMSKSFFESGRIFWCPKHARMAAGVVEGNAGPEPVSLEARCANCNHELSGDLMWMECLECLAEPERQFSLCLTCYDSKDALADHPHKKRCFREHLSHTGGVSSNGQYLADIAAQDSRRRVGKGTTCCHYCRSRQSRRWRKGYAGVVMCEACFNTAHSLRGGAQAKQVQAGTVCDQDLFAEADNDSPGELEVVALNPFGRSLITGSDAQPLPPPQQQQQGALIEDYTQGIYFTREACIAPNRVGLPSVSQQPLGELSSYGPTDSMLFTLPVNTSYFDIPGRAPRWASHSGTDYHGTWLPQTVRRALLRYTQRGEHVLSNFLGRGTDAIECFLLNRKCVGVDINPSAVSLSQRNCSFTITPGCGMSIEFRPTIMQGDARDLRSDLWPGASYFAESESFDHILSHPPYKDCVLYSTNIDGDLSRFPGPDEFQREMEKVVTESWRLLKMGRHLTLGIGDNRAECFYIPVSYQLIRTYISSGFELEELVVKRQRYCQAFGLGTYLCVQFDFLMFTHEFIATLRKVPKDQIDSMHLADRHYAEDSEFGLQTVTVDKDPLDFRLVAISHRCLREVPASPIERKGVVMGSVWTFEHHPVHSFTHMCMSRMVERFGRDGSNWEQIDLALRPLEQGTTENAADGTNAASDIAAASDTQCTGDVIDDKCASLNNARNQA
ESDPELLDSDTEEGGYERARQRQIQQNREQLLQLGLVSELGEDSTDIAHYQKMIAMTPLPPTSSAPLALIVVPHILNTEFARCHVEPYRRTLVQITHDASHRLCPSGLLVLGVQDVRDEHGKLWPLGMLVLEDVQRAVGSIRLRLKEFIVVVENGYARKRDDVMSRETFVDEQCVVEVNTPDIHVPIVHAYYLVFMKLK >gi|Fcyl1000122509|ref|jgi|Fracy1|196981|e_gw1.31.61.1 ; gi|Fcyl1000122713|ref|jgi|Fracy1|197185|e_gw1.31.263.1 ; gi|Fcyl1000088412|ref|jgi|Fracy1|162734|gw1.31.263.1 ; gi|Fcyl1000066918|ref|jgi|Fracy1|139673|gw1.31.61.1 LHTKLQNYFTPTSILENIAVLVTPKVDPSASLSSLALIRLSKQIIALDNENNEYLINDNNKQLWKEGLRNLVSCLASSNWKASPKSLETAVEGVNAASVISRLMSSDYLLSSNNDNNNGKIWWEPLVEKLHEEADDQLVRMIQPHQLSGIKFSIDCIQLSSSTKQDASDLLSSHQQQYLLPQSLQIAYDNLNLPFSVRPGFLNGNDDDDDDVDNKHNNNLFTVASFVKQVNFQIETIQTATNRTVAERRQTAWEGDEHVENFEYSEKSMRRLPWSDVVANVRDRLYNETSHYYDGCLLNFYPDGDSAMRYHIDPDQGVLWDYETAVVSIGATRRFSFRESSSGDGSNKPHVFVLMNGDVTEMFNDCQERFQHTVQKSSVKGESASRVSLVFKKTLGYSKERTKSRTKV >gi|Wseb1000002511|ref|jgi|Walse1|64082|gm1.2421_g MLKRTLVRLNKIERINLSNASNNRLFNFSNLKSDALSELDKEDFVIYPNYLNIEEQKVLLKQLLKKLDRVCGKPRRNTNLQRQHEEQYEEGNLQRAFCHKDMYRWQTSHFDNVITGYREANVRSMTVPNVVSEEGILGILKRLYGCLYDNSTELTKLQANDMKDERLEDDDLSVPKWIQSHILHLSPDGTIQAHVDNQEAMGSTIMGLSLGEERLVEFNNESKGSFLVRLPSGSVYIQKSKLRYEYKHSILQGNCRDQRLSLMLRDQPSPK >gi|Fcyl1000032662|ref|jgi|Fracy1|250528|fgenesh2_pg.31_#_127 
MGTVLYNVNHYLATITRDGERTMFTQSDDGDRSTTNHDKKFDTHNDKSKNEDKNGGKRNVLFDMTEKFGLEIRKKFIGNDQSPTKNSTIIDRKTTTEEQESKSEETYQDMGRMLMKLMLPSNSNDSKVESLNDVIEEVKGMSGRGDIQDNNTIVEVFNVAKRCHNMLDSQLNEFFGEKGSPPLYLTNLIYYIEREDEIKNPTWKRRKHYFFPGIDIAQMDDLNEKLKLTDLAYEDTIDEIRDRLDIEYNSELVYCSLESLPNKPAHFIAVKRDQSPRSKELEVLLVVCGTKRITDIITDLICDATVYREGFAHYGIRDSGQWIANEHSDLFEKLRVLANKKKIKLTLLGHSLGAGAASIAGIELNDNPFIDVKVVGFGCPAMMSSELSESYEDIITTVIGDNDCIPRMSMATMVNALLDITELDYTPFALRDFEETVDEMQRFLPSYVDDILENIAVLVTPKVDPSASLSSLALIRLSKQIIALDNENNEYLINDNNKQLWKEGLRNLVSCLASSNWKASPKSLETAVEGVNAASVISRLMSSDYLLSSNNDNNNGKIWWEPLVEKLHEEADDQLVRMIQPHQLSGIKFSIDCIQLSSSTKQDASDLLSSHQQQYLLPQSLQIAYDNLNLPFSVRPGFLNGNDDDDDDVDNKHNNNLFTVASFVKQVNFQIETIQTATNRTVAERRQTAWEGDEHVENFEYSEKSMRRLPWSDVVANVRDRLYNETSHYYDGCLLNFYPDGDSAMRYHIDPDQGVLWDYETAVVSIGATRRFSFRESSSGDGSNKPHVFVLMNGDVTEMFNDCQERFQHTVQKSSVKGESASRLEP >gi|Mcir1000002688|ref|jgi|Mucci2|106998|Genemark1.2761_g MPPTTSTNITFKSRRQQKIWERQRKETEAKKKTSTSYVNQAPFRYAERNFKSRVPPPDFSQVVDFEKMQDHSDIIVPVQLTDDLRRLSSVFGQCEAPCRDAYVLKNVPGLIIIPNAFTPAAQRSLIKQCLSVYPKPPNTSNLDTHYIIPDTGIWPLYEAQEKGTLKPTDPEYHVPKKVVVDGSSSTYSDDEDKEEEKEPPRMAPTACSDDFQPVIKDPKPDPLPAPGVPLLSPSEMVRKMRWITLGYQYHWPTKTYHLDRRYPFPADVADLTKAVVTAVENIGHGDWINQYKGEDFNAEAGVINYYQYRDTLMGHVDRSEMNMEAPLVSLSLGQSCIYLIGGLTRDTVPVPLLLRSGDIVVMTGPCRKAFHGVPLIMENTLPDYLSNNDQYEDAPDWKLFGDFMSTSRINLNIRQVYPRHQSEETVQ >gi|Mver1000002798|ref|MVEG_02791T0 | MVEG_02791 | Mortierella verticillata NRRL 6337 hypothetical protein (245 aa) MTRSIGDGRVPGAPDSIYYLPDFISAEEEQALISKVLTAPKPKWVYLKKRRLQNWGGIVMNNGMIAESLPTWLTNLHPRFQESGVFDGLHPTLNEPNHCLVNEYLAGQGILPHKDGPAYLPTVATISLSSHCILEFYKCPTGSDEPGMDKLSNSRSQEPEFSILVQPRSLLVLKSDVYKSYMHGIREITVDTLAESNILNLVEAMPGMDLSEARAKQLDRGTRISLTFRIVEKTKSGRKFLLGR >gi|Sarc1000012860|ref|SARC_12577T0 | SARC_12577 | Sphaeroforma arctica JP610 hypothetical protein (80 aa) MHHFIAQFNVTITHVPGELNKLPDMWSRVHQGTADINYPSVFYTSCWKLYPDFFDQVQKNLGPNDVDAFASSHNTQLDNT >gi|Wseb1000002876|ref|jgi|Walse1|60458|estExt_fgenesh1_kg.C_100073 
MPLSKQRFARPPTPPETDTAIRRSERFYKRKDIPLDLSYAFDWQRDEKDAIKIAEKCYTFEKHPGLIFLPEYLNEEEQKGLIRQSVKDIPTPPNRTSLDAHYYMPKEGLWYHYANQTKDDIALPRATKEEKREPPSYYAPSGTRPTINNEPSTFEILKQISRSNNPEIPPSTTVKPLNGERAMNKLRWTNIGHYYHWGLKQYDFSVRDPQTGGPIAVPAPVSDVCKSVVSSIPWERTSVADQASEWKKSYRPDAGIINYYNLNDTLMAHVDRSEVTATLPLVSISLGHSAILLIGDDIRESINPPTAIVLRSGDVIVMSGPTRRSYHGVPRILERTLPEHLKSQEDDEEWEPYARYLSKTRINVNVRQSGLTDEQITELVSV >gi|Chet1000002884|ref|jgi|CocheC5_1|114719|estExt_Genewise1Plus.C_320154 MDAFVTRKRKRDERVVKPAVATTRVEPGEEEGKEEECTDFKLAVLASLHAGVEETALLEALLAADGSVEQALEYLAQSSRSPMRKRPAPATVGYQSSLTSYRIAPPNGALVNKSVVKKGKTLFLYSPEDIETHTPCSIIHNFLPAKQADALLLELLEESSTYQRFEFKLFDRVVQSPHTYAFYVNSLEEVERQKTEYVYDGRQMNNVRQSPPSLLAALPAVQTAVNTEIQRRIRTFYPNAQKLAHQSPQPWHPNTAFVNCYDGPHQSVGYHADQLTYLGPRPVIGSLSLGVAREFRVRRIVAQDDDARADGSRESTADAQGQISIHLPHNSLLVMHAEMQEEWKHSIAPAHAIDPHPLAGNKRINVTYRWYRESLHPKFTPRCRCGVPTVLRCAMRKKESRGRYMWMCHVGFVPGKEGCGFFVWAEFDADGEPPWAAGKVKKDEEQRSVGSV >gi|Sarc1000012919|ref|SARC_12635T0 | SARC_12635 | Sphaeroforma arctica JP610 hypothetical protein (292 aa) MASNWAKLQQQIVCNPKKKQKSTGVVQGKTIGNINNGSKKTAPFHKKTQSGIIRDTAASTPAINPDISSGKRKLKVDGNECSKGNKRKKIHQQTSFPTDVPTTQVNNQQNNTIQNGRGDATQITSSTRHIDKQLETSQKKSKKKIKHKQAQIQSRKNDKSRSDCTHIEVSSNSQLPISATKAPTPFTIQANTCHACRKFTTPQDCVCTKGSKKESVCVNAFKQCMAHSYRGFSRTSAEKIESSLHDSILDTLDDMVKKNYFHYDIVSAGKAVCRQYVFEFILYTKLHVNEG >gi|Sarc1000012918|ref|SARC_12634T0 | SARC_12634 | Sphaeroforma arctica JP610 hypothetical protein (212 aa) MTYHYQRLRIFALPWDDDGTGVYQADTAKCESGRPVSNLLALKQLNSTLTTNATELLLQYHKEGGDKHNQQIAQKGVVPRGSCKYSVSLINYMFCEKDSKVPLKNEAVYGMGPTSVSWHADSSLQNYSTIAVYQITGNGPIAGSQNQTKRSSKTSDTTKGSEDDWSVAMRVVADDVTPAVACRLKSGETYYMMDDFNHHHHHAGMQHSNGT >gi|Vcar1000012920|ref|jgi|Volca1|99783|fgenesh4_pg.C_scaffold_95000030 
MKLGELRFQVSTVGSGPNSLQCYIAEFKRLMADLPYRHEKDHVLFFAQGLQDDLRKEIFSRLRNPGSYIRLQDAAPTLAAMTSTAAPTPPAAVRLDPVNVAFKPPPAEQVSHSLPIASKPARRVGCMRARVAAASTTEAHRVDSDWMLSRTVFLDLDSQYGPFTVDACCDDFGINAHVVPFFSPSRSFLSAHVDGECVWMVPPTSSPSAFISQYLESKTTNPRTSAIIVLPDRPTAPWAPLIRHMTVVRRFPTGARIVCRRDPSDASSHATSSLSTPRGRGAFAMDVRQGAQRAGDGGGRVPANCMRYEPTHQVPIEVRPTLAGQGVRLRHPCDLRTEAYKAPPPRLLLNVTKGLSIKRMVTVWFLFICTP >gi|Pbla1000012921|ref|jgi|Phybl1|77051|estExt_fgeneshPB_pg.C_50358 MTESTPTSSYPISKRQQKILERQKRQYAERKEKADTYVNQTPFRYVERNFKSRVPPPDLSHVVDFNNLDNNLKRINDEIVELHITNDLRSLSSLFGEHDTEWENRAHKAYGLKSSPGFIFIPNPFTPRAQRHLVKQCLSEYTLSPSTSNLDTHYVTPPNGFWNLYEREHLQDLKEGDADYFVAKKAGFTGSEQRYDSSDSEDENSNNKSNSISNFNNTFEKETVPTPTACSREFEPVQWQPKPDPPPAPSVPLLSPRELFRKMRWTTIGCHYHWPTKTYHLDRRFPVPEDVRDLTQAVAHAVERVGYEGDKSRSWKNEYLGADFKAEAGVVNYYQYKDTLMGHVDRSELNMDAPLVSLSLGHTCIYLLGGPTQDTVPVSIYLRSGDIVVMTKHCRQYFHGVPKIIEDTLPAYLSPQTAFTDTPDWEPFGTYMQTSRINLNVRQVFPKKNE >gi|Vcar1000012957|ref|jgi|Volca1|46013|gw1.105.40.1 MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFAQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA >gi|Sarc1000013043|ref|SARC_12758T0 | SARC_12758 | Sphaeroforma arctica JP610 hypothetical protein (243 aa) MVPYNSPTVEGGVFALDTWFGRSWYMNAPFSKLDAGVQRVKADRATATLVVPYYPETAWFKQMVPSLADTPIVIPPAHGVFLRYGREPMPPPPFVTLICHLSPRCEKFTSAEGFWAAVDAHRPVTSALVRALANRPDDLESCIVQDRVAPAVAGVVEKPVRCGHDPSLGGFALKPIEHVVGSCTFMTTTVKPDESVLEESVRRAEREYDVHRARGVAVMTRAQRLRASAAGEELTVEESVSLL >gi|Vcar1000003043|ref|jgi|Volca1|92007|fgenesh4_pg.C_scaffold_23000205 MPFCHDTIPPRRHSGAGHHEEDVVFLVVIRQVSHKVFILHNVTLERVLPGTLRQSKHGKLCIQFVLNDRGQWGEGLEHGMVRCEVCSRVRMKTLQTVAHCAVKNRKGDFCCLRALHGLINVAREPPPAEQVSHSVPVCSAVQRSAEPDTPLPPPFNVSRLTQIATASTTGVHWVDSDWMLSCTIFLDLDSQYGPFTIDACCDDFGINVPFFSPPHSFLSAQVDGECVWMVLPTLNPSAFISQYLESKTTNPCTSAIIVLPDRPTAPWALLIHHMAIVRRFPAGVQIFCRRDPSDASSHPPL >gi|Vcar1000003045|ref|jgi|Volca1|92008|fgenesh4_pg.C_scaffold_23000206 
MYSCKENLWLTAYSYQLAGEFCNPRRIFTGKKKYDVVVFPKIVSIADFSDLYGILVTFLYDGGAVILGPTTYDINITTVFTQSMASEVSFYGDVGTAVTIDCGYAAVPVYNSSNRSAQWNASSAANTTTTTTTNTTGSSSAAILPSSSWPSLLSAGLASDEPTSYSANCGPLGGSLGGGVMYSDSSPMKATTVFSLPRGSGRFYYVGLDFDNDPRDGSWGQLLSAIAVQAASNPLSGNGSPPSPSPPPPPPATAPEEPGLYDINVNTSSSTTQWPLGRGRLSYTAAGLTATSALTAPPSPAASRTLYLCHDLPVVNGTGAAANGSSNSSGGGGAWLVIDLGTRRRVASVRIQALEAGYDNRSVIRGVPVRVGAVPPNDVTDPAAVNDPLCPALAHTIWVGAAVASGSAAVQVSCGANNSLVGRYVSLQLQDDIPGASWARMCGVDVMGVLPASRSLVSRNKPICVNAGASSFPPDMDPSLAVDGLYNDNRNGGGLCAMSALRQDPFFTVDLGAVMTIERVEVTKSWRNEYDKQLSNFSIVVGLTSCISVDNPLTNLTSSQAVVCASGLTLPPGTTGVYNCSGISGRYVTIATQGYNGYMRLCELDVYAAVPSYGAYRVSVHKRVTVAVDYLDSLLPGLAAATNKSVAGMLVDGVTPTRLRSPAAGGDPSAPLGRTCLVAATNYPSAASWGSNSPWLRVELGGRFLVQHVQLHFAAHFDDLLSGATTTTATTTATTSTSAATTVTPANYKYTLDVRLGDSAPPVMYTSGTENPACATGAGFFTGLESFTDLYGSGENVRRYGCGYYGSYVVVTVLRTTADGNTTLALPPLCEVEVFVVAEDGNGGAAAVGYYLSSLGHYAAASNGNGSWAAVNATRALRNPNYLEAAVREEDRCVVAAAAPGGYAAWLVDLGSPLEISMVEILSSEYSTATVNLTILNDTAAAAAGGGGGTLLLSPGSGTLLTGAANLSLPLSTVTHVLPAAAATASIAVGNASANGGNGSGTAGGGGSAPVGRLLMVTNTVPGAELRLCQVWVYGTVDGGGGGARPGIAVRKLSGRVAANFGSLPLDTGATGTTSNTSSAIRALLLHRRLQATTDTNDHNRLLTGPSDVIGASSSTEDISTSRTAVGTAAAAVDATDAVLGGSSGGAAALALRGKDISRRRSLLQTNSSSPSSSSSTSTTMTGAASLVLDLGLPRAVEVAVLVMESSLSYSGSTLELSGVVISVSNTSSGTPGSYCVYDVTIPLSQSRHVYGCGGAYGQYIVASRTVSYALPNGTSLEVYLTDPGTELLVSAHRTTSQAGGVTTGAGGSGNAVNGYYSQAALDAAAAAAAAAAAGAGVVSGLFFGSATAVSPSPWWAVDLGSALPLAFLEVTAHPAPTDGLGLLDFEIALTNYSVVTGTEGIAVLSGLSLPAGGTGRYPLGGAAARQLVLRQPGAVRSLQLAQVDVIADRATVTAGANALIGPKPLSYPLVYDMARTSGSYIDPAVSVGNATSSGELAPLAVDGYDSTLAASCARAVPATSSQLPWLLVDLGSSIRVDRVDFLKAADPSLAAELEGVDVLLGNNTSTSTPSSVVFPPPALSTSFHPVALANPVVLSKLSLPQPGAWSTFLLSPPASGRYLVVRGPQRGATMTLCELQAYGESAFEMELVPAPRPPQPPPMPFPSPQPPSPPPPPSPPPSSKSSSRSREWPVSKVSTNPTAVTAVTASIVGVSIGTTVTATMVSSAAATAAAAAAAAGGGAVAASSVPSAAAAGTAGTGAALAFVGHMQFFALTANAAANTSAGYQTTNGQLSWLNLQFNTLRSLDHLPEEEQRALSQALNIAIAYVGALVLHFLVVMLFSALRWVIALNRRGAAAAAKGATADGKSGGAAAASAAVAPILPSFLVFPFVEVFVIIFFITSAATAAGTLLSYGITAHRAVSIAVGIVILVLLVIFTAAVLWLLWRLYAAADRLGLSYVWTRKPPPADGSGGRLAHWLRLAERGYWERPEGVDVWLMEPHRT
QYYLRQGFSLPQALRAADPNHHIDHTAAVLEGQEGQEGQKGQKGQEGEKHGAVAETVMAAAAAPGIQRANRSSSGSGGVCGSSTTSQVQPGGDVREGRKEKDGPLGGGGVATTAEAVAGVQPTSQNRNTKPTASRGKAWASFTTASDAAGGGGGTAVTGSRPGSAARTSAAAAPVAESAADNTIRAAAGGSGTAAVGDGGSARRLLPVAPLPSPPAAHNTSASAAAPRRTQALIVFEEPSDSDSDCILDPLTGGGLHTAPPPPPPPPPPPAPPLPPPPAPRPVLLPPLDPHNRDSLIKAIGLPREPAASNHLGSTTAGGGGCNGGARVSGGGGGGVRQDGEFVEEHTAWAAVICNLVLPAVLLGVQVGAQMEPHSSAARGTTVALLVCKALIAIYMALILPYNNIVVMSVELLCAWLETAVCACLVGLQWTHGSEPGISDAMLGCEITVFGLQYLGSKTTNLRTSAIIVLPDRPTAPWAPLIRHMTVVCRFPAGARIVCHRDPSNASRFSESNERKPRFSVYLPRAYSAVRWWYGGAVLVALVAMFAGIVMMPAKRRVGDQLVELQMVHADLHAQLTAIEQSDDYQKAASLLASDPLARLNKKKFSTAMGHAWDTYVSLKNQMASVSANIAQLEMTMEQPAMKVLVSDKSGRVMERPFKDLDYVNDSWVNKSGKEHKALKRENRELNQIYHAFRMRALRLGASPHSLYSFKSLPFACLAMAPAPSTYPPSSATNTAPTLAAMTSTAAPTQTAAARPVPVNVAFKPPPAEQVSHSLPIASKPARRVGCMRARVAATSTTEVHRVDSDWMLSCTVFLDLDSQYGPFTVDACCDDFGINAHVVPFFSPSRSFLSAQVDGEPPYSPLGSPHTPHDRRAPLSCWRADRLPPRPLRRLQHSSSTTVFANLHDHLSTKTHPLHRFLYPLSSAA >gi|Smin1000013068|ref|symbB.v1.2.011515.t1|scaffold777.1|size163462|6 MPLRKKKKHLEGEEGGTRRGPPTATKSGFLSCRVANDEWQTTFGAWKAISSYFASYKKKSVWMPFYYDGLCADHLRSLGFQNVIHRQEDFFERVLDKDFLSRIDLIWDNPPYTAPEMKEKVLRALAECGKPFVMLLPISVLHVGFVRDIVDMQRVQAIIPRRVHVRKTGEEILPFKYLCWFCYRTELPRDLLFVNDVEQVSNPSPSEMHSAAPARKKIPKKAMKKSPRVKKKGSPFHRLPFPRVFCLVHEG >gi|Mcir1000003087|ref|jgi|Mucci2|107410|Genemark1.3173_g MNSSINYLELLTIWKHFGGTKLTCLMDLAAQIWRHCFATDTRLLLTYIPSKFNPVDPLYRNQLRQLEWSLSTATFNMLNSIWGPMQVDCFASQTNHKLPKYISWTWDLQAIGTDAMLIDWRKLRCLYLCPQWNLILPVVNKRNMVSNSLQAGNSSSHAYKSYGYHLRKRRRVVPSGGQQQLDFPGLEDPRIAERPRPSGVSSAVEALLAQAPAPHQLAQDSSSLYQMV >gi|Uram1000003236|ref|jgi|Umbra1|231739|fgenesh1_kg.14_#_327_#_combest_scaffold_14_20870 MNPTLEPYKLPGIPDSAYYIPNFISAAEEEYLISKVQTAPAPKWVSLKARRLQNWGGTPKVEDGKMLQEPIPTWLKEPIFKKLQSIGVIFGDSPSTEPNHVLVNEYLPGQGIMPHQDGPLYNPIVATVTLNSHSILNFYPHGVEKATSEPEFSVLLEPRSLFVQTGQLYQTYLHGIAEVTEDDLAERPPINYHTIPNRTLLPRQTRISLTYRHVKRAIKNPFAKTLFKH >gi|Spun1000003257|ref|SPPG_03428T0 | SPPG_03428 | Spizellomyces punctatus DAOM BR117 hypothetical protein (341 aa) 
MWCDRCGRISRRVWVYKWVCEGCQHTIEIPIPIYTRTTIRHLPPSRTEQYGCRSFKPSSNIRGYLERIPNHVPRFGGWERMVYVFPEGGHVHHFLAGNREDVDGVYEGLQRGRLPFRRMGVKSRLNGLLTSNFVLNSGEAYAQASNLPARSMTENTRIELEAIQWLQDAVYEFGTVYNQSHKNLSLDGDEKEMHWHDDGEHTVHGPVSSLSLGSDALMLFRTKPTPTQKPKTVLSLAVRHGDVVLMVGKAVQRYYEHCVRVRGHRVSVTGRMVLNEGMRVEEGVWRVLEGVERGVGCARRRKGRLVFGKEVEEEKEVEVEKKGEEEEWDYPPAPPSSDIE >gi|Sarc1000013250|ref|SARC_12965T0 | SARC_12965 | Sphaeroforma arctica JP610 hypothetical protein (182 aa) MFAARQEFEWIRMFWLQGESHAQSHERYWLGCIQTLTHLWELSEAMLSHIVTVLGADVASEAGGARVCVSKDVRTYAMTIYILREIRFRRREYAKRCKSQAYHFLEARNKPVENKLKYNIPLDRAGLEAYKHLDLPQGVSLFSHTQACRKGARTCAFTLPKDLSPTIEAIEEQKKAIFGKV >gi|Vcar1000003269|ref|jgi|Volca1|92356|fgenesh4_pg.C_scaffold_25000154 MAHICLHEHTFRGAPGKPQQGATASKLPNCWTYRLLYVYAEEQKTGPLVRHQSQLAMFALAQGSGPNSLQCYIAEFKHLMADLPYRHEKDHVLFFARGLQDDLRKEIFSRLCNLGSYIRLQDVIDLACTISMERDAAHRSPHSLPQGIRTINTAPTLAMMTSTAALTQPATALPIPVNVAFKPPPAEQVSHSMPIASKPARHVGCMHARVAAASTAEAHRVDSDWMLSCTVFLDLDSQYGPFTVDACCDDFGINTHVVPFFSPSRSFLSAQVDGECVWMVPPTSTLSTFISQYLESKTTNPCTAPISTLVII >gi|Spun1000003263|ref|SPPG_03435T0 | SPPG_03435 | Spizellomyces punctatus DAOM BR117 hypothetical protein (427 aa) MDRDAKLALLLSIFDDRSERNLLDALDFAQGRVEQAVEILLGNAEGHVSGGTNKRKHGATIDEFFGGSERGGKKMHLDRHVDVSDDASQPGTTTTSSPGSRNAFEVLCSGSLHAEDAKGVSLPPRTLTAKEVAEHIPCQLVLDVLPEELAASLLQKLMVEAESWKIKRFVMFDREVESPHTTSFYTDEPYTSSSSTPDVEYYYGGKKTTDVRTMFPELAQARSLIAERVNRALDERDHVQGGRHPAELVGIWEPNIVLANCYRGAAEGVGAHNDKMTYIGPRPTIGSLTLGATRPFRVRRIRRPGGPVPQTFNIMLPHNSLLIMLPPMQEEYKHEVPKCNPKLLIQHPISKDTRINLTFRVARPEYRDNIPVCRCGNPTELRVVIKKESNLGRYFYMCAGGGNETSGIESGSNCGFFEWLDLKGKS >gi|Pbla1000013272|ref|jgi|Phybl1|77820|estExt_fgeneshPB_pg.C_100091 
MPSDPFSWQLVRPTAEAEGVHPKPIKPPLDRWKGAVHGNCENSTQKHSFGTADTHKSQASDASLSSSTPIPVLEKPKQTNPIILTSKFGQFMGKMLLQSSNNRTKPVPKINEHSQQSNEPSYTGYFNIERKRRSSDGLKVVLRAVGQEERSESSTQKVVELKDSIPEDTIMDDTMDDTMDDMSIEEESIEDLPTSNSLLKQNLTVNTTIGEASTNNTPIEEVVLKNTPSPISPIGLNQPKDITLSDPRYHHKKGKRTRWDVGPILIPEDMECTQSFASLQLDDTSKALDYEPENLCDTLATGDCDACNIMNSQERGPLDQVALCESCKQTWLPNAKSLLDRLSKHSKDFVKPTQKQTSKQTLKQTSKLLSKQQKQPHQQDNQSQSQPQSKQKQTPKQTSKKTTKRLPPAKSVHLHSKKSSIQEAKNYYTTGAFLTRNTAKQLVDEAGFHPNPHGFTNMQKVKVLNINGHWYRGILTMMYGSKVKVHYLDWDDQEEWIVMGSRRLRGLTKDEEEEDEDANEEEDGEDAAIENENKDEENEEKDEDKDKEEEEEEEEDDILSIPAESKTTQPVKKGEHLSVSPKSHSRKSQSNNVSALNPKKHYTTPIDTDPTQIFNDNEIFMTRRMAHQLTDEHGFKPNSFGYRYNRAVAVSLRAEKGKRNRMEYNGLLREMRGNQVRVWYPSLRQSDWLIIGSRRLRVLTDQEASELDNLGTELVRTMDTRAKDSEISTKLPTETKTPSPSTTETLPEPQSESVPKHVEETVEDTVEDTVEGEVEGIAEEPVEAPVPEPAPGPIKRGRGRPRKVALPVDPSNPHIVTIPKTPTKTALKKRNDSIKNGIKETKKAAAAAAMVVEMGTKDTEDALDYLTTGAFATRRAMRQLKDEHGFVPNPYGYVYDQPIEILNTRSSKNKFWERGRLIGMCPGKVLVRYDGWGEVYDEWVMVGSRRIRPAAAQIESSGDQKSSTMASTENTAPTSTLNGGSASTKKRAKQAARNDLLVTEANPEVEDEARKKRQHRVLGPEDYERLGLLAGSEKVEKIERRGRKKMVRDVETPKETETMKDKAVLIEEEPKPIEPSVEAPIVQNEDQEMAEPESDLTKPNGAMTVPEGTQNDLPVKRKKQKAKTQQRKRKPAKATSPSPSVSSSTSLQQTQAQTELSTELSTETETETPTPISTSIVSVAATEADHDTSTLTSSYRRHIPSDESNHGFVANVYGYDYLQHVQVLHLDKKWYEGRLVSMERNRVRVHYCGWLDKFDENIAVGSRRIQVIENDHEVVCIEPTYSERLEKMQEEKEKKAVEPEDAQVVKPSKRREVAPTVVPAPEEPVHGTHDMVEYHMEAVDGMEVEENDTWKVYCNQCNIIIKQFRYYCTYCETPSEGHDYQSFELCLRCFDQNFPFWHEHPRSSFAVQAVIDADMGPMPIKGELVTVWEEDILEEIPDDTQDDLNDPDDMFSGTMEASEVFSGVAPLDEDQGYKFLKRWQRRKVCAFCNDDDDTSTELGKFIGPFVITSFNKNGTEKKRSFWAHDACARYSPEVFCTSEGKWYNVTIALRRGRGMKCYGCKEKGATIGCFESKCSKSFHLPCAQKPVSYFQSGVIFWCNTHEAYYKKKDTYVNIFNCDGCSKRLEDETWFTCIPCASSYFSSFDLCAECFHNFPQDHAHDEDQFEETSFAILKEVEAQKATEAAKAKEELRAANPKKKPLFPKRKRRLADGSVPLTCSYCGTEEAESWRKGYDGGVLMCTPCFELALFIDNDGNTASNESLVIDSEETHRYVMSIEDYTHKPYLTRDAVSATKFSDHRTGPRLASYGPQPNQLFSLVFDSTYYDIPGRAPRWATHSGTDYHGTWLPQTVRRAILKYTNKDERVLSNFLGRGTDAIECFLLQRRCCGVDINPAAVALSQRNCCFEVPPGLTSAEYRPIIAQADSRQLTGALFADESFHHVLSHPPYKDCVAYSTHLEGDLSRFTSVEDFRAEYGRVVRESWRLLKMGRRLTLGIGDNREHC
FYIPVGFHLLREYINHGFELEELIIKRQRYCSAFGLGTYLCVQFDFLVFTHEFVATFRKIPLECTDKMLPIDNSDCRDHVRVEHVTKAVPQSAISRKSVVMGTVWIFKPTDTHTFEQLCISRMLERFGKDDGNWEQVLLDFMSPESMMIQNNVQQQYQSSTSSQNVHKDKPEEQEHDRDLDLDQDQEQENNKEDSQNQLSDYEKLRLKRIEENNQTLLKLGLISEMSEDSDDVIHYESMMSKKPLENAPLVLVMVGHQPIEPRQIGLYRETIVQIALEAVKKLAPLGMLIIGTKDIRQKDNGKLWPMSMLVLEDIERAIDRSVLKLKEMVVTVPEGHSKDRQQKNLNTEVEEELEIVDEHLTIVHAIYLVFQRMNYSHNYN >gi|Uram1000003343|ref|jgi|Umbra1|232251|fgenesh1_kg.14_#_839_#_combest_scaffold_14_22505 MQAIRVQNSFRSLGFAKARVGFGLKGRNGITVGRVRSVSSNAYASLRNSPAILDSCFELSTAFTDNASSKGYSINDFLVYPDFVTKEEKETLTSICEKKLRRSFGPKVEYFPIHPDSVIHQYRECSASHWGKQDEFMKEFINKKIYSMFPDQMEWLDPHVLDLDGDGEIKAHVDNIEYSGSVVAGLCLLSPAIMTMRHKDDSNIRFDVLLEPGTFYIQRDTIRYSFTHEIRLSNTTWKGNEVDRSRRISLLFRDKKI >gi|Vcar1000013369|ref|jgi|Volca1|45995|gw1.101.33.1 MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADSFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA >gi|Fcyl1000123364|ref|jgi|Fracy1|197836|e_gw1.35.136.1 ; gi|Fcyl1000138707|ref|jgi|Fracy1|213179|estExt_Genewise1Plus.C_350063 ; gi|Fcyl1000088052|ref|jgi|Fracy1|162352|gw1.35.136.1 ; gi|Fcyl1000100129|ref|jgi|Fracy1|174601|estExt_Genewise1.C_350073 MSLSSIKNNKNKKKNNNRLPKISLSSSSDEGGSGGGDSSSTLVRVTYSGVTLKLHSAYLEKLQRLYDRTQQRRRRQQQQQQQQHQVGLSFEESLFCLLCRYDMIQGAGLQAGVPGSIMDILLKQFNCTKECFASPFNCRYETFSSAFYDIDFYFGSHGSFFEFELEEGICYQANPPFCEGLILQLNDKITDILLSSQQQQQQRPIMFVVFVPAWHESICYQALLSNKYLTQHLLLKQGQHWYAEGTQHRRKDSFRVASFDTSILFYQNEAAKRLW >gi|Vcar1000013363|ref|jgi|Volca1|69777|e_gw1.101.32.1 MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFVVPKDVFTFECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILADDWLERKKARMCWESHTLTVRKATGDQCSGRNGNLPKRKPAETSFGDSGEEVLVLRKCRN >gi|Fcyl1000123395|ref|jgi|Fracy1|197867|e_gw1.35.124.1 ; 
gi|Fcyl1000138708|ref|jgi|Fracy1|213180|estExt_Genewise1Plus.C_350064 ; gi|Fcyl1000086733|ref|jgi|Fracy1|160962|gw1.35.124.1 ; gi|Fcyl1000100130|ref|jgi|Fracy1|174602|estExt_Genewise1.C_350074 MSLSSIKNNKNKKKNNNRLPKISLSSSSDEGGSGGGDSSSTLVRVTYSGVTLKLHSAYLEKLQRLYDRTQQRRRRQQQQQQQQHQVGLSFEESLFCLLCRYDMIQGAGLQAGVPGSIMDILLKQFNCTKECFASPFNCRYETFSSAFYDIDFYFGSHGSFFEFELEGICYQANPPFCEGLILQLNDKITDILLSSQQQQQQRPIMFVVFVPAWHESICYQALLSNKYLTQHLLLKQGQHWYAEGTQHRRKDSFRVASFDTSILFYQNEAA >gi|Vcar1000013410|ref|jgi|Volca1|46011|gw1.111.24.1 MFLRDEFRRVETELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCSISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA >gi|Ttra1000003432|ref|AMSG_03671T0 | AMSG_03671 | Thecamonas trahens ATCC 50062 alkylated DNA repair protein alkB (269 aa) MAEAAAEAAAEAATKAAAEAGSEANDRVPVVLAQPCHAGLLAGWADGEVEPLDALDALGIRVVPEFVTADEEAALVECLEAGRFTSVGGRELQNLGGVPPVLDEGPGMVVEPLPEWLGDVVAALNAAGLYSGEAAVPNHCLANRYPPGTGIAPHCDGPRFAPLVADISLGADAVMVFSEVASREIVAEVELPARSLLVFARDAYHKHKHAIASSPTYEGSPTRYSLTLRRVKRVAEAGAAREILYTDEGLRAARAAKFAHLRLISEKE >gi|Pbla1000013531|ref|jgi|Phybl1|78334|estExt_fgeneshPB_pg.C_140177 
MTERASSNLEDQRFKSIDQSFIHITNDSCHCNPALDYSHLSHNCSANYYSQHLMSLNSSTSHYTPILLDTLTPPYETNNALMHGPHCTAALSNNDCIDLLPGIKRKRQDTIGDQSASSSIYANAYQPKGQKTSNTDQIINQTNGFQNEYRSPISLVNSETYYRNTPSDLVHDNRIYPSIKQACISESFEAVSINESEAKQVVCILGNMLSIVKVDVSKEVAKRINNTGKYFDNHAQLDFKLYLLEWSYMPDLALDYLFRSLESERVCKDMSALRETVNQQTIDIYCLMTGFMTVYLVFIELLFQDLIGNVNRSSFVEYSSLFAEQDDILKKSDDIFLWHHSYTRHHTTILSSLRSQMQAAGIKFHSLQLIDTIFSQFYTLHFLETSAQKHQKDHGQFYTPQSVVQFMWMRCMSKQTLRDTLKTGRVPRVLDPCMGIGTFLCEFLTRLVSQSTTTPLLWNDPTMLRTMLSQTIPDALWGVEIDPFACNLGKLNIILHLFPFYKRLVELGECLTPRMINRLRIFCNDTLKLTVESKPVTESGTEAQLWEKEQLEKLRDAALFKFDYVVTNPPYMIRKTGFIAVPDPELYDDSVLGRGSQAYMYFLWICLQRCEETNGQICFITPSQWMVLEFAEELRAWIWQHFEMLEIFQFEPYKVWPKVQTDSLIFNLRKRAPGRVPEQTLFLRHMSRKHNLESIIKSYNIFDRSRLEVQDPLIKYKLTSTEPVDFIHQIPHASFSFLSPTSSVSDQLMNLTKSLPRLCDGEMCLNTSSSCAPLVWNRGPNTNPVYALVVRTRWALGVFGKECCDQWLRPVFYWSGKSSSASRRRKTSGFQTVSKEATFWQDRDPLRLTKKENSPAEAYVPLCRSDPDDTRLSFYSMILVDKDGALQLEEEYKLYGNTANSSALYHYLLDARNALQTTKTDRNIAYCHYSKCGIETPVKIVHPINFGYFTRTQPRQRFFIDTDRRCVTNQCMYFTIKSEYPWQSADFFCGLLNSSTLQFFIRDTCYYDQQGRTRFFGKHMAKIPFTPPRSTIDVEIMAMFVRHLTTARQWIYGIIQLSNTLNVMEQVRGCTWHIPLVDQALLSLCEQRTPNWRIDLFRAPYDANDWINIIIQSESHTRSVGMYEELTRLLKVASLFQYCIDQLVYDIYSIPADLQKGLEQELGLILTETWKNILFEETSVKDHCLTWGLCMEEVAKKIFDKDS >gi|Vcar1000013547|ref|jgi|Volca1|40642|gw1.120.3.1 MYLPDEFRNVENMLGRQFTFDAACNNSGDNSLCTRFASPSNSFLTSDVSGEFVWANPPYTKIELWHKHYLRCKRMRPETTSAVFVVPKWSQIERRMRAAGHQLLKTYAVGTKLFLEKADDGSRQEMPGIPWPIQLWYDPPAKEVQLNSIGSSQSMIFGAFLCQTPCSALIDSGADCKGTADGFISSEAARRMGLAIRPCSTSSVRVVNGSSELIEGLIHAKLRIASFHDTVKLLVLKQANAGVELILGAD >gi|Vcar1000003571|ref|jgi|Volca1|45982|gw1.10.241.1 MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTHVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTTSTVRVANGTHELINGSITATLRIGSFHETVTLLVLKQGVPGVEIILA >gi|Smar1000003623|ref|SMAR010103-PA pep:novel scaffold:Smar1:JH431960:112870:114093:-1 gene:SMAR010103 transcript:SMAR010103-RA 
MLFEILHSNLVSVRHLASVIGKLNATMLAVSSAPLHYRGLQMLSAGFLRGGSYESECSLNTETRTELAWWLRNLNICKGRSLLPKPLVLIIETDASNWGWGAFSKDFSIGGPWFKSFKAKHINILELTAVFWALKALARNYSNATIKLKMDNISAITAINKLGSPRSPQLTAVAQDLWTWAMDRDLTIVAEHIPGTENCLADAASRGRLFDSGDWKLDTAVFDRLGSVFGKFSMDLFAACHNCQIYPFFSWKPDPDAIAFDALTQAWKGGLLYAFPPFSLIAQCLTKLRNSDHPQEIVLIAPVWPSQAWFPLQWSASTVFPVLLPQYDQLIVNPAKEPHPLMIDRSMRLAAWKLSTPSITTRSFQKLLPSWLGPPGNPLLPSNMIYAGENGANGHMSNILTLFRSLP >gi|Bnat1000003625|ref|jgi|Bigna1|73330|fgenesh1_pg.23_#_199 MELGSSFRRRDKASRDDASSDKGKPENKAGGAAATNLFRIAEKRYRLYKVNDTASPPSSAPRKKDDAQQVWRIPEVEGLYIIKNALSLQEQLFWAKRAIKEFSNCAHTNLSNLHGTQPDHWANAVECNDVGKDSEFYKLRWASLGYHYDWTQRLYHKYQHQNEKSDFPENLASMADALARRVGLKVKSEAAIVNFYPYDGSMGGGKSKDVEPTAIFVNSGDAVIMGGHSRLCFHGVPRILEGSCEELARFLHDSPSKEEKLIGKYLTRSRINMNVRQVTQLAKQTGIRGIC >gi|Smar1000013635|ref|SMAR001291-PA pep:novel scaffold:Smar1:AFFK01015044:37482:38168:-1 gene:SMAR001291 transcript:SMAR001291-RA MLKTKFLRVGNYESHCSLDHDARNELNCWVKNLDFCKGRSLIPKPVSLVITTDACNTGHRGWGAVCNNVKISEKFSSFELKKHINVLGLLGAYLALKSFARNSSDSSIKIRIDNTSAVAAINHLGSPLSPELTALAQDIWSWAMAHNLNIIAEHNPGAKNIKADAASRGSVKDSEDWKLDPVIFSRISNLWGPFSIDLFASHHNHQFCSFFSWRPNPSALAFIALIQD >gi|Rall1000003656|ref|jgi|Rozal1|3657|O9G_006133m.01 MIPMYSKIVGPLEKLRCQSVIEWNEEYWAIYTKLRTILSSEIILSFPDFDVIFDVGTDASDKGIGAVLYQTINGKTKYINFAARSLHDGEEGYGATKRALLAFVFALQKFRPYLWGTKFNLYTDHRSLTYMYSQKHVNQMLNNWLKILLEYDFNIYHKPGFLNILPDAISRLYDADPEIEEIELKVWSTTVQKNIEWRLNPKIFEKLNKELGPFEIDAFASPINHHLPMYYAKDIDALTQNWKDKYLWIYPPENLIDKVIDKIYKDKAKAAILTPFDKTKSYFTKLEAIXNHISLNWKLLVENSL >gi|Hrob1000003787|ref|jgi|Helro1|185055 MSDQRRNVVQGAWANVSTRSKKVDGSLASKKGMVTFDVRKTAEKNDNASSGSNLVKKSSFQFENVELPREPSFRTINQSGDYIISDEPNGQSRIALVTNFIDSSECGWMFDQLNDELSWKQHEVHRIGKTYMQPRMTAWIGNVPYSYSGITHKCQCWNPLLTMLKDTIENKTGHQYNSVLANLYRDGHDHVPWHCDDEKDFDTHPSIASLSFGDSRIFELRRKPYKNEHLKSLKNDALTDADYLEYIKIPLNAGSLLLMEGAVQLDWQHRIVREYHDRGPRINLTFRSIIDR >gi|Lgig1000003858|ref|jgi|Lotgi1|115378|e_gw1.21.357.1 
MGTAKGDNATAPDLKGPGPVERQDSITSVPSPTPSGSDPVHDLPPELLQAGWRKFWSKREGRPYFFNKLNNESLWEMPQLGFHGNMHDPMTDPLGINTPEGHAPLPAISKFIYKAKSFTINFCNIFYEFVLFCSPFWNFEIPTNCVINERAPVNIPGPYPDIEQLRAQLVTKLRQNYNELCYSREGIEAPRESFNRWLLERNVIDKGTDPMLPSQCIPEVSQSMYREIMNDIPVKLVKPKYSGDARKQLFRYAEAAKKMIESRGVASESRKIVKWNVEDTFTWLRRQQNASYEDYLERLAHLKRQCQPHLTEAAKTSVEGICKKIYNQSYEIVKKLSERHWEILKENNINKMDPLPEPTTPRKVLCYSIQVSVPTPRPVLVEHATENDITSLRYKSETVKINSSHFHKLEQLYKLNCRDDPRFDHFLCRVWCLLRRYQTYFGIHTNEGFGLQGALPVTVFECLHRVFGVTFECFASPLNCYFRQFCSAFTDTDGYFGSRGDILNFFPKSGSFEANPPFCEELMEAMVDHFENLLHESNEPLSFIVFIPEWRDPPTEALMRLESSRFKKKQITFPAYEHEYRNGFQHICPKNDMSVKSLHGTVAIFLQNDAGFSKWGPTPERIKELLLSSKPRDTVS >gi|Sarc1000003931|ref|SARC_03841T0 | SARC_03841 | Sphaeroforma arctica JP610 hypothetical protein (111 aa) MEPEDVVPLPTEDDFIDDEDALPVAGCVPVAHEKRTQRLNPEWFRRAMEKWPGVVINLFVNRYNAQLPDYASDEPRVGTWGGNPFYGPFGCHFWICLSAMAFSSAVLVVS >gi|Fcyl1000123964|ref|jgi|Fracy1|198436|e_gw1.39.264.1 ; gi|Fcyl1000084768|ref|jgi|Fracy1|158883|gw1.39.252.1 ; gi|Fcyl1000088083|ref|jgi|Fracy1|162385|gw1.39.270.1 ; gi|Fcyl1000086781|ref|jgi|Fracy1|161014|gw1.39.264.1 MLAKNKEKLLKINPAHYEKLKTIYIATTTTTTFDLAEFHRRLFCILARYHGIQGHGFQAACTEHVFDVLHGRFGVNMECFASPLNSRYASHCSAYPDTDACFGSLGSFFQFHPKQGSFEANPPFEPYVMLAMVNHMEVLFKKATGALSFIVVVPGWKESEAFQRLQASKWNKKSLPIAQKDHGFCDGAAHQRRDRYRISPYDTFFFFLQNDAAASKWPLTEIAISELRSAMACGVPSPSMQARHDKAGRGTDDIARGVYKGKKRKATGEGVMSRKLEETRLKKAGVKKAGGAKKHKRNT >gi|Mver1000004016|ref|MVEG_04014T0 | MVEG_04014 | Mortierella verticillata NRRL 6337 hypothetical protein (426 aa) MPDPLISNRQRKIMERQAERNREMAAAKRSDTRTPFREAELQYLSRHPPPDYSQALDFRKPHEELVEDPKVKPVHLQKPLNEFCPLFGSEDYQGVGKSRYAFLHEDHPGLIYIPAGFTPAAQRTLVKACLKDYSRHPNKSNLDTHYTVPDSGLWDLHEDVFYDKREADDPAVLVPRKATTDVHAGGYGSDDDDEEEENNKKSKKSKISIRTLVPITDDTPSVPDNVPKTDPEPSAHVPILPPGQLVRKMRWITLGYQYHWPSKTYHFDQNAPFPPELCELSKAVVEAIHEVGSYPYASEDFVAEAGVVNYYQLKDRLMGHVDRSELNKDAPLVSFSFGHSCIYLLGGSTREKPPTPVLLQSGDILVMTGPCRAAFHGVPRIIEGTLPAYLQKSQDPDWDIYAEYLAEARINLNIRQVYPPKKGET >gi|Fcyl1000034107|ref|jgi|Fracy1|251973|fgenesh2_pg.39_#_31 ; 
gi|Fcyl1000123871|ref|jgi|Fracy1|198343|e_gw1.39.252.1 ; gi|Fcyl1000123890|ref|jgi|Fracy1|198362|e_gw1.39.270.1 MPRHETTTTVATGLGSKVAKKQKVKAGKSNIWNPDVCYVSEKKECTTSSGKLVENFPDPTCSAEFMRMTSTNWLRKQLHKLRKSNKSAEDVEFMKPAVMAVERWFMRYSLDHPHRLAAGEDPVLPMPGADMEEIDPHFVDDLVKTGKDTPEAAVAVVKDFYALTRKAALDILKKEKALTKEETVVVIHHRHSCDVMLAKNKEKLLKINPAHYEKLKTIYIAVQESSSLSLTSGKKKKKTKTTTTTTFDLAEFHRRLFCILARYHGIQGHGFQAACTEHVFDVLHGRFGVNMECFASPLNSRYASHCSAYPDTDACFGSLGSFFQFHPKQGSFEANPPFEPYVMLAMVNHMEVLFKKATGALSFIVVVPGWKESEAFQRLQASKWNKKSLPIAQKDHGFCDGAAHQRRDRYRISPYDTFFFFLQNDAAASKWPLTEIAISELRSAMACGVPSPSMQARHDKAGRGTDDIARGVYKGKKRKATGEGVMSRKLEETRLKKAGVKKAGGAKKHKRNT >gi|Fcyl1000014177|ref|jgi|Fracy1|232043|fgenesh2_pg.1_#_241 ; gi|Fcyl1000104334|ref|jgi|Fracy1|178806|e_gw1.1.2000.1 ; gi|Fcyl1000088076|ref|jgi|Fracy1|162378|gw1.1.2000.1 MPLWAAETRISIRSLASSGSIILPDPEQDAARMAALKRLRHKFVRLCSENNRSKPPVLAFERWLGRASLKRGISTSDGYDPIIPSDSVMDKGFAKDISRTLPSWAAANAVAEEMTKEATKQIRGMATQREEIDEHKDLGMLRKKIREEAALNESTTKAQQALGGANNSAVVGGGKVVLNGNRRDGIYDVMLCGPCGKPRRPYLTISSLHLSKLLRLWKLKNKEGNDDDDDGNQIEVIEVENPIDNMNALLEDDRIMFTKSLYCCLARYEGLKGAGYQCAVPGVAFDAAIACGLGSTIECFASPLNCRYQKFCSAFPDIESRFGSLGSFFDDEAFNPLTGTFEANPPFVPEIMVAMGTKLKRLLGDKSRGALSFLVVVPAWGAGIDFVTDLESSTYVRASSRIKASDHAFCDGAQHTKPLSNQADPNLRPSSWDTAVILLQNDSGALKWDVDDKKLEESFCNALKATIGQVPDKFTKLDDWERRGVGQGGGSAGSKGYQGNPNKRPFAKEGNRNIDRKRFARGYAIVLDWMCGDHQSNNYTGHFVIVARSQQTATFGVNNNSPRLKSGEVGIHSSAT >gi|Vcar1000014193|ref|jgi|Volca1|98959|fgenesh4_pg.C_scaffold_80000013 MLMFVTRSELRTDATSTVYALQAKQRDAPTMIGMHVHSPVRQTPCKTLQLHSQYTAALFSPIRPHSVSPTHFGTFHNYRHFANAGSCFTRTSVFCIQVDADTSCTAVPEDNFPAQISMDTIRQGSGPNSLVSYVAAFGRLMGDLPHRHELDHVFHFAKGLNRDLREEVYARLPQYGTDVTLQIIIDLAMGIFSGRHNAHLIDYNTAPCSSRDVHRDDPMDLTANVYSTTPHHSTSPLPSSIKTIGVPYNLRLQRMAQHVCLQCGDESHAKEHCPELRRLLAATTISQPNSSRRHSFQGANNSRKPPRSPARSPARSPSRSSTSSASEDRRSRGHRQYDNHSHRGRSPSRAPVKSALRPSSAETLSPFFSPSRPFLSTDIEGECIWMVPPVDNVSTIVARYLDAKTANPKTSAIIVLPDRPQAPWAPLIRHMTIVHRFPAGAQIVCRPTTSDPSSPTALLKLHFRR >gi|Vcar1000014202|ref|jgi|Volca1|84221|estExt_Genewise1Plus.C_800030 
MFLPGEFRNVENMLGRQFTFDTACNNSGDNSLCTRFASLSSSFLTSDVSGEFPGFKPPYTKIELWHKHYLRCKRMRPETTSAVFVVPKWSQVERRRQGASGARTAGGKCVPPVCGKHLPQMPRQEMPGIPWPIQLWYDPPAKEVQLNSIGSSQSMIFGAFLCQTPCSALIDSGADCKGTADGFISSEAARNMGLAIRPCSTSCSEPPSSLRA >gi|Lgig1000004215|ref|jgi|Lotgi1|118068|e_gw1.28.427.1 MKVYENFLTEKEEESLMLEIEPQIKRIRYQDTHWDDAIHGYRETERKVWNSVNQEILKKVRDISFAPGTPQIDFTHILDIKKDGYIKPHIDAVRFCGRIIAGLCLLSPCVMRLVLEKDKGKYADILLNRRALYVMKDRARYEYTHEVLAAEKSYFKGNIVPRDRRVSVICRNQPK >gi|Vcar1000014314|ref|jgi|Volca1|78281|estExt_Genewise1.C_1320001 MNSHDFRIIDYELLRQLERRIGRTFSLDAAANDDGSNSVCKMFASPTRSFLNSDCSGHTIWMNPPMKLLSDFLRHYHRCKSYDPSISSWTAPWASTAWPRVNCLDVNCSMCRVLTFPAKSVLFNGLSLEGKTVELPGIPYPAELWYDAP >gi|Psoj1000014320|ref|142118 MAAAYKASEKRWKRATDEDLRNDEALVDPRNLSKEQQAKVQRVGSWKWGPEDKERPVLAFDDFVASHRGFYVIPNAIDTKTQLQFAHACLTEFTEEPHVTNMHLQNQQETDIWRKARGSHPKDPAASPLLSKLCWAASGYHYDWTARKYYRHSFSAVPELLQQLGSRCAAACGMSLSAEAVIVNFYKSKSSMGGHLDDVEYTMDHPVVSLSLGSRCVFLMGGHTKDELPLEILLRSGDIAIMGGESRTCYHGVARVLPTPFDVPSDDFDALLQSEDDREEYEAVRTYLSTQRININVRQVYPVEPASTD >gi|Rall1000004359|ref|jgi|Rozal1|4360|O9G_000840m.01 MTPKPNNIPVHPIHFVPSIKTYVTNPDVLPIKGMSYISDFLTVDEQKEIWDTVYSNPFSTVIHRRQQFYGETYYHTTPNISSLQPLDNQTQLDSQNDNVADLKVENGDMKRCWTGKEAPKALPLKLFDSLIEKLINAGFFTKEDPPTQILVNEYSGPMGISNHYDDDKAFGDTIATISLGKPIWLNLQLPERQNNVCKSILKQTKVFLEPGSLFVMQTDARYVWRHGISNAKWIRYPDNTPVRRDDDYVRVSLTIRKLLDGRKRVTKTTTDWLEIPDV >gi|Vcar1000014369|ref|jgi|Volca1|70135|e_gw1.112.21.1 MFLPDEFRNVENMLGRQFTFDAACNNSGDNSLCTRFASPSNSFLTSDVSGEFFVWANPPYTKIELWHKHYLRCKRMRPETTSAVFVVPKWSQIERRMRAAGHQLLKTYAVGTKLFLEKADDGSRQEMPGIPWPIQLWYDPPAKEVQLNSIGSSQSMIFGAFLCQTPCSALIDSGADCKGTADGFISSEAARKMGLAIRPCSTSSVRVANGSSELIEGLIHAKLRIASFHDTVKLLVLKQANAGVELILGAD >gi|Smar1000004446|ref|SMAR009389-PA pep:novel scaffold:Smar1:JH431896:51121:52792:1 gene:SMAR009389 transcript:SMAR009389-RA 
MSLSETTAMTAAILAMQEQGIIKVVSPVFGQVLSPVFPVPKPDGSVRLILNLKQFNTNLEYKHFKMASVRDAIALMQKDFFMFKLDLKNAFYLVLVYENYKKYLRFLWLGILYEYQVMSMGLAHAPRIFTKLIAPVFAHLRVLGLCGPNRLFTSYWYIKPLEAFKTGQLIKNNNDYDAIVPLYPDIKSELQRWVGIDMFTPKSLISTVSTVDIFTDASKNEWGAVWGNESKFHINVQELLAVLFTWKSFFPFNGTKHLTFHIDNLAVVSLFKNHGSSKHLLHSFGRELWEEACRHKISLFFVHVSGKENTIADFLSRNFISADGEWSLHCSVFNKLTEEFVMPSIDLFASRLHFKLKIFILGLQTPKLQRLTPLLTFGQIFHTPFLLSVSSPRSYGRSIRTKQPFYYWCYFGRRNRGFHARWKA >gi|Bden1000004472|ref|BDEG_04459 | BDET_04473 | Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (226 aa) MDSSVGLDAADDPTRNHANLPSLHKHPFEPVISGLRLIPDFITQQEELDLIASIDAHPWSGYGIPPNPELKRHTQQYGFLFSFRTRTITECLGSLPAFSSFVIDRMLLPEFNVFPNDPPNHVLVNEYQPGQGIMPHVDSQDTFGDVVTSLSLWSSCVMSFGNKMTGEKVHLELPRRSLLILTGDARTHYTHAIPKEDMLFAGNECVDRGRRVSLTIRSILKSAIP >gi|Bnat1000014488|ref|jgi|Bigna1|25582|gw1.3.140.1 KLFSLLRDQVPWQRETDDFGPQQRLSYYMGDPDCTFRYVGLSLEPNPWLREMENVRGVAEQDNPRILTGCLLNNYEEGGSFIPWHFDEVRAHGDSKVVMSLSLGGPRRFDLRRRRQSLHNISLLLQPGSVLLMAGNTQEHWLHQLPDITGEPKPHRISLTFRSIV >gi|Sarc1000004600|ref|SARC_04494T0 | SARC_04494 | Sphaeroforma arctica JP610 hypothetical protein (341 aa) MQVESPIIKNRVSFADPLVTVPDTEEYSLALKYFDKALQRVGPFDAEGFALPGNNLMVPYYSPTLEGGVFALDTWFGRWYMNSLFSKLDAVVQRVKSDRATATLVVPYHPEAAWFKQMVPLLADTPIVIPPAHGVFLKHGREPMSPPPFVTLICHLSPRCEKFTLAERFWATVDAHRPVTSALVGALTTRPDDLESCMVQDRLAPAVAGVVEKPVRCGRGPTLGGFALKPIEHVVGSCTFMATTVMSGELVLEESVSRAEREYEAHQARTVAVMTRAQRLRASAAEADVAASEEPLVEEPALLLSRAREQLAAPEAWLASLWATHREGVSRRCTVWIVWY >gi|Rall1000004614|ref|jgi|Rozal1|4615|O9G_005572m.01 MVGVSGKILTSNVVLSSPNFDYPIFVATYASNKVLGAELYQEYDNRIDPPVEVIERILNKIVKDEAEGSIITPNDSTQPWFTKLKSISIRNPIIVTNMDGTFNKIDQNISETKPDYDTIIVWTFKGNHRNKEDCEFINLSETAFENINLPSERKQGDLRSQIIDIVTDSNLKEVPENERENLLNEAHLQGNF >gi|Smar1000004632|ref|SMAR009255-PA pep:novel scaffold:Smar1:JH431878:386467:389171:-1 gene:SMAR009255 transcript:SMAR009255-RA 
MGDSTHEGESSRSTRTSKSKIKYGILLLLCCCLLAGIGIGIGYMTAIQSSDYVNNAQDGKNKTCTHTSLLGDDPFPWWTIDMKSQYEISKVSFQGREDCCADRNHDLQVRVGNYLNKGWGTVLTLNGLCGEIGENKGQKMYEFDCPEMLIGQYVNINKAILNDNSAVVSLFKNHGSSKHLLHSFGRDLWEDACRHKISLSFVHVPGKENTIADFLSRNFISADGEWSLHCSVFNKLTEKFGKPSIDLFASRLNFKLKIFYSWAPDPEASKIDAFAHFWSDFSYAFPPFSPMEGPSGQGNHSIIGATLDDATVVSTPAEKPNSNSSVVSGQGLALPGTQPTGEAETKRSTRPAWLQNLGQSHVAQGFSKDAAELLMDGWRPNTVTTYSSGVTRWENFLGSQPTTAIEKPATPATLTANLVASMYNKGLSLSTVNTTLAAGSAAGFIIPETRATPAIKQIWDVGLPLAELRKMWPHEDLDWRDLQIKVIMLIALVIASRISTIQSLCLDDLTISADVAIFRPSAIQKTLQSGIHPVLRLAPYPAEPPICPHLALCDFLLRTQDWGLFLIQNSPFSGPSKYTISHWIKDFLAKSGVDKNVFGAHTTRSASTSKAAKQVRIQDILNAAGWTFEFTFARFYHRPIVDPNAFQNAGLATE >gi|Vcar1000014693|ref|jgi|Volca1|108458|estExt_fgenesh4_pg.C_1190017 MDTIRQGSGPNSLVSYVAAFGRLMGDLPRRHELDHVFHFAKGLNRDLREEVYARLPQYGTDVTLQIIIDLAMGIFSGRHNAHLIDYNTAPCSSRDVRRDDPMDLTANVYSTTLHHSNPPLPSGVKTIGVPYNLRLQCMAQHVCLQCGDKSHAKEHCPELRRLLAATTISQPNSSRRHSFQGANNSRKPPLTAPTAAAPSTSATNPLPAPAPQAPVKSALRPSSAEPVSHSLPTSSAVHSSEPAFNVPCLAQIVRKPARRVGRKRAHAAAASTSQGACQASNDWKIAQLLFLEYDSQYGPFTVDAYCDDLGLTAQLSPFFSPSRPFLSTDIEGECVWMVPPVDNASTNIARYLDAKTANPNTSAIIVLPDRPQAPWGPLIRHMTIVRRFPAGAQIVCRPLSSDPSRPKAGYGDSRGKPASQPVPTKKVDDKPCKEGNPNDDPEELGSTEPCTQEAEQHFCSAKNSHRLFEVKPDELPNVLKEFNVMKARLAYYISLGRVTWEAAWVSLRPEAHLLPRMMRLAQVATILPTQTACVERGFSRHRIIKNRLTNRLKLETVDSLLRVGMLGPAADTGAKPLIDQAAGIYSRKGYTGIIHKLFSDVSKINLNPYGEDDNEEVATDPEIEVLVQATYPASDDEAFSDSDYETDVERDTDESSVDEDFVDEEEDAKE >gi|Spun1000004719|ref|SPPG_04959T0 | SPPG_04959 | Spizellomyces punctatus DAOM BR117 hypothetical protein (1591 aa) 
MPTKSRPSICEMARRYVCRRNGQSPLQAFERFCYPLALSVCRHAGLRFRTLHVKLPVTGVTGLCNVLCMPANSNTCSARICSILGNDAPSTRSARMSVFQTLPRSKENPNNHSRRMLRGMGQKRTAGCCCVVQEDRDPFDTQIPCTSCCQFFASCLPSFFSQDLVKKKKKVSRKSIKGRKSSSANERDDDFLEPVILPPIINSRKLALPANCPAFHIDAIVEVRDGAKTWWPGRVVTVQSGKVCVHYDGWGDQYDEWIDCESQRIRLATQMPADQCSERISQENEHGQTGEAGIGPDAVILVGRSVREKRKKSAAYTNQKNAKRKKANSRTNVTVVNPITSKSTESATPARPQGFNAQQPRSSSTSPVDNFGIFRAAANREAREAYISRRALANDATADDMKRLYGKGARVEVCCAGGERYMATVIKTRSWQVLVQYDGWDEAWNEWIDMNSSKMKLVEAASGENNSSEDGNSSAGECSSEEDDEQKWKIFCNRCEKRIRQYRYFCTYCEVPSEGFEYESFDLCLACFQQDFPLDHPHPIQSFAVEPLLDTDDPTRRKFKDGELVSTFVLDEFDTSYIAMGTNQSQIDAETVVTAIPMVPRCAFCHSERTDIVGPFIGPHPFRNTRISGRRMPLPSSEKKNSGKNRRVPIFWAHDACARFSPEVYFMKDSGKWHNVLKALARGRGVKCAACKERGATIGCFDVRCTRSYHVGCTRKPLSQFEEGVIFWCPRHESLVNKADNYKDVYNCDVCSNSLGLSDNDEQWHTCDECAQNHFNTFDLCKECFEGRFPETHDHGKDRFITTCMSQRKEIREMEQQLARELVAANASRKSLGQRRKKKLERASGIRCAYCWIDSSSRWRKGYNGIPMCEDCFQMASSAFASSKAPELCATPEAIPSPSPSESLSQSPVNAPTPVPVRIEADSPAPVLDPTSMERVYRTEIEAYSHEYYLTRGVVGKASGADEIGAAEVNKSSEFGILQSYAPTDDQLFTMGFDTSFYDIPGRAPRWATHSGGDYHGTWLPQIVRMSLLRYTSEGERVLSNFSGRGTDAIECFLLKRRCCSVDINPASVALSQRNVSFSVPPELGLTAAYRPVIVLADSRELIGSLFEDESYDHILSHPPYKDCVSYSAHIEGDLSHFPDMEDFQKEMEKIVAETWRLLKPNRRCTLGIGDNRRECFYQPVSFQTIRTYINDGFELEELIVKRQRYCQMAPLGTYLCTQYNFLMFTHEFIAILRKVDDRQHSGLFSYLKVDDDHDFHVNPTRILRVIPAAPIDRTSIVMGTVWTFRVTQKHSLARLAMSKLIERFGTDSAYWEEVSISEFRNKVIRAAVYDDLCAERDPPEEEDEEEENGTEVTEYERRRREQLSKNTRELLSMGLISELSPEGEDDAKHLETLLAMPPVQTIQHEGCPTMHPPSSPVIIFVPHINAPSTAILPHAWINEYRKFVIDCARDAAARLTDGGYFIIGVKDARIFLPPKTDPSSENEQPCGIQTKYVPLGLLVSEDLSRYFEGSEMRLKDFVVAVPEGYSRDKGIEFEEMKARIDEDEQEWKSEQEKEANGQSNVRRLLPIVQAYYFIYAKQAGNRKLSEPSS >gi|Mver1000004713|ref|MVEG_04708T0 | MVEG_04708 | Mortierella verticillata NRRL 6337 hypothetical protein (324 aa) 
MSRMNTWRVARTALPRALSSRIVARNDICMRKFMGAQQVMSTATALPSVQLSRKISSIPSAYATKPLISHYSTSAAEHSPTNSSTNGSFYATLSTPPGIVATHFDLSKIPTAEHAQITHDFIVVPNYLSPQEHDMMVEAATKKLKRALGKQVRYEDGHFDGVITRYRECSASDWGAPPSLSSQGAEQKERQTPSEVLQAIKLHYFPQDWKWVAPHILELEAGKGGIKPHVDHLEASGQVVAGICLGSTAVMELIHDKEPSKSFRVLLPKGCFYFQRDSVRYQYKHGIPIELEDHQFKGTVYPKEKRISVMLRNALEPVHHNGH >gi|Pram1000014720|ref|50629 LLPGFVVLKGFLSPQEQQELVNDSRRMGLQEGGFYKPTYASGAKCRLHQMCLGRHWNVKTEKYEDQRSNFDDAPVPALPMSWKLYAQRSLEAAKAIDPQVMGTCKNMMPDVCVINFYKKAGRNGMHIDKDESDEAMTMGSPVISFSIGCAAEFAYIDHYPEPHEAVPIVRLESGDVLVFGGPARNVVHALTRVYNRTQPLWLRMRSGRLNLTFREY >gi|Fcyl1000104830|ref|jgi|Fracy1|179302|e_gw1.1.1865.1 ; gi|Fcyl1000086724|ref|jgi|Fracy1|160953|gw1.1.1865.1 MTVLVERSFHVLVLALLLRYSALSGGQLLEDLRGGGMQGAIHSSVFDVLQSHFSKIPHNDKKSSSSTKQFWLEGFASPFNATLPRFASAFPDLDWHFGSVGRFLDCSFDGRLKYCEANPPFTPGIMLAMADHTTNVLQRADNDNTRLTFVVVVPSADNKKNTNKDEAVVKHEAQKSFRSMVSSVYCTKHIQLKAREHGYVEGAQHLRPTQYKQSSYDTSIIILQSPKAKKYGLDKTNMKQLEKDIRIAFASRHENEIMERKKMAASAM >gi|Mver1000004892|ref|MVEG_04886T0 | MVEG_04886 | Mortierella verticillata NRRL 6337 hypothetical protein (2160 aa) 
MDHDQQGHNRRPQPSPSRPQPPQPQQRHSERHPQYHPQHHDPQYYDPQYYDPQRQPQQQYPPSHSSIHTPSPYLDHHPAWTSASPRTQLQQHHHQHQQHQQQHQQHLHRSHFLPLVETPPSQPPHRSYGRASFSHHEEQPHFYQDAYRDMHDHGQMSSQLGADHARSHSPLLPALTSSHTSLHPRPRAASFSASLPSFASSFSSLVHPVPLPPSSSPPSRAFPEHTMPSTHHTRYDDFPQRKRGRPSHGGFGGDNNMDSGVFQSTGQAIGRTDLDSASPHSRDSRPSHIFRPEGDDQYQTYYHHTHAPDQTRPGHSPESHYTPHPSLQIQTDTQTHSPSKRPLILSPLLGPEYSEPLETHRTLSSSRRSISHHHNEHHHNEHHHQLHHQQHHQQQQQQLHEQRLSHGPTQSREQFLHDPMEHATAPLPRTNSPSDPMLRPSSSPSSFPTSSSASIPMPHFRGQLLHRAPDSAYQRRYFGASTSGSGSSPNPLSSFSPSTVPVPIHSDPLSKMPRKTSTSKRPQPNDTIGNTSTSGPFSSGRGSIANQDATLTSYLHTQLGRLLLSTKAHVRRLLESFLHQKAASTVHPRLESVLVELFNGMLMSHNALATREHISSQEQPMTLRPVLIQSLDHLAALMAHAPSVSSSSTPATLATATPGATRKKAPPHPPSRLSQSIYFSDTDRDTDSDLDLEGVAQGTDATRREATEGTTGRSSGPPRPESELLAIEIETVEMYTSTIEFLSLMTAFVSVVCWLMQARLKRDTSQPTTLGVDIEDVDMENRQPTDKDCFDRLHSFLLLFDDHDNVLKPIQDALHSEEQGGHSTTHPRGAMDDPTRQVGLFAWHFSLFSDDEDGPLQQEVNLIDRQALDSIHFDVILNDLYSTHVLAMTAKEHQKDHGQFYTPSNVVDFMWRRAIVGRENLLERFVANLGGAKGQGVQASMAPVESEASLVPTALDPCLGVSTFLSCYVRLLIQKARQDHTETIWNSPIASRLLLAQICENIWGIELDGFAFWMARCGILASLIPLVERVQKLQHQQQQGLQAYQAGRGETTKLTRLHLFRNDTLQLTVPDGVHPDKSWERACILQLRDPQLLRFDFIVTNPPYMIRKTGTFSAPDPEVYDWSILETGGSPTIITSNVSPSETGSRSKPRRGSISPNPPINEDVVSAAEEEDELEGSDSEATTPDSRSGSPRSSRVKASSSSWPTSSASASMRLGAKGMMQAYGYFIWFAAQRIKPYAGVSCMITASQWLTLEFATKLRAWLFENCLMDEFFQFEPFKVFAKVQTDSLIFKIRSMEPGRTRQDSSIEPSIPLYDRLLEIGAHRTVFLRHTDHHRPLDGILQDYMDFFAISPQEQSSSVNIMVSNKTREELSAVIAAAPQPSSSSTTVTAPTYSFAPMMPSSLLSTFLLSLTQDLPGICSAGTKRVNRLSAVEPLLWHRGPNTNPVYGLVVRMEYAEVMFGEVMKARWFRPAFYWNGKNSPEVGMMTKALHKEGQFWQGRDRLRLSKKEGSPAESYLVPTPGSHRLYGLCMVDKESVKVLREQMAQGVQGAAALWQYLTDVRNHFQPGLASKKRKVFLSGKQQMTDDEGVAYCSTNQCGSDVPEKLVHPINYGYFSKTQPRQRFFLDTSSLAVTNQCIYLTLNKLSHHYDAAQSPPLIYFLTLLNSSTLQFFVLHHCQYDQQGRMRLFRESMAKIPFQDRDVKSSPQRIQYAAQLGQLMIDLKGTLYKVVMEWHLTGSSSRTDLGAPRLSEPFIGSVGGNQGLLDWIRRGGDPPTGVLPKTRDQIWRMLQGHASAPTTRSPSAPSSIAQLSTSAPPALPALGAHFHRAESLSTQADIDTDTNTGTDTDTDTDDNFESGRRSRFDQEEDFEKPRREYQHPLQEPRASGFSPQQYNQQHSSWLKSSNDPTPSLTPLPPSLQPSTHNQHVSQTRHSLQNQDTECDAIMRALERAVTMVEMLQWAVDQYGYMLYGIQPRFQKLLELELKVAYGSRLDAVIVNMSS
PVSEPLLLSHHHQHHHHQHQPVGPLDRDQPMRFGEEMSFDHVEDPLTVSEGSLLSTSAPPALYSTPLSAPIAPALRPILPRQGAVGPHSRAPLATPVSVHTPSNLEQDLGITVSKLMRWDKHEKDPTSIAVPSYAQSIMENAQAAVSSLEDLLRRYPPL >gi|Caps1000024925|ref|jgi|Capca1|112116|e_gw1.37312.2.1 ITIYGKTLPIPRLQVWMGDADANYQYSGLELSPHPWNPTIRSIKQQLQPICGHNFNSVLINLYRNGQDSNGWHSDDEPELGENPIIASFSLGATRRFRLRHKYRKDLTPYTFDLMSGSLLVMAGSTQKYWQHCLTKTAKQVEPRINLTFRKTLRGLDD >gi|Vcar1000014935|ref|jgi|Volca1|100989|fgenesh4_pg.C_scaffold_273000001 MRHNLDMAGVPEDSPEAAKIALSVLRGSMGDSLRRLNADPATRFTSYQAVLQAVTPLAPDLIDLALTISTGRDAANRIRDDPSPPRPRDHRRNDLMDTSAALDAGRCLQCGSEFHDKLSCPDLIHLTAPTLAAMTSTAVPTQPAAARPVPVNVAFKPPPAEQVSHSLPIASKPARRVGCMRARVAAASTTEAHRVDSDWMLSRTVFLDLDSQYGPFTVDACCDDFGINAHVVPFFSPSRSFLSAQVDGECVWMVPPTSSPSAFISQYLESKTTNPRTSAIIVLPDRPTAPWAPLIRHMTVVRRFPAGARIVCRRDPSDAS >gi|Psoj1000004970|ref|131422 MATLMRISPNQSRVSLKAQHHPTLLLSMDFRQLLREERRRAREAAQSQTSTELRSGQEAATNNATEDEDKLAQWQDATLKVWAKRSKIDIEEFRRGPIPGVYYIPNWITQDEEKAILERVYAVPDDSDLWVRLKHRRLQMWGGEVKAPFDPKPLPRWLTQISQTLVEAGIFSEEKKPNHALINEYGVGDCIMPHEDGPAYYPFVSIISTGAECRVTFEPHRALEASSATVSEVVPHFDFQLERRSLLLFTGEAYTRYLHSIDNIEVGTRISLTVRHVDLR >gi|Fcyl1000014986|ref|jgi|Fracy1|232852|fgenesh2_pg.1_#_1050 MPKKRRKTQDVSQLSNLSNLTGFGNAGFGASSSLDDCLAEMSNTTTREKQQPSWIRQAKISTAEGIDNWNRNFQIWAKGGIYHPGLLPNIDQEIARNFKVQELSTLLLSNEFVKGKDIRMTTFERWLLDSKHEEEEEDEEDNAGGGGGIVQGDPVLPLRSSPDSKASQRLLSELITCTSTNLKNKNNNKTSKKIPRQDAEKIIAKLCRTTNLTCQELLCQEDRYRKQSPLNKGDRINVSTKTTESSNVIVVSILYSRKRWKKPFCFKLNQNHYKLLKDRFMEIHAPSSTTLTNAPLFGGRNNNIMSSSDNTMTVLVERSFHVLVLALLLRYSALSGGQLLEDLRGGGMQGAIHSSVFDVLQSHFSKIPHNDKKSSSSTKQFWLEGFASPFNATLPRFASAFPDLDWHFGSVGRFLDCSFDGRLSNSGGGGGGDDEEEYCEANPPFTPGIMLAMADHTTNVLQRADNDNTRLTFVVVVPSADNKKNTNKDEAVVKHEAQKSFRSMVSSVYCTKHIQLKAREHGYVEGAQHLRPTQYKQSSYDTSIIILQSPKAKKYGLDKTNMKQLEKDIRIAFASRHENEIMERKKMAASAM >gi|Caps1000015030|ref|jgi|Capca1|35874|gw1.256.16.1 
REYGKGGKPKSKYDFMTIDEKQVEVDKIRTGIKQRCLFNDSACKEIEAKIEEVVRKAANGEYKKNTVDTAPLRNKYFFGEGYTYGSHMEARGPGMERLYPKEGEESVDPIPEWIHEMVVQPLLRAQLIPPDFINSAVINDYQPGGCIVSHIDPYHIFDRPIVTVSFFSSSALSFGCKFEFRPIRVSNPVLSVKLPRGGVTLISGFAADEITHCIRPQDTPHRRAVVILRR >gi|Hrob1000005052|ref|jgi|Helro1|69129 IMVANGGLGNGVSREKLYLLFSGFGTIQDIQMKPKRSYSILNFDDVNDAQIVFNKFNKFCAIKDESEKDVHLCLAFIEKMPYLEDSNAMHYPQGLEIVNDFVSENEEKMFLDYCDWGEEDSISLKNRRVRHFGYEFNYKEKTISKLNGLLNRIPQFCCDLLKKLKQLNYVTEDFDQLTVNQYNPGKGIPNHVDTPEVFEKEIAIISLGSSIVMDFKHPDGTHVPIVLPRFSVAILKNESRYIWSHGIAPRLYDLIKSETGLTLARRSTRISFTFRKIKTKLLGNFLFTNVMKFPKLNCVAITVDNDKLAQVYNTIAPHFSRTRHSPWPSVVNFLNRLDPYAIVLDVGCGNGKYLNTRKSDLFMIGCDASPTLCQIGSSVGEVQICNVVALPYPDCTFDACICIAVLHHLASQERRSLAIKEIVRVLKVGGQALIYVWAMEQDFAGVKSKYLKSVKKITELTEFHKVDDNLSLPVHNSKTTFKYCDNLVPWTLQTEYPTKDDLNSVEVYKRYYHVFCKGELVELCRQFDSSLIVSNVFYEEGNWCVEIIKTNKIETILL >gi|Vcar1000015077|ref|jgi|Volca1|100923|fgenesh4_pg.C_scaffold_231000001 MGQRQSLIPGRIDRECEYFARLTISPMNGAGGTNDLGGSLDVAMASGSSGNPAGEAGAAAPPLAPVVPVPVAPVGQPVIPIAGAPAAGAIPNPAQPVAVPAAVAVAQLPDAQMVRNIPAPKLPVATPVEPDRIHAFVADVRDYFVLVGWQANIPAQKLFISGALEGFFKEWHITWTKSVPDYTPDQLLDAFLIRSAPEMYSRTHVARTTFYSATFKQELNESVLTFAGRFEALLQDIPDMQEPTKLWHFREKLLPYLREECAVRPTDFQEHTVYAELLHLPLLLLWRHPLQYTLDLSRRRPSLPLDSRPTRVRAVVLEAVAGLVDVVVAQQAAWPDAMVQGVAPPRFVRRPDGNCLWSKYWQGLGRSVHESEVASIATTMGREFTLDACASDCGLSAVCNAFSCTARPFLDTNIAGHTVWMAPNAADLPAYVTHYRACKPLAPQSTAACILVPSGTEPSLLKGMKLVRRYPVGTSLFYVPDVQGSRALLPPITEVMEVWYDGPDSTEEIPACTAIGNAVPHLAVKISGSTFMAMLDSGATHSFVSEALVRLLHLHVLPSTFTYVRLADGGMSPIVGQPMLKYTLDLSRRRPSLPLDSRPTRVRAVVLEAVAGLVDVVVAQQAAWLDAMVQVGVADREAGVMLTQVEAVVPEEEPPQVAVQRLPVTMYKEAACFFLLLLKLWRYGMMVQTARKKSLPVQLLGMLFLT >gi|Ccor1000005156|ref|jgi|Conco1|17958|estExt_Genemark1.C_990013 MKSLLYKSAFNSKRSILQLCKPSTIIRTYLSSTINNFEYYEHIDSENLKYIDPINVPSTPNEKFNHKDFLIIPDFITKEEEEMLVAKSNFKLRRMRHYEDSHFDGVIQNYKEVTVSDWAPFQNEVNPILERAKSLIKDPINWLPTHILDLGGKGGIDPHVDNIQASGDYILGLCLLSDSVIVFKSEDYHFEVLAKARSLYMQRKTIRYNFTHAIPTLPQDHKFKDVLLDKNRRISWMLRDMKPLSK >gi|Caps1000025183|ref|jgi|Capca1|219923|estExt_fgenesh1_pg.C_4020007 
MNNPSLPPPPQASGIPPGPQTPGIRPPPGMLSPAQPGDPVHDLPHDLLEAGWRRFWSRREGRPYFFNKVSNESRWEIPGQDAFTDSDPLGIDASPAPMPSRSLSIDTAMPSTSNGVGMKRRSSEDPNLMSPAKKISFSYSPYWNFDIPTNVMIWERKPCYLPPPHPEIELLRTQLMSKLRIQYKDLCRNREKIEPPRESFNRWLLERKVIDKGHDPVLPCLCTPEVSPSMYREIMHDIPMKVVKCKYFNETKRQLSKYAQAAKELMDSRSASAESRKLVKWQVEDVIQWLQRTENASYADYDSRLVHLKKQCQPHIMEVAKSSVEGICLKMYNTAVDHVKKIHERHWEILAEMKIYPSTTGPPPPRRNIPCYPIQMVIPAPRMSGIEHHIEGDVVSLRFKSEHLIKVNTSHFHKLEQLYRCNCRDDPKFENFLPRVWCLLRRYHTYFGLSADEGSGLQGALPVPVFECLHRVFNVTFECFASPLNCYFKQYCSAFVDTDGYFGSRGPLLDFSPTSGSFEANPPFGEELMEAMVDHFESLLSESNDPLSFIVFVPDWRDPPTEALMRLESSRFKRKQATVVAYEHEYRQGFQHIVNKADTNIRASHGTVIIFLQNEAGFNKWGPTRERLNELLLAYNPQIKDATAPSSAT >gi|Smar1000005290|ref|SMAR008745-PA pep:novel scaffold:Smar1:JH431850:1046606:1050059:-1 gene:SMAR008745 transcript:SMAR008745-RA MASTAVQSQTIPQESSSSQWEETSPATTLESNDGSRDSSPAAATPTGDFASPQHPLSLDPAHELPQELINQGWKKFWSKRENRPYFWNKMTNESLWEMPRMANSSQYDPMTDPLGIQCSPMPSESPATPVIAAKRRASDTDGSSSPSAKRFVLGGPWDLEVQTIAVMWERTPSLLPPPHPQIELLRANYVGRLRQHYQEMCHSREGIDAPKESFNRWLLERKVADTGSDPVLPSACFREISMSMYREIMNDIPIKLVRPKYSGDARKQLSKYAEAAKKMIESRNVSPRSRKIVKWNVEDAFQWLRKTQNATYDDYLERLAHLKHQCQPHLTEAAKSSVEGICLKIYHLSIDYAKKIHDKHWALLNEHGIAEIRSSLQVTNPKKVLCYPVQLALSSPRLPTVEFVQDKDMTILTFNGDSVRINSLYFQKLEQLYRWNCSDDRKFENFLSRVWCLLKRYQTYFGVSNNEGHCTQGALPVSVFQALQRNFDVSFECFASPLNCYFRQFCSAFPDTDGFFGSRGPILDFRPVSGSFEANPPFCEELMETVVEHFEKLLSHSNEPLSFIVFMAEWRDPTPVALAKLEASSFRRHHVVVSAMEHEYRHGFQHVCNRTEVNIKSAHGTVAVWLQNEAGYSRWGPTKERIEAFLDSYKLNRDREHQEIIAEVPVTDIAITTATAAITTTATVAITTTTTTTTATLAANVITTATTPCKSVTK >gi|Sarc1000005302|ref|SARC_05176T0 | SARC_05176 | Sphaeroforma arctica JP610 hypothetical protein (256 aa) MIRHQDDKPIIEGHLIEGAPPSLYCVFDFISADQETLFLSKIYDGSKKWTSLLNRRLQNWGGLPSHKGMVKEGLPQWLTEQCENLSAGSVFPSGQQPNHVLVNEYLPGQGILPHEDGPVFSPTIATVTLRSHCLLEFYPHRRQPDKDDPQAGAEAEDGTEAGKDDGNEPEFKIYLPPRSLFVVKDDCYRTYLHGISDTSADIVDEKVLNRTQSTHALAINEVLERGTRVSMTIRHVPKVMKESVQQALMARLLKR >gi|Bcir1000015312|ref|jgi|Bacci1|280182|fgenesh1_pg.351_#_5 
MRKPNDEDLLRLQDRLKVYDQIRGNNINDVTRACSSDTIIMQPSQHQGSLSAYSRDFEHSSGQVKPVEKAVIRSNNSEKDVQLYTEKHQLSKYWTWSTDTPALVVDALRQVWLPKGMYLYPPWKLMPQVLKKVQEQKSILVPNDLKDKTPTITNHLENKSETVSSHLAIINNSKLKDGLDEQRANRDYSSTHLSTLRSSITSAFSIIHSQKQPITDQLLIKEFFTA >gi|Mcir1000005378|ref|jgi|Mucci2|37740|Mucci1.e_gw1.4.913.1 MSNNIDWEDLFGSDDDSDKEVEEREHAKPIITFEAIPGLKLIKQALSHQEQMSLTHALIDRNFFTGNANQAMLFGELPDFIKWVEPWISDNYPDLFGQDIMHRQPLFDQAILNMYKKGQGITSHVDLLRFEDGILIISLMSSCVMTMRPARKDATSYHAENTDDSEKHDILLCPGDVLALSQEARYDWEHGIPSRLVDEIEGRSIERGTRISVTLRKLKHGEQEMPSAATSER >gi|Fcyl1000015510|ref|jgi|Fracy1|233376|fgenesh2_pg.1_#_1574 MYYHSLDIDDNDNDETVTIKKKKKKNDDGGGGGEEEGRTAIIDDDNSNSKSKSKSDADLNNSTNDNNCYYKYSGTSRLCLPCNRSRIDENFIINLCSAADQIITQLQMAAIVALKEEEQQQQQQQHQQQQKNNNDNKKFLENIRLRIIHQKGENHNDNNNDNDCDTIIIENKKEDHVGQKRKTKKGSRKRIYNSCLVNWYKPDHTIGLHSDDEPEMDTVTYPIVSLSLGGPRRFVLKSKQSQSLSKQHQQQQQSQSNNRTIQKNHEFILKDGDLFIMGGNCQKEYKHEIPKVRKTKDHQFGGITSNRISWTLRRMKIKTSKMKATTNNSTNSNSKSNSNSNNNNNNNNNDPKQQAARSLIRNPYASQQQQQQHKRQRR >gi|Spun1000005556|ref|SPPG_05863T0 | SPPG_05863 | Spizellomyces punctatus DAOM BR117 hypothetical protein (222 aa) MLTSNPLEQYILPNAPPTVYYIPSFLTPQESTTLLEKIYKTPKPKWVSLRNRRLQNWGVLAGHCPLSDPLPPWLTAIENRIAALGIWHQHSHANNCLINEYLPGQGILPHEDGPAFHPTIATLSLSSTCLLQLYPHGQTKASHTLFLEPGSLLVLDGEVYRGFLHGIPEQRVDVLEGVVNPVRDFGKEGGRIERGTRVSVTFRVVKKVAKVSVGRVFGKRV >gi|Ccor1000005557|ref|jgi|Conco1|7959|gm1.6199_g MVELSNRQKKLLKHQSTFRDPATFNNQTAFRSAERRYKNRFDNPDYSACYDFKYLDKNLPHIREQIVELDLGCNIASLCEYFGSYKPEVTKAYVIGTIPGLIVIPNPFSNRAQRGLISACTRVYSKPPYVSNLDTHFTVPANGVWDVYERGHLMQITTDDPDYYIPLKLDPSKVNDTIKSGSGLKNIETSTATEGFKYNGLNDRNVLIKHSNLKLLPPDQIIRRLRWITFGYQYHWPSKEYICDENHKVPVEFSNLCDAVVKSVYQVCNPISGEYISNYNPCDFKSEGGVINYYQLKDSLMGHVDKSEPNTTAPLISFSLGQSAVFLIGGPTKDTEPIPLILRSGDILIMGEQCRRNYHGVPRIIENTLPEYLTSSKTNPSLEEDELNYWDVIADYLQTTRVNINVRQVF >gi|Falb1000005586|ref|H696_05801T1 | H696_05801 | Fonticula alba ATCC 38817 (V2) hypothetical protein (411 aa) 
MPSTPASRKRPAAGGRPLVEDTPLWGALPVEGSTHAAPRTDTAFRQAERRLKQAPRAELLDPGLARTRHGVHDPHNSDDLAAIMRRAPGCFVAAGALSIADQARLAQCAVVSWTGGRTNFSSLTLGSLKALTEVDLLAESPAPVARAAMISNAEACMERSCSRHHCAAALSQRHGALVKNPPAAKAPESDPRAAALKLVDRITTAEGAAAGCPPRRSHPWREFALAGGFPRKRGVPAPPLRKLRWATLGWHYNWTNKTYVEGDHGPMPELLASLARGAISQSPMAEEEGRKWVPEAAIVNYYSPGDSLTGHADFSEQAPEQAPLVSIRYDSMPVRPCPGQGSVLWVLTLLSLACTVSAARPSSLLEALVDRWTRCRFSCVAATLSSSVGPLDWLSMAFHASWRRRLHQHC >gi|Falb1000005587|ref|H696_05801T0 | H696_05801 | Fonticula alba ATCC 38817 (V2) hypothetical protein (452 aa) MPSTPASRKRPAAGGRPLVEDTPLWGALPVEGSTHAAPRTDTAFRQAERRLKQAPRAELLDPGLARTRHGVHDPHNSDDLAAIMRRAPGCFVAAGALSIADQARLAQCAVVSWTGGRTNFSSLTLGSLKALTEVDLLAESPAPVARAAMISNAEACMERSCSRHHCAAALSQRHGALVKNPPAAKAPESDPRAAALKLVDRITTAEGAAAGCPPRRSHPWREFALAGGFPRKRGVPAPPLRKLRWATLGWHYNWTNKTYVEGDHGPMPELLASLARGAISQSPMAEEEGRKWVPEAAIVNYYSPGDSLTGHADFSEQAPEQAPLVSISLGCSAIFLIGGTCRSVDPVPVLLRSGDVVILSGPSRLAFHGVPRILEETTPPALLTYLAARGEDLFAEVGPGGQHKCPLLPSPPAEGEDFPSATTEGQLLAEYMRQSRLNFSVRQVFAAASPK >gi|Bnat1000005628|ref|jgi|Bigna1|77192|fgenesh1_pg.46_#_69 MPRTSLQRTNTGALQALSHLPWRMYFRQPRNSQACQAEGRTSESTSTRMVAKMVLSRPRRAQRKRDTHTPPQASGPKYGGGNHHNSQHPKQRGGPLSRAVSLLQAATTPDEILTVIAQHGYDATFRTSHIVQGLLRLAKSFVPASASRARKEFVADDARLQELLQALLRRTKERDFRPVQATDTLLALALLYPKEVVNDGTGEHYNNNNNNKEEKEEEEEEGQCWRRQLQPQLLGIINRDQDLLTGFECTNLAWACRKLGIDTPEGVAARQEQLPFQHVQQAFTDIEMERLLEEIEFNVDEVGLNLGGKRIKERRMTAWQSDSNKPFEYAIYSGKIMEPMEMTPTIERLRDEIYRRTSVRYDCALINLYPDSRSGMRFHADPDQNTLWSTNSVVVSAGDTRLFIMRQMNDHSKRHQFYVSAGDIVIMYADCQERYQHSIRTEQSSNDDYSSTVAATQRPQQRQQQQESRRDGDFPMGPRVSIVFKQSLEEWERTNRVNHTATIHSFA >gi|Vcar1000005651|ref|jgi|Volca1|100148|fgenesh4_pg.C_scaffold_107000001 
MSLSNEFPFPPSFPAATDPAPAPVAPPTTSEAALPWDRISEVATHAAVRALENRSSNNQRGPAPRFNPTAKDADLASWANRLRLHFTVCGLHEDSAPAVAIALSAIEGPTLDTLLLLHQRTPFTSLTAVLQALTPLAPRPDRLLQAQISMDTIRQGSGPNSLVSYVAAFGRLMGDLPHRHELDHVFHFAKGLNRDLREEVYARLPQYGTDVTLQIIIDLAMGIFSGRHNAHLIDYNTAPRSSRDVRRDDPMDLTANVYSTTPHHSNSPLPSGVKTIGVPYNLRLQRMAQHAPVKSALRPSSAETVSHSLPTSSAVHSSETACNVQRLAQIVRKPARPVGRKRARAAAASTSKGACQASNDWKIARSLFLEYDSQYGPFTVDAYCDDLGLTAQLSPFFSPSRPFLSTDIEGECVWMVPPVDNASTIVARYLDAKTASPNTSAIIVLPDRPQAPWAPLIRHMTIVRRFPAGAQIVCRPTTSDPS >gi|Bcir1000005678|ref|jgi|Bacci1|183776|e_gw1.67.64.1 MSPHFVSKRQQKIWEKQQRENAEKRSSYANQSAFRYAERVFKSNIPTSEFEEVVDFSNIDNNIQETRDNLVKVKLSNDLRQLTTAFGVPSESPQVDAIVMKNVPGLIVIPNPFTPEAQRTLVKHCLTDCAKPPHTSNLDGHYHTPLQGIWPLYKREQEGILKPGDPDYYVPIKTTEYQDQGIYTESVDDDDAVSTVNSMYSSIGKSAHIQSPTQLLKKQRWITLGYQYHWGSKKYNLDNPIPIPELISDLMKAVAIATDDIGCDEAIVKWKNEYNGKEYKPEAGIINYYQLQSTLMGHVDQSELNMDAPLISLSLGHSCIYLIGGFTKDAKPIPLRLNSGDIIVMTKICRKAYHGVPKIIKGSLPEYMVHCDDPEWPLYAKYMSTTRINLNIRQVFPNEP >gi|Fcyl1000025701|ref|jgi|Fracy1|243567|fgenesh2_pg.11_#_461 ; gi|Fcyl1000116478|ref|jgi|Fracy1|190950|e_gw1.11.582.1 ; gi|Fcyl1000084399|ref|jgi|Fracy1|158488|gw1.11.582.1 MKGSKRKREGNPTREEQEPSSTSDPTVNQLYLTPEDGVYFQKHLQDHYKGFVHLPVDRLEPRDFHEEFKNSLERLRDAGYYQYDVVMAGGKYSSRTFVKRTLVGNPGITYKYLGLRLFAHAWSGPGCTPLMKSIGDMNQSMIKMTEQFPENERCDYNLTLINYMEPTTHTKVGFKDEANYGMGKVSVSWHADSSLVDNSSIGVYHCLPTQRNSKWDWKIALRPLAIESKDNTGSTKRDNKSTTPKPVVVNTKDGDAYFLLGNFNMKHQHCVLAGSQANRISSTHRVAVTIEDTYDYILKRVKIARKRFRLQMETIPPSLQSPVVGLDAKVIRYCQRILTEVEMEWIAQYWLQGDQHDKMRVWWQKPMKTLEAYWCALEVYTYRLFEFLLSRIASNDTVPIDVVKVLIIEFKTRQSFREQWDERRSDKIYQRRVSEEFRPVARPIFENNDEVKIDERRLPKDLTSAIQALSIFLEKDSKGTKSKKVEQVIDDHPLPPPSKSCQSKISSIVKEDIDTVTTKMSKQQPKEEHSSNAKKRKKKKRNKK >gi|Chet1000005718|ref|jgi|CocheC5_1|33815|estExt_fgenesh1_pg.C_410002 
MKRGAIDSFFTRPSPKKPKYQASKAKSSHASYPFAIPHLPEDFAQQLGFAPADEGKVINDQLDLDLVYYQPYIPSSIAGGVFEFLRQELPFYRIIYNITRGGVQTQINTPRFTTVFGVDDTCRFTPDGKIIDAKTSKPVEKSRYKCAPRPIPQCLDELRKVTEGTTGETFNFCLVNYYAHGKDSISYHSDDERFLGPNPAIASFSLGAKRDFLMKHKPIPPKDGEKIEEPKGLKLPLGSGDMILMRGTTQANWLHSIPKRAGPEAGKGRINITFRKAMVKGGTENYYQYNVGSGGVYRWDAKAEKMIQQEIEKGND >gi|Sarc1000005744|ref|SARC_05614T0 | SARC_05614 | Sphaeroforma arctica JP610 hypothetical protein (206 aa) MGRGNKRIGVFPWELISDVLDKVIEEATRITICVPYYPNTTWFPKFLSLLEQDPMIVENTNNTFLNEGTTVCGKTPLVGTLVAKIGTKSPCFDQKLAHPAIKIVSVQTDDESAELDTVTFEDRVTLISQYHSVGHYTVEETVNKLQLNGHTWSRQKEHVNQYIEGCVPCLKYKSTKRASTFTTNNSRISGRQNTSEHHRSLDQLE >gi|Chet1000005801|ref|jgi|CocheC5_1|108207|estExt_Genewise1Plus.C_130166 MASKMADLMDLVSQNQPKFQTHQALLAIGLQNDFVLPDGRLPVNTSTGFLDRIQTLVPKFRELSGNVIWVQTLYETDRIATGADTGEGDALVVGGLIDARRKTLPKEIAKASVEEDDELFLLKSEKRTPACVPKTPGAEFIDFVTQQMELPADAVIRTTNYSAFQGTNLLITLRARLVTELFICGCITNVSVLATVIDAARHGIKICVIEDCLGFRKQTRHEMALKRMDDFFDAYLVNSEEILEKDPAELPQKPELSSNGANSDAKQNEKMVEDLLSGFSDRVRMSLSRGPKSEPGPEAKRGSAPSSKAASTISKQDDKEIQQSLAESGKSASMSQSESPAVEPIQNKIAPVTLATTPVEETAKVPTKSKSPKLKSLANLPVLGPGDEIAEGDSRIIHDFFPSDLRHPSDPSQPLKDIIFGQLYNEVRWQKMLHQQGEVPRLVCCQGAFGDDGSMPVYRHPADQTLPLLRFSPKVQIIRRQAEKLVEHPLNHVLIQLYRSGNDFISEHSDKTLDIVKGSSIVNVSFGSQRTMRIRRKKPQSKKDETLVEDSAVAQRETQRVPLPHNSMFVLGLESNKKWLHAIQPDKRLASERSEAETSHNGIRISLTFRNIGTFLDAKESTIWGQGATAREQRDAADVINGDEEEAKRVITAFSRENHDPDFDWDEWYGDGFDTLHLQTPPKDTPLLFASNNPIETRQAQIALAECKIHYSLLEAPATEETYEQDRQVTFRDADTHHTEIHTPFSILLYLDRYHPIDISPSSHPVIASAYPVMLITADVTKTWLSRANSPAEMVSEFDATLSSLLQRLEDGFEMHAGPYIAGVRFSVADCLAWPVIDALVDEWNGWSAEKFGSLDAWYRACWKKKACVKKVKEKLSA >gi|Rall1000005813|ref|jgi|Rozal1|5814|O9G_000964m.01 
MRLPVLAPGEIPAPEKYIPRKGVESRVPMNHTAFRIAERRYKGRKTPIDFSQVIDPRKGHELLEKMEYKVSKEYKWKDFGNRAFGAWSVKGMPGFIVIPEALNEQEQKELAKKCYTSYIEPPNLNSLDRIFSIPSKGLWKSMVENTPIYEYEKSENESSGYDTDVPYATQYKNDKRKKKETPVVDVENAIRRLRWIILGYPYNWYTKEYDFNSTFKAVPLELNLICKTLSEMLGFGPYEAEAGIVNFYQEKDTLMGHVDRSEKNMNAPLFSISLGQSAIFLIGGKTREDPVIPILLRSGDIAILSGESRWFYHGVPRIIENSLPEYFMDCCDCGNEENHSFDCWKLVSHYLKGTRLNINIRQVN >gi|Vcar1000005961|ref|jgi|Volca1|121547|estExt_fgenesh5_synt.C_620028 MGNSICYGGQRGISESDENVAVGATAVPDREANNTATRQAAELENGGDNRSIGARSSSAEVTPSEEAGVRNMQLALLTQTNIEPHPDQPDRSQVPIPRSSVDQPESSGSQRQRKLMITTINMFPGPTLAQQNFILQTPEEMIEDVQVLSAIGAGGYAVVYRGVYQGGDVAIKVATINPDHAGWTEGFVACQLRHPHPQPGQLPAGRKVQQMDELPVGPSEDGLGDPRKTSHDRQGWKDVLARVGAAPNKGLLMLIQEYCDRGNLGKVIRSGIFKTAIGRTSSTTSAEKGQLLARRMLLRTATEICRGMIHLHNASVVHGDLKPANVLIQSSNKDRRGFTVKIADFGLARLLQKDTSSVESEASAGEAGAAAPPLAPVVPVPVAPVGQPVIPIAGAPAAGAMPDPVQPVAVPAAVAVAQLPDAQMVRNIPAPKLPVATPVEPDRIRAFVADVRDYFVLVGWQANLPAQKLFISGALEGFFKEWHVTWTKSVPDYTPDQLLDAFLVRFAPEMYSRTHVARTTFYSAAFKQELNESVLTFAGRFEALLQDIPDMQEPTKLWHFREKLLPYLREELHLPLLLLWRHPLQYTLDLSRRRPSLPLFSRPTWVRAVVLEAVAGVVDVVVAQQAAWPDAMVQEGVADREAGVMLTRVEAVVPEEEPPQVAVQRLPVAGIAPPRFVRRPDGNCLWSKYWQGLGRSVHESEVASIATIMDREFTLDACASDCGLSAVCNAFSCTARPFLDTNVAGHTVWMAPNAADLPAYVTHYRACKPLAPQSTAACILVPSGTEPSLLKGMKLVRRYPVGTSLFYVPDVQGSRALLPPITEVMEVWYDGPDSTEEIPACTAVGSAVPHLAVKISGSTFMAMLDSGATHSFVSEALVRLLHLNVLPSTFTYVRLADGGMSPIVGQTMLKYTLDLSRRRPSLPLFSRPTWVRAVVLEAVAGVVDVVVAQQAAWPDAMVQEGVADREAGVMLTRVEAVVPEEEPPQVAVQRLPVAVYAFGVVLWEMLCGTQPYESMPVGQVMLGVSFHNLRPPWPESHWPGLCALGRSCLAQLPEERPSFRELEKQLVALEEEVRVESLRHTQNIKRDNANQPNHPTASSSSSSRSTFTTPTPATSCPNTYNITSAATTSTTAAAGAATATASTEAVGPAANAPTAAAGTSTSPFAAAPAPTGRVLAMAAGEVTAGGGGGGGSTPGYVKHRSSLELRRAAAAGAAAASAAVAAVSSAAAAAAGNADWTPSSQPDSTAAVETAVVVKRFEDIEDKGEREDKAAASLVVHVDTPAQEASVAAAAAAAATAGSSAAGSEPQSPWLTTTQMSELMRMEEGRESEAAAETA >gi|Pram1000005973|ref|77822 
MHIKQVVVCGFRSYKDQVAVEPFSKQHNVVIGRNGTGKSNFFDAIRFGLLTSRFANLRPEERQALLHEGSGKHVMSAYVEIIFDNSDGRLPVDDVEVALRRTIGVKKDEFFLNRKHIPKSDVQGKVNALAVMRERERLELLKEVAGTKVYEDQRTKALKILHETQAQRDKIQEVVSYIEERLSELEEEKEELQEYQQLDREQRALEYTMYEKELQKVRAEIEALDRHRQEEGALATDLHEKLMHVRAEINRIESAHRSRDQDLAQLVEDRKSREDERNGLMEARYKLEMEMKELKEQIRSDGVQRSAVSKEVEVVKREIAAKRARLTNEILPALRQAEQTHDQVARNLQECRAQSEHLIAKQSRKSQFLTQQQRDDYLQREISDIESLVRRKESDTASLRHSTEGLARSIEGSDRTLQEQIEELREHRRRVDAVGAEMLRLKEQRNYLNEERKGKWREENQISYDVRELTKKLNDGENALQSTMAYDVRRGLQAVREMQGRIRGIYGPLIDLVRPVDERYCIAADEAAGGSLFHVVVDTDDTAAKIMRELDKKNLGRLTFLPLNRLKVKEHFDYPHNDDVVALVEKLEYPAEVRKGVMTAFGKKLLCRDLDACVRYAEQTNMDCLTLDGDMVHRRGALNGGFKDLRRSRTRAMMEVKQAQLDLESVTERARRVNTEAQQADQRVTGVISEIQKLEAEKNRSISAHKNLCDEISRRKNHIHSEKENLAQRERSCELQEREVKDLAAKITSLRSELLTPMQDTLTAVEQELLHSLSAKISLFEAEERDQRQRLEEIRSREEGIKTVLEENLVRRENELARQLGEGIEELAISEREEALKAKQIDLEDASRLVDDNNSSLKDIERKIATLQQEITNENVQVDALNGEDVSLSDQIEQEARRAEKVLNKRRRLLKKREDSTKDIRELGTLPISELEKFKVLSYQEVIKQFKRRNEKLKKYSHVNKKALDQFMSFNEQRETLLERKREIDDAYSSIEDLIDVLDKRKDEAIFRTFKGVAGFFSEVFRELVPTGEGKMILVGADTDQSNNGTDGGDEESNVDTYSGVQIKVNFRGEGDSYLMQQLSGGQKALVALAFIFAIQRVDPAPFYLFDEIDQALDSTHRAAVAALIHRQAHSKDSPAQFITSTFRPELVNIADKFYGIGYQNKISNVYSMAKEESLDFISNIMAEEEGVADWSTMADPLEAILASMAAATGNASPPTKSATKTKDAAAASTGKKRGREVSSTPPSGIWNPQEEMPLDSAALDTLQDAAKTWSVSPAMEITRQQTVRVLCNKIRRASEDLGIGKLPNSAYETWQLTSQLTVKEQDPLIPHASSDYSGLFEELRKAGATKSGATRKCKELTKEAERMLRKFGQQDFVAGKKKKVHVADAGDDVRQLTYGNSTVKLSASHFAKLREMYARKLGLSGNGSSMAPKDQRQFESALFCLLLRYDSLDGGGFQVREAVAALNEECFDVLLKEFDCKMECFASPLNCRYSRFCSAFLDTDFAFGSVGSFFDFSPRSGSFEANPPFIPKVIKRMADHMTELLNAADGPLAFIVIIPAWHETEVLIIALGWQQLNSSRFNQRHLLIPQKQHGYCEGKQQIRKTRWRIASFDTSVFFWQNSKACSKWPVSEKKLESLQRAFKSKQADERDALGLRKSGKRARVAKD >gi|Lgig1000006095|ref|jgi|Lotgi1|133518|e_gw1.85.31.1 MNTKQKRSRVQGGWAAPKKQTARSDKDRVKPPNVPVWTSKNIDQVSATPQFLYQQPDEEIEYQPAPRQNCKGGVYDISDGPSGISRLRFFPNFINQSDANKYYDMLYQGLPWKQNSYVKNGVTHLNDRLTAWVGDLPYSYSGIVHPADPSWIPPLPTLKDKIEDLTGYKFNSLLGNLYRDDKDGVEWHCDDEPELGPQPIIASVSLGDVRNFELRQKPVPNTGDYTFSQIVRMPLTHGSLLIMEGGTQDDWQHRIPKEYHDRGPRINLTFRVIYPKP 
>gi|Fcyl1000016119|ref|jgi|Fracy1|233985|fgenesh2_pg.2_#_93 MPLWAAETRISIRSLASSGSIILPDPEQDAARMAALKRLRHKFVRLCSENNRSKPPVLAFERWLGRASLKRGISTSDGYDPIIPADSVMDKGFAKDISRTLPSWAAANAVAEEMTKEATKQIRGMATQREEIDEHKDLRVLRKKIREEAALNESTTKAQQALGGANNSAVGGGGKVVLNGNRRDGIYDVMLCGPCGKPRRPYLTISSLHLSKLLRLWKLKNKEGNDDDDDGNQIEVIEVENPIDNMNALLEDDRIMFTKSLYCCLARYEGLKGAGYQCAVPGVAFDAAIACGLGSTIECFARRYKQISRYQQLVLSFQVPNSEGCRHTNNVQALAVLWYVNNREKSIISKTSI >gi|Vcar1000006131|ref|jgi|Volca1|62740|e_gw1.31.78.1 MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAVPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDGNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVRRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA >gi|Pram1000006143|ref|77618 MAKQPKRSASPQLSGGSPTKRAKLSTLPSEEEFSLTSVMANVKKAAVEHAHLSEAVVSKLFDKDASVSYLTTDHRSWIYHVPRWYWHVFEHAKPEELQATATCSPVTWSTMFKQAWEAHPKEHDTIMMFGKPTKIPRFQLLCGEMPSYRYSGKTFEAQKRFPQGLEHAVQHMRRMVEDSVTQQTRLSGGLVNWYENGNHYIGPHADDERDMMACSPIVALSLGATRRFVLTKKTSKSAPQGDGAVTRLELQMEDGDLTIMGGTTQRTHKHAVPKMARCREPRISITLRCFH >gi|Mver1000006163|ref|MVEG_06150T0 | MVEG_06150 | Mortierella verticillata NRRL 6337 hypothetical protein (222 aa) MATIPGLEVILDFITEEEEQQLITELDAGHWAGRGIEPNPEMKRRHQHYGGVFSYRLRRVVGDMEKLPGMFDFITERLLQRRIYDRSPNSIIVNEYEAGQGIMPHVDAPKLFGKTITALSLLSACVMTFQHVKDPSQIYHIHLPQRSLVVMNGSSRYDFKHSISKDLIEHVDGLEIVRARRVSITYRDMLVEDRQQDRESDEAGSSCKELCGNGISSCTRS >gi|Spun1000006275|ref|SPPG_06620T0 | SPPG_06620 | Spizellomyces punctatus DAOM BR117 hypothetical protein (232 aa) MPKRLSATTNDTNATTKRPRTIPPSITPASMQYTLSHATGHLIHYPSFFSPTASPTLFTSLRTVLPWKTIPITIFGRQIDQPRQVCFIADNNTYYAYSGSGKMGTMLPDWPDALKEIKEKVEEVVGEKFNSVLCNLYRNGSDYIGFHSDNEKSLGPAPTIASVSFGASRKFVVKPNPGGPAREAGKLELVLADGSLVIMKGMMQKYWKHAVPKTSRKVGERINLTFRKVVS >gi|Fcyl1000106316|ref|jgi|Fracy1|180788|e_gw1.2.998.1 ; gi|Fcyl1000078436|ref|jgi|Fracy1|152064|gw1.2.998.1 
MATNLKRNLNLQFLTSISKKGVQVCSISSTTSTTISSNHARIPDPSFVNTLNAPFDFDASCAVVYNEYITSTEEDLVANDIKNKMKRRRYEKGHWDAVINLYKEVEIADDSFQEEGDEKNNYSEGIPKLFNRIRQQLAEHHLTDYYDDQESIHWLPCHAIDLKKDGELNAHVDSVRFSGDLVAGLSLLSPSIMRLIPCDDNDDDDNKNSENSTKDEEPYYVDMFLPPRSLYVLTGVGRYKYSHQLLPDGSIFHKTDTDIVVRRDHRLSVIFRDSKQPSS >gi|Fcyl1000016334|ref|jgi|Fracy1|234200|fgenesh2_pg.2_#_308 MKYLLPRPMATNLKRNLNLQFLTSISKKGVQVCSISSTTSTTISSNHARIPDPSFVNTLNAPFDFDASCAVVYNEYITSTEEDLVANDIKNKMKRVWQFLFVDYCQLTSNGICSFCDSAALKMNDESMKKKSRRYEKGHWDAVINLYKEVEIADDSFQEEGDEKNNYSEGIPKLFNRIRQQLAEHHLTDYYDDQESIHWLPCHAIDLKKDGELNAHVDSVRFSGDLVAGLSLLSPSIMRLIPCDDNDDDDNKNSENSTKDEEPYYVDMFLPPRSLYVLTGVGRYKYSHQLLPDGSIFHKTDTDIVGCEMVLQVLTSNCTYFHCDSNYLPLGITLLHGVVLAVLVYYMATTPYS >gi|Sarc1000006366|ref|SARC_06218T0 | SARC_06218 | Sphaeroforma arctica JP610 hypothetical protein (1129 aa) MANKKLASQSVQAGDESHSHSKPKWKHAKDERLVKRKTIEYNKATDVAHDGLETGPVKKRRKKETSRPFSVETNLRHQQSSESAAKPSKIDKQSEKVKKAKKDKKNKKTKKFKEKDKKNKKSKKSKESDDDGGEGGMHSDKATDGSVTEKKQKERKAKKRHRKHETVNTHTEQPQSAESDTSKDHEVVDETAEATFTSEAPPLGCVWNPNTKLSLETWPQSSHQKNVSTTSRVDPNLEIYRQKAVNRLQYQLETDCRNNRIRFNNVFFENWIFNCERASKSNQCASEGIAPDPVIPSQPTADGLVADLVGAGMQKTVAQGVARKLCAAAARETERMARTTAETSARARSVDCVEVVITGKKAREALLERATTDLRAFDLTYNQMTVRINKAHHDKLRLLHSRNAPQSERGDDSALNSRVFSLLVRYHTLQGGHVQGGGMQAALIEDTFDALLRNFGVNFECFASPLNSRYGQYCSMFADTDGPFGSVGSFFDFYPLSGSFEANPPFEDGVIHRMAMHIDVLLDRSDRENKPLSFVVVIPAWAESSGWQRLNQSTHLKRLLTLSQRDHGFCEGTQHSRPTRYRISTYVVPIVVGLAMTGDFFAYWVISAVMVGMVLSLVRLFMEFLNFVILAIRHVRENWRRLLVVRFKPVPLSEVVKSYAFFAYEHKLLIEKRVTSWIFTYRLQKAFRYFGGSYWALSILIVIQFVFVGLFEDSQGWIFLCTINYLVIYVLIWRKAAPWSEVAALAETEEDSQSTGDANNKTTEVPQTSAENLSAKPSRLARMASRVAGQDRYSTDLEGCEYIYRFIPDYHVRNRKTWLSLLLPILFISQSIWLCVASVEHQGGVAFIVAYLTVFYGLFWGLFYCFIMALWLKSRKVVRIVERREIAAEYVRIIKAMNEYLIYILMFVSFIGVVCGIVYLTILAEGTFLDGFLLFLAVFLLFSVISTITTKAFRRAYPMEAFWFLFILFVTVGVVLSVFSYSLQQNDDNNDTSEPISLVSAPIAPSPTSFASCDVTFGPTAMSIMDMAYMSKVAYAPMEVVQRELNTWFNANQTDGTGWYIHYNATTDGYPYFYDIRHNTTGMAIISVRGTNSLYDVVMDMDLWLEIAVLQMTDFFLPTLSLWPVGMSL >gi|Fcyl1000026428|ref|jgi|Fracy1|244294|fgenesh2_pg.13_#_88 
MSSSSSTESPPLVLTPPDISRSVIGRNINSRWISFIQNDELSCYSFYLPHYDDDDTSSTVVNTVGNNSNKHNDDESTTRTNPDNPNPDNPNPNPNPDPNPKTTTTARTSSIFTSKQLDEWFYKLHPSRYNNIDNDDDVDNASSASSASASAWTSASYKNQLLLRKTAWYTFNKECVCEYGYSDTWQKQIQSKEMINVLQEITEVVSSNLGQEEEELNCVNLNYYPSAGGIGFHADDEYMFDGLNRNTKIISLSLCCTTTTTSTCLHKPNNNYKNKNWGARKFQIKKRDNNKSESDDDDDNDDKNVVHEIILRHGDIVTMEGMFQKYYLHSIWPGDDDDNSIDYNDDDLCQGERINLTWRTIVKHLGSGGSSSIEEVEDFHGITCPLSLSLSSSSS >gi|Pram1000006453|ref|77246 MSDVGESAANAQPSLTDFVADEQPSKYLKLTMSRRIKDKAIAEPKLLPFLLQLLQIDPVPVVSSLDADSIVKRVLYGTGSRRVVLLELSDVSVAQKLREQLHEKPCELLGRRPMYAEFALPRKEQEARDRLLRAHNVQRNADPPGLRVPGLRFEAEFITKEQEAACVAFFERENGAHWANTIRARQVQHFGYEFNYDTRRCDPDEPMKEPIPEVLQPIMEKIAQSGIMDGDTPDQITINEYLPGQGIAFHLDTHSAFTTTIASLSICSEVVMDFRHPDGVRYEGVLLPARSLAVMSGASRYKWEHAIVPRTFDVIDGKQVNRQRRVSITFRKVRSGPCECPFPKYCDTPEREGQENTGDDDQEAITTAEASSLAPTALEQQFVHEFYETVAAHFSSTRHSPWPRVAEFVGSLPSGSMIADLGCGNGKYMKCVDAAQSFVVGGDRSSRLVTICRDRGLEAVVCDALAVPLRSNSCDAALSIAVLHHLSTLGHRLAAVKELLRVLRVGGRGIIYAWAHEQMKGSRRRFEEGRQDFMVPWNLDKRFVISNEDGSTTTETAEASQDVEPIEEGSQQDPSEDDGANRDDNTCDKSTAKVHERVLVQRYCHMFKQGELESLVGLAGNAEVEKSYYDESNWAVVLRRVS >gi|Psoj1000016465|ref|144652 MLGSRYLRRSLAPLRRLMSSSAAAASPASSLWQDVYNLDATHCHDPLVSEGDLQVLLDVITEDEEMVVADECARILKRRRYEEGHWDNVIIKFKEMERSRWSAETQRVLQKVREAAILPKELNYFPAVHVIELAEDGYIKPHVDSIKFSGRVVAGINLLSPSIMRFKEEHGDSVIDAYLQRRSMYMMTGRVRYHYTHEILPGAQVFRGELPVNRTHRISIMLRDEFLEEHVAKYHTPFAKPDVETQ >gi|Psoj1000016470|ref|144657 MSDSSSDEEDFFGRMESDDLFEESEVQQEKRREAQRYVEQYAERDWGLAARQCRAQGTNKDLVTESTLELRADKKVVFQEKQGQQAKVWDCALVLSKFLTNDAYFAPDFFVNKHVIELGCGIGVPGLAAAALGAKEVMLTDMDMAIPWIQVNIERNQTLGCISGDVRAEALMWGENAPLESHQFDVILCSDLVYGERKISEKLVQTIAKLSHPDTLVISAHEARFAGDRGGSFFELLSEQNFEVEQLAEDGYIKPHVDSIKFSGRVVAGINLLSPSIMRFKEEHGDSVIDAYLQRRSMYMMTGRVRYHYTHEILPGAQVFRGELPVNRTHRISIMLRDEFLEEHVAKYHTPFAKPDVETQ >gi|Spun1000006493|ref|SPPG_06848T0 | SPPG_06848 | Spizellomyces punctatus DAOM BR117 hypothetical protein (383 aa) 
MAADASGHSTGSLSNQKRILREAHKLARKRATQLDIFAKSENLPLSELVVDVENGQDPTRFIVILNAGSTGDVGGVSVDDLERAFGAFEGFLGVEMMLGKPYSYAVYNTPTAAFQAYTVLNNSPIPIPHSNSKPLLLTFTTRVSLDIALSDPKDVMEAVPGLFLLHDFVDCEEEQNLLACVKDNANSWVSLNKRRVQHYGYRFDYPLNTVDFDTSIIDPIPPWGQEILSRYCRTFPLHSTPDQLTVNEYWPSAGIAPHADRHSIFGDVVIALSLGSGVVMEFRRPEETPSTTVSNPCKPFSYHCINIHLPPRSLLVMSGNARYQWEHSIRPRRTDIVHGRAIERGTRVSLTFRNVKRVKGCECGWTHACDVGRDDTGRLDIG >gi|Fcyl1000046637|ref|jgi|Fracy1|264503|estExt_fgenesh2_pg.C_270122 ; gi|Fcyl1000031649|ref|jgi|Fracy1|249515|fgenesh2_pg.27_#_122 ; gi|Fcyl1000011937|ref|jgi|Fracy1|271662|estExt_fgenesh2_kg.C_270020 ; gi|Fcyl1000005027|ref|jgi|Fracy1|220849|fgenesh2_kg.27_#_20_#_0_0_CCUX4586.b1_CCUX_EXTA MTARNSDDAIASLMQEMGQQRGYDQYDQYDTTTATTATTTVVQQQATKLNNANGNTTATTTTDDDGDRQLLLRQQQQQQQQLLIPELVDYPNNQFMAGSQSIWNPLVMSSDGSGSSDNGSGDRYEYENYGLILPTNPQIELIRRRVYEKFRNEVTILLQDIEQKLLPSSLIGGGGGGGSFSKNKNKLPIPSMLDKWHMDSKLVEWNILKQQQGKHDDISGSISTINRMTTTSTVDILRELTGRKQQDGMTPVYDPILLNKQSPTIFVDSILKPNVEKLWEQYHYHNNGGSNNNNKGNNNNLNLPPKFNKKSKQIHKGLYRLACEANDSFELQLHQVVRQESSSSSSSTTTKNKKNNNRLPKISLSSSSDEGSSGGGSDSSSTLVRVTYSGVTLKLHSAYLEKLQRLYDRTQQRRQQQQQHQQRSQQQYQVGLSFEESLFCLLCRYDMIQGAGLQAGVPGSIMDILLKQFNCKKECFASPFNCRYETFSSAFYDIDFYFGSHGSFFEFEVEVEVTPPSVAVTAVAVAAAATKDEKEEGICYQANPPFCEGLILQLNNKITDILLSSQQHQHQQQQRPIMFVVFVPAWHESICYQALLSNKYLTQHLLLKQGQHWYAEGTQHRRKDSFRVASFDTSILFYQNEAAKRLWNLQKTDQKQKVSSSSATCSTDDNNKSNKNNGDANDDILPLGNAILDELKNSFCQDPGSMQKEEQQQQQQQQETSRINKPRRSNLVTTNTKSTTSTNPSNVVTSSTHDNKPSENTTSLSSFSVVQRKRNKKATKLSPAKKERKRKWNQQEEGKAQLNLLESLGLSSSAATIATTTKSSQAGSRTVSAEDNNNSSKKLNKLRPGGKTNNKEKSLPRKMKKKRPHR >gi|Bcir1000016642|ref|jgi|Bacci1|264733|estExt_Genewise1Plus.C_7650003 MDWKDLFGDEEEEEDNIEIKGLTHIKQALDHDQQMLLVNQLIEHGYFSNTNQAMCFGELPSYINWLIPWISTNHPTLFPREIMERQPLFDQAILNLYKKGQGIVSHVDLLRFEDGIIIISLLSSCVMTMRPANSNKKGYNQHQEQQDTLRRDILLRPGDILALSGEARYDWEHGIEEKLIDEIDGQIIERGTRISVTLRRLNE >gi|Smar1000006659|ref|SMAR007432-PA pep:novel scaffold:Smar1:JH431789:114970:119470:-1 gene:SMAR007432 transcript:SMAR007432-RA 
MLIFQDFISKEEENTLLMELEPYLKRMRYEFDHWDNAIHGYRETEKTNWNEENKNIIRRIKKLAFPEDTTTLQHIHVLDITKDGYIKPHIDSVRFCGNIITGISLLSTSVMRLIHEKNPEFKVDVLLPQRSLYIMKNLVRFEYTHQVLADKDSFHRGNHIPRERRISIMCRNEPSIS >gi|Vcar1000006689|ref|jgi|Volca1|45981|gw1.75.44.1 MFLRDEFRRVENELGRQFTFDAACNDSGDNSLCSRFASPSNSFFDTDVSGEFVWINPPYTHIKEWQQHYHRCKLKNPKTTSAVFAIPKWTQVECLMRSAGYQLLKTYAVGTQLFSQPVDVGTRSDLPGIPWPLQLWYDPPSKSVQLNSLDSNGRTMLFDARLCQTACKVLADSGANCGRVADGFISLEAVCRMALSVRPCTISTVRVANGTHELINGSITAKLRIGSFHETVTLLVLKQGVPGVEIILA >gi|Pram1000006748|ref|76882 MTAAYKASEKRWKRATDEDLQHDATLLDPRSLSEEQQAKVRCVGSWKWGLEDEERPVLAFDGFGASHRGFCVIPNALDGKMQLQFAHACLTEFAEEPHVTNMHLQKQQVSEIWRKASEAHPQNPAESPLLTKLCWAASGYHYDWTARRYYKDSFSPVPELLQQLGARCASACGMAMAAEAVIVNYYKQKSSMGGHLDDVEYTMDHPVVSLSLGCRCVFLMGGHTKDEPPLEVLLRSGDIAIMGGASRTCYHGVARVLPTPFSEEFDTLPQSDDDEREEYEAVRTYLSTQRININVRQVYPSEPASTD >gi|Smar1000006760|ref|SMAR013901-PA pep:novel scaffold:Smar1:JH431789:635068:636630:1 gene:SMAR013901 transcript:SMAR013901-RA MAAGVVDLRHKLRSNQRQTATALRRYEKDETNRYHRAYKDKAPESKRNESETDEYSEYDSRLKTLRKLHSGIQQRRLFTDAECQVIEQQIDEVVENGEKGFYRPQTVDRAPLRNKYFFGEGYTYGSQLSKRGPGMERLYPPGHVDVIPEWIERLVVKPIVKAKIVPEGFINSAVINVYQPGGCIVSHIDPIHIFDRPIVSVSFKSDSALCFGCRFSFKPIRCTKPVLSLPIPRGCVTVLSGFAADDITHCVRPQDTVKKRAVIILRRVRSDAPRLDPSEMSPLDTEKRSEASRKRRLKSAIVDCNVKDENDPVKDVAKIPEERKSNKKIKIKR >gi|Vcar1000006797|ref|jgi|Volca1|81146|estExt_Genewise1Plus.C_190101 MFLPDEFRNVENMLGRQFTFDAACNNSGDNSLCTRFASPSNSFLTSDVSGEFVWANPPYTKIELWHKHYLRCKRMRPETTSAVFVVPKWSQIERRMRAAGHQLLKTYAVGTKLFLEKADDGSRQEMPGIPWPIQLWYDPPAKEVQLNSIGSSQSMIFGAFLCQTPCSALIDSGADCKGTADGFISSEAARKMGLAIRPCSTSSVCVANGSSELIEGLIHAKLRIASFHDTVKLLVLKQANAGVELILGAD >gi|Ccor1000006818|ref|jgi|Conco1|72317|fgenesh1_pg.165_#_2 
MELNKEKWKIGKPLVWANTIQELCEGLDYFKSYQSSLYTNNSELKGLLLCDSIMPNDYFAWDVFITHGGGGMKASKEPGASSGTFVYRLSRSQEEDDTLVIPYLKLYKSQGRIPVILNSTTYDTGSVVDIPKYSVLGYYFITHTWIELEEGLSERCSRYKFRFERQESSYPRWWSVDKLIAEPDEIIKNEKCLVCNHYSPILYEIGFICFNKSCPRFYLHHGQVLPDYLAIKLGVLKIQFNNSESLNLPSLIPKNRLADQTNTLTDSIECKAYYCQDCYKLSSREYWHKWICTNCNNTIDIINPTISDIYSIKNDSIEFIMESFEMGMLFWNSDYFSIKRYYKGEQCRAVYKYDKNNSIEHITCHSYNKAFMDKLFLFCQNSQSGLDFKRYEVRKSRIRTRLLGNQFQYSCGVLYQHLLRTETIPLEEAPPIIRAILNYMQFYEPTADYNQITSLLYCKGQKMGWHNDSEEELGDKIISLSLGNPAKMNFKFKENDEKALSLYLLHGDILIMNGPDIQKKFLHCIQPDGFRIAFAARYLTRPRVV >gi|Smar1000006848|ref|SMAR015629-PA pep:novel scaffold:Smar1:JH431783:5867:6133:1 gene:SMAR015629 transcript:SMAR015629-RA MAALSIYFTLLAESYGKRLASTKFSYPLCMCQEKKTPSLIFFLETLFQHCSVFNKLTEEFGTPSIDLFASRLNFKLKIFYSWAPDPEA >gi|Mver1000006934|ref|MVEG_06923T0 | MVEG_06923 | Mortierella verticillata NRRL 6337 hypothetical protein (281 aa) MTAQLEWSINNSFFQHLDSTWGPHSVDLFAMAENAKVPRYVSWLHEETAWKQDAFSCSWKDLGRAYICPPWSLLNRVLEKIRIDRVQATVITPQWPAMIWYPTIRAMSTNEPIPQRSSDRPTTNLALTKTPAFYLFRSPDPVSILSDSSDSSEQSGPSSPLTNPENPSPIMATPLIPETFLKGLRPDTFNGRYRDTRAEDWLVRFERYCNAAHIPETGQDRILCAGLLLTDGASCWYDQLGTIAATTVNGQDLSAYQVFKYKFRQCFVNANDAEDAFDLI >gi|Bcir1000016958|ref|jgi|Bacci1|300658|MIX930_10_37 MPSSPIPDSSTLQGQPKRQRVAISEEQVKQIVDILGSMLIKIKNEIMDIKEKYAPTAAVNFLLRRMESSDAILTNDVYCLMTGFMTIYLVFIELLFKDLSNNSHNTKINPLEDFMQLFSENDTLFGSDNPFLWYYLYARTNTSILEDLDKQLTIHFTSLNHIDSIFSRFYTVYFLGEESNTQKKHQKDHGQYYTPHSVIRFMWDRCLLPSSSSLLRNGNIPRVFDPCLGIGSFLCEFLSRLVNACRFNLDIWNDPRRLYSILTTEIPESVYGIEIDPFAFQLCKINMLIHLFPFYQRLLELNVQLHQTSRIIQRIRLFCNDTLKLDKX >gi|Vcar1000006954|ref|jgi|Volca1|88423|fgenesh4_pg.C_scaffold_9000001 MVILSRVPIVDPVPTTSDTAQYVPSLQEAITSAVRAAMAELPPAPWSTAQQPHTGPYAPLPTITQPRPPYSNLLHVSRDFPPLIPRISTQTATARPVPVNVAFKPPPAEQVSHSLPIASNPARHVGCMRARVAAASMTEAHQAQISMDTIRQGSGLNSLVSYMAAFGRLMGDLPHRHELDHVFHFVKGLNRDLPTHLFRVASKPSESPTTCASSAWPSTSAYSVAPVKSALRPSSAETLSPFFSPSRPFLSTDIEGECVWMVPPVDNASTIVARYLDAKTANPKTSAIIVLPDRPQASWAPLIRHMTIVRRFPAGAQIVCRPTSSDPSSVLPPSPTALFKLRFRR 
>gi|Ccor1000007017|ref|jgi|Conco1|72468|fgenesh1_pg.176_#_7 MVEVNEFDYLPGATVIYDFISEAEELELVSNIDKSNWGGNGQYPNPELRRRTQHFGYLFSYRYRQIEKYLGDFPQFLKTINKKLVNIENGHIDLNSIIINEYEVGQGIMPHTDSAEIFGPVISSLSLLSPCNMEFTPTKQKTLELQKSDQKIPKINIHLPRRSLLIMKENCRYDYQHSISKNSIEYLPIINDGSLSSEEFTRDRRISLTFRTMLDTESFDNSNNN >gi|Fcyl1000037010|ref|jgi|Fracy1|254876|fgenesh2_pg.60_#_27 ; gi|Fcyl1000126553|ref|jgi|Fracy1|201025|e_gw1.60.71.1 ; gi|Fcyl1000121611|ref|jgi|Fracy1|196083|e_gw1.26.280.1 ; gi|Fcyl1000087922|ref|jgi|Fracy1|162208|gw1.60.71.1 ; gi|Fcyl1000087925|ref|jgi|Fracy1|162211|gw1.26.280.1 MAPPTIATFFSMAWSLMAAPECLSLATISVSQKKRVEVVEPGLVILRNFIDDEACQRIAAMAKDFGDEFYTVNKEGEKILNTGESRGRIYDAATRFPRDLIQLSNDAVSTSRAADTSMPAMQCTHVLLNLYTTSEGLVWHRDIYENDGKSDHPVVNLSIGATCVFGFKHLDTDEERTVELRSGDILLFGGPCRLIKHAVLEIKLDDAPEWMSYDPSRFSFTFRDSPEVLGREEEFKYFRVKEDLVGQDNFKVPTSSTDRKAFHGLPSYTTQQHVSMAS >gi|Spun1000007058|ref|SPPG_07458T0 | SPPG_07458 | Spizellomyces punctatus DAOM BR117 alkylated DNA repair protein AlkB (379 aa) MGRKKQKVLPPDSPLWANQTAFRVVERSWKRKEHAPDLLKTLVDPSRNAAIAVEAGLLKPVALSDDPRRISPHFGFSVEPDTPLPPAYELVNVPGLVIIPNMFSPSAQRTIIKQCLKEYTRHPNVTNLDTHWKLPSAGIWNLHELVRKGEIASGAEETLLRMRHDGTVEGALKNGYDSDVNEAGIKIDPPTTDTHHLTHLTPQDALLRLRWTSLGFQYNWSTKEYHLDRRPSFPPLIGELSTAIVEAVQGVTGYEARQWKAEAGIINFYGLKDALMAHQDRSEENAAAPLVSFSFGHSCIFLIGTESRDDIPTSVVLRSGDVLVMHGQSRLAFHSVPRILENTLPEYLFPHVDEAPDWDLFAEYMAASRININVRQVK >gi|Fcyl1000047105|ref|jgi|Fracy1|264971|estExt_fgenesh2_pg.C_350095 ; gi|Fcyl1000033484|ref|jgi|Fracy1|251350|fgenesh2_pg.35_#_95 ; gi|Fcyl1000012287|ref|jgi|Fracy1|272012|estExt_fgenesh2_kg.C_350010 ; gi|Fcyl1000005423|ref|jgi|Fracy1|221245|fgenesh2_kg.35_#_10_#_0_0_CCUX4586.b1_CCUX_EXTA 
MTARNSDDAIASLMQEMGQQRGYDQNEQYDTTATTAVQQATKINNANAYDNSTTTTTTDDDGDLDYPNNQFMAGSQSIWNPLVMSCGNSSDSGSGGSGNGYEYDRYGYGLILPTNPQIEIIRRRVYEKFRNEVTLLLQDIEHKILPLGGGSGSFSKNKNKLPIPSMLDKWHMDSKLEEWNTLKQEQKEKQQQQQQETQKHDELSGSDSSTINMMTTTSTVDILRELTRRKQQDGSNTMTPVYDPILLDKQSPTIFVDSILKPNVEKLWEQYHHHNGGSNNNNKGNNSNNFLNLPPKFNKKSKQIHKGLYRLVCEANDSFEIQLHQVVRQESSSSMSLSSIKNNKNKKKNNNRLPKISLSSSSDEGGSGGGDSSSTLVRVTYSGVTLKLHSAYLEKLQRLYDRTQQRRRRQQQQQQQQHQVGLSFEESLFCLLCRYDMIQGAGLQAGVPGSIMDILLKQFNCTKECFASPFNCRYETFSSAFYDIDFYFGSHGSFFEFELEVEVTPPPVAVATAAAAAAVVTKDEEEEGICYQANPPFCEGLILQLNDKITDILLSSQQQQQQRPIMFVVFVPAWHESICYQALLSNKYLTQHLLLKQGQHWYAEGTQHRRKDSFRVASFDTSILFYQNEAAKRLWNLQKTNQKQKASSSSTSCSTESYINNGDADDDVLPLGNAILDELKNAFCQDPGSMQKEEQQQQQETSRINKPRRSNFVTTNTKSTTITRSSTHDDKPSENTTHSSSFSVVQRKRNKKATKLSPAKKERKRKWNQQEEGKAQLNLLESLGLSSSAATTATTTESSQAGSRTVSAEDNNNSSKKMNKLRPGGKTNNKEKLLSRKMKKKRPHR >gi|Bden1000007191|ref|BDEG_07173 | BDET_07192 | Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (367 aa) MILKPWQVFKRKKKMLATLRKLHPEIQIITTEDYNTVEITNTISVDASCWLIALNASYDSKITGIMLADLFGRCSGFIRSEMFLGAMPHSFILFNSPQSAYNCYKLLDNTLISQWNNKLLLLAQLDKCCDPLPLFPQRPITTASDAALAIPGLYLVPDFISVCNSLDLVCHLKSHWTSCPISTDPAWKSLQRRRVLHFGYSFDYSRNEIDRTVVGSDHAQLPHMPEWSVSILDQYTKLFPQYPFPNQLTINHYFPGGGIAPHSDRHSSFISPIVIISLGSGLVMEFRRKSSLSDPTYTTVHVYLPPCSLMVLDGDARFAWEHAIRPRTMDLIDGNVVERSERWSLTFRNLRELHDVCQCGYTHLCN >gi|Lgig1000007221|ref|jgi|Lotgi1|142409|e_gw1.180.14.1 MNSNKSCGCKGIRSCLLCEDNKDKIENVSEKKEKVTLKYCDLCKKAWAWGFHPNHTGDSRDFEGIYLHEEFIDEILEAELIDEIDKTVYSDSQSGRRKQDYGPKVNFKKQKVKYEVFTGLPKYSEVLYTTLTALPILKDFIPVELCNLEYIPERGSAIDPHYDDFWLWGERLVTINLISETVLYFINDQDSNIEVQVTLPPRSLVVVFGEARHKWKHGIHRDDIQNRRIAMTFRELSQEFEKDGKSSEIGEKLKDIALTFNGTAVGTIPKSS >gi|Spun1000007268|ref|SPPG_07680T0 | SPPG_07680 | Spizellomyces punctatus DAOM BR117 hypothetical protein (281 aa) 
MPRIDELFQPSSKRRRSADEKADDESAQAPASKLPKRDAPDVSPAINVALRRVIKKSPGLDLLYFKQFITKSMAKEFMTWCLESLNWYKVQYKIRGMEVNTPRFTTVFGIDETKSPASFYKQPPRPIPMPLQILKAHVESATGATYNFTLLNYYHDGSHSITYHADDESFLGPNPSIASLTLGGTRDFLMKHKEDKSKKEKFVLEDGDLLVMQGTTQKEWLHAIPKRATASPRINITFRKAINVAGTNNYYKYNVGDGASYRYINGKMVPSTDAMTGKAL >gi|Lgig1000007292|ref|jgi|Lotgi1|143038|e_gw1.192.7.1 MESKKLTKSERKMNKKISRSRQLLLKHEKIETTEEVSNMLCVYNGGLIMGNSQEKIQDLFNPHGTIQDITMVPEKPYCFVCFERQVDAEKAQSILNGFSFMNGETEYKLCIFYVSQVPPSIGPTTALPPGLILIPDFIPETLEQELLESIDWSTGVDQGNVSLKHRKVKHYGYEFRYDINNVDPSCPLNEGIPSRCLKLLHQVNDLGIVNFIPDQLTVNQYQPGQGIPSHIDTREAFEDGLMSLSLGSQVVMEFRHPKGDHLSVLIPPRSLLIMTSDSRYVWSHGITPRKSDIIPASDGKFTLSQRGIRTSFTFRKLTENAGIYMNNNQPVQDKKEKKAVFSLPRNDEDAVKLEQQHVHQVYEEIADHFSSTRHSPWPQIVQFINSQPPGSIMADIGCGNGKYLGINENVYSIGSDRSYNLASICRFRNFQSLVSDVMCIPLRPDSFDVCICIAVIHHLSTWERRLGAIQELVRILRPGGQVLIYVWALEQQRHKEKSKYLKQDKNKLDSNENSEVENEYSANNMQINSQDSANNMQINSQDSDTGSSKNIGIHVNRTEFQEQDMFVPWQLKNKSVKNAATFHRFYHVFREGELEDLCEKISDCRIIKSYYDQGNWCIILQKL >gi|Sarc1000007323|ref|SARC_07150T0 | SARC_07150 | Sphaeroforma arctica JP610 hypothetical protein (147 aa) MSRVHEGTPESYEPEILCADSWKLFPDYIEQAQQLFGPNNVDVFVSPQNEQLLDFWIKDEQKGAFEHNLSLVDGCPPWQLIHKDGASATVCLPWLPRAPWFDLFKQLLQSEPVLVPRIPHTFLKFGKYACGKILGSHSDRKNRPAQ >gi|Spun1000007354|ref|SPPG_07764T0 | SPPG_07764 | Spizellomyces punctatus DAOM BR117 hypothetical protein (742 aa) 
MVGGGPAVWCETRQELCEGTWYFKAYQSGVYHRGGVVQGYMIDGFPGQRDAWSERVFISHGGGKNDPLRQRHSVRFEDPMRKEDLGEATLRASQEESDRSIRVLMNTWREKRPFVVVMGSEYTLAGFQASYRYCVLGWYVITQTWAEKEFPSVLPQEPNAPDYFLRYKFAFQFLPRNTIKPWFMTEECVRPSTRHTYNCALCGKDSPKVYTAGICLIPDCERFWKIQDGEYWSDAHELEIDPEFLCPVQLDSDLETLIPESIIPPPPPLNIERFTPCALWCRRCGRITCREIWDKWICASCGYLKTYPNTPVPCATLSSTSTQWKDHLGCLAAARDSGIKRYVDVIDQGICRGLIRATYVLPTGGRIEHILADPHHKLAKIGSTIFEQYQTGVVPLRRWRFSAHRVKGFLTQNFSHNAGEAYRHAISMPYSHFGSAPEPVRRARSILQAVSPHTQINELLSIFYMDGQKMSWHDDGEKEVQGPIIGWSLGSDSIMRFRRKMKKKKCIFDIGRAKAAKQRRSARPQENRLDSKPVQTESMEPIQKKQSEQSTANIENSLHTISGPVSAKIDEGETHIPRKTVLKLVTRHGDLVIMYGKAVQTHYEHAVEPQGLRLCVTGRTLRTLTPLSDEAYPADPPWSTSPGSVPLDLNLPKEGQPQSIDDNILGAARLMRLADPLWNSCSPLDMEFDEIFAQSDYEPDISSSDSERHDSVLDSETSDLERRFTCAGLLMRISRSLTVDN >gi|Ttra1000007412|ref|AMSG_08546T0 | AMSG_08546 | Thecamonas trahens ATCC 50062 2OG-Fe(II) oxygenase (330 aa) MGASAVDDEPWASNAPHTSGGPTIAEPVEAVPGLFFIADFVAPAESDALLDFLAACDRWKSMGTGRRVLHFGHMFDYATESVVPIPDPDCPDDVFMPAALAPVVASINALVTPDGSPLPGGPQAYDQVIVNGYEKSQGIHPHIDRTHCFGPVIAALSLGDDAVLTFLRDTQDGRVVYDVPVPARSLYIMTGDARYAWRHGLDARTNAAVRTGVERVSITWRTVVDSETAAAVTPSANSLPGVIIDALRPETDLVIEGELVGRVVGRRGATVRALSGRLGSQLVVGTLAGDPSVGLVRILDRGTTSLEALQTELDAILDPFRSSGAAAED >gi|Bnat1000017417|ref|jgi|Bigna1|21597|gw1.33.39.1 ESLFRLLARYHALSGHGFQAALNETAFEVLRKHLSVNFECFASPLNAYFPRFCSAFPDTDVPFGSSLGSFFTLPEDGSVEGSFEANPPFISEVMTAMADKMHRMLAKAESSSRRLSFVVIVPGWTDEPSWKKMSSSSFLSKKLLVARDDHGFCDGAQHQRRDRYRESPYDTAIFVLQTTSAREAWSFTDDAERDLRAAMAEAHPTPGAKLRRK >gi|Ttra1000007497|ref|AMSG_08656T0 | AMSG_08656 | Thecamonas trahens ATCC 50062 hypothetical protein (512 aa) 
MHTHDPPTAKASALGLAWRRLQLGLTAAGCGFGSYEEVAAALKENADVWDEYVSLVGVDLPASKCLTAALEIMHAALATEPLAALVAEACGRDGSEPFAGWEPVIRATGSETKWTLSMPLPEALTDALHNSVWVPKRRKRKDRFVPTKSLAVAINVEHVTKLAARYTGPRPAEGDVDVELLFLLLFQYNALEGGGFQAALPDPVFDLLAAAPFHADTEAFASPLNVTLPRYHSALPAIDAAFGSSGSFFDAMPDAGVVEANPPFTEPFITRMLAHMHKCLAAASGPLTFVVIVPAWRQSPAWLALTTSPHASRTAVLPAAEHAYCEGKQHLRKTRFRLASNDTSIVFLQNELAAASLTITDAHVEAIAAAFAAPLETSRTAHATGADSRAFRANRAQERKDALSSHVRSAVATSNWASLSKSMKGKGKGKGKGKGKDKDKGKSKDKGKSKDKGKSKDKGKSKDKGKSKDKSKSKSKSKSKSMDNDKDRKRRRHGDDVDHASSRQSKRSKRS >gi|Sarc1000007502|ref|SARC_07328T0 | SARC_07328 | Sphaeroforma arctica JP610 hypothetical protein (346 aa) MPTATQNILSVQVLQRHNYVIYLPSEPNKFVPTECHILQNDELLHRNGQYTVANFTATSLENWHDRLGHRCDPRNTLTTSNTTPTTPAKPSHDDWKLNPTLVKTHVTDVFGEIGIDLFAQKHNAQVPVYCSLEPDAPLHDAFKQSWQQPNTHLYGNILSVSKHGEMHFSSLRRKMVLSLRGFRFNLTRATTTKRNFSYTHMWNSSKPPPTRSYRMALRIPDRLSGDESTLPPVGTVFHVDLHTGLPTSPTGYTCNIKYVDANSGQPFVYPLRHNDDASATLDVFYTDIGKDYLANLRELKCDRGGKFVSKEFNSVNTLQHVKVTCIPTNTPQLHGLVERTNEQLV >gi|Uram1000007539|ref|jgi|Umbra1|252871|fgenesh1_kg.52_#_84_#_combest_scaffold_52_106915 
MQNSIISQYSSIAPHAGASISFNAPKRKWSSVLSSSDSPTPFHDNFLSQIEQHQPRYDASESITPVYKNPSALIVPLRFPTLDKGSARSLSRSSSSSQVPTRDPSPSTHSQETNTSGDRSSAISFSLIVGVLASEMMSVKEAILKSISSKLLEPCGQKWPVDEKFSSFVYQQLQTHLKKETPQGQNSAQLVRSDDPDVRNAVEFYCLLVAFMTVYVSFLEVSTSLMASKESLGFLALPFETIMNSFDPFDNIFHDASEDMYFWYYVEQRYFLGSVTSSICKELQVLNFTVEKPHQAEAILSSFYTNHLLRFAAQRHQKDHGQFYTPTSVVDFMWTTCMKDDADWVSNVLHSYCPSVLDPCMGTGSFLSSYIERIVQCLQERSTSWDNSDALKTMINSMCSNIWGIEIDHFVVQLGKLNVMLHIFPLLCRWMSITGQPLDFRLPRLNLFCNDILTLSLPPATNHFNNWEFEQLKKLRDPEVLKFEYMVTNPPYMIRKTGFISVPDTSLYDMSLIGGRGTQAYMYFMWICLQRCHPLRGRLCFITPSQWILLEFAKNLRSWLWKNYILDVIYQFEPYKVWPKIQTDSLIFRLRPRSAAHLEEACTLFLRHEDRTLTLDSVLSDYQSFIFKDPTTHTRISYRLTPATIDALENVKDCSFSSLTPASPVSEQMKQLTQDFIKICGGKNDSSGIQSPLLWNRGPNTNPVYALLVRTEWALKTFGTDVVEKWMRPAIYWNGKRENSTTGKSESKEILFWKDRDRLRVSHKENSPAEAYVPFRFEDLANEDKHYSMILVDSANASILERDGTDQPLYQYLKDAREKLQPKQIDKEIAWCPFRQCGIESPIKIVHPINFGYFSRSQPRQRFFLDTRRQCVTNQCMYFTIQPTTAIQDPLFYLGLLNSSTVQFFITIHCCYDQQGRTRFFAKNMANIPYPPSPSFELVAIMVKLVSRISSVRSMVYMVARERRMRILVEKLRQGRWDLSCTACDKSNHTSSEHTSPVGQPVEEDIRICPNKNICSCLRVASLLQYGVDQLSYLLYGVPVDTQLAVESELSIGTFTDFETVFPRQDAIIPAWYDRIIQYSDLILSSVDL >gi|Bcir1000007546|ref|jgi|Bacci1|282602|fgenesh1_kg.13_#_14_#_Locus8775v1rpkm8.87 MIFNKDYKSQVPGLILIEDFITEQEEACLVSYTEQGTWSGLGIGPNPELKRRTQQYGHLFSYRYRKVMEEYGPLPDFATLLVDRIMENQLMPNTPNHLLINEYNAGQGIMPHTDAPALFGPSILSLSLLSDCIMQFTCQDQAVDIVLPRRSIVVLTGDARYKYKHCISKDLIETTPSGMTIHRDRRISFTFREIVSWEDTKKCNP >gi|Uram1000007702|ref|jgi|Umbra1|253628|fgenesh1_kg.54_#_164_#_combest_scaffold_54_109393 MQILPGLTVVEDFVSPEEETHLVQCCDERLWSGLGISPNPELKRRTQQYGHLFSYQYRKVLQELGPLPDFVTPVLDRIADQNLSPPPNHLLVNEYEPGQGIMPHTDAPSIFGPAILSLSLLSACVMKFTNAETGRTVDVLLPRRSMLVMTEEARYNYKHSISKDLIETLSDGTTIERSRRVSFTFRQIISFTGECSK >gi|Mcir1000007727|ref|jgi|Mucci2|85076|Mucci1.fgeneshMC_pg.8_#_475 MRTIDLSDKIPGLVLIEEAVTESEEARLIESVNKETWSGLGIGPNPELKRRTQQYGHLFSYRYRKVLEEYGPLPDFTDFLVNRIMEYKWMPRKPNHLLANEYNPGQGIMPHVDAPALFGPAILSLSLLSECIMKFTCDEQSIDIVLPRRSLVILTGDARYKFKHGISKDLIETTDSGIVIERDKRISFTFREIIAWEVAENAPCCHGNKQ 
>gi|Fcyl1000027737|ref|jgi|Fracy1|245603|fgenesh2_pg.15_#_440 ; gi|Fcyl1000118033|ref|jgi|Fracy1|192505|e_gw1.15.550.1 ; gi|Fcyl1000089800|ref|jgi|Fracy1|164179|gw1.15.550.1 ; gi|Fcyl1000088409|ref|jgi|Fracy1|162731|gw1.15.516.1 ; gi|Fcyl1000117801|ref|jgi|Fracy1|192273|e_gw1.15.516.1 MTRSSMEPIAELSVADSRFLGFCFNFSTIKEVEQCQRTLKEQYPTAAHIPIVYKFSSNNNNKEGWDEDQEPSDSVGPGIMKEIIKKKQQQQQQKKSDDDDGSSGDDENKNKNNLVVAVVRFWGDTLLGVTCGRLPQCYQSIARLVLHRYCYSTSTNSNNNKIMQPFELEILNNIENSIYGLGAGDCELIVNIVPDDNDDDLLLVDKVKMELNFEGFMGAAGEVLPRLQNLQADLTQNLIPIYRYPGNYSGDSWKTFEWSPTSLIIKEAVEDNLLLPQTMNHCVTNYYRDGTDFIGHHSDKDLDLNRDGAIVSVSLGDERIFELKRRKDPKDITRIVLPPRSMLVLGPITNKEFSHSILQNVDSNKTRISLTMREVKTFKDLNTNRLFGQGVRNKSLQQLRKRHLIENCALFSGFCTLSALMVSKIKINNAINNTNTCLLMTGIFATGTLSVRLLTNTWYRQQEEREARDFFSKSSMSGTKY >gi|Vcar1000007748|ref|jgi|Volca1|100029|fgenesh4_pg.C_scaffold_102000028 MPPPVPHISTRPGSELFKDGIDVIVSNSGSITNYQETIAAAVRAAVADLPNSTPWPTLPQPHPGPSQAYAPVPSFAHPRSPYSTPNLLHIARDFPTFDPKEPRADIVGWDILMRHNLDMAGVPVDSPEAAKIALAVLRGTTGDSLRRLNSDPATRFHSYHAVLQALTPLAPVIQHKLDAQLAMFELTQGSGPNSLQRYIAEYKRLMADLPYRHEKDHVLFFARGLRDDLREEIFSRIRHLGSYVRLQDLIDLALTISTGRDAAHRIRSDVIPPRTPPTRIAMVSTVAPIHSAAAPLVPANVALGPPPAEQVSHSVPVRFAVQRSTEPDTPPPPFNVSRLNQIATSTTGAQRVDSDWMLARTIFLDLDSQYGPFTVDACCDDFGINAHVVPFFSPSRSFLGPSRRRYLESKSTNPRTSAVIVLPDRPTAPWTPLIRHMTVVRRFPAGARIVCHRDPSDAS >gi|Mver1000007812|ref|MVEG_07803T0 | MVEG_07803 | Mortierella verticillata NRRL 6337 hypothetical protein (408 aa) MLQRLGFLVNNTKSVLQPTQILKHLGFYINTCKMILTLPKDKVRELKREATKLWTSTMTTIRRLASFIGKAQAAMLAVLPARLQTRHLVACKDCALASGLNWSDSIQLSKEALKDISWWRNHLQSWTGQSFLPQIPEMDLFTDASDWGWGIVLPYKVISRAWPPQESQHSINWCKLRTVLHAVHLPEVQGKVIQIHSDSTTTLAYITKFGGTRSTLLMDLACQIWTHCLQTGTRVMTSFIKSEDNPADRLSRALWLQTEWTLDPTLFQQIDKMWGPHTVDLFASRSNTQLPRFISWKPDPQALAMNALTVPWTQENSFACPPWALISQCLMKIQREQLTLTLVTPYWPSAIWFPTLKNLALHPPLLIQSQDRLAHSPDGTLLDWTLAAWCISGSGSRRKVHQRLLSK >gi|Bcir1000007834|ref|jgi|Bacci1|265756|fgenesh1_pg.3_#_153 
MYEWKLPRQWMNKIQIHWHSLCIDALASRVGKRLLIFWSHRMDLGAAATDAFMQTWPKQELFFHSPWKLIPHGEISQFKIKNVVPSDNGLSIFIDQSKEGGIKFTHIGYQVEQLAEYCLVRTWKLFMYKTKYLQHGPDAFLFLYYIEHHNNKFRPAKAASWLKQILKDSWIYTTLFQAHSFNFPTQYEFFY >gi|Vcar1000007858|ref|jgi|Volca1|105574|estExt_fgenesh4_pg.C_300114 MALVPTDANRDRSRSRSAELPSNVHESPFTTLTPIPPMVNPVPTVPIVDSVPTTSVTEPYVPSLQETIAAAVRAAVADLPNSTPWPTLPQPHPGPSQAYAPVPSFAHPRSPYSTPNLLHIARDFPTFDPKEPRADIVGWDILMRHNLDMAGVPVDSPEAAKIALAVLRGTTGRRSSRDTPSGSYVRLQDLIDLALTISTGRDAAHRIRSDVIPPRTPPTRVAMVSTVAPTHSATAPLVPTNVALGPPPAEQVSHSVPIASKPARRVGCMRARVAAASTTGAQRVDSDWMLARTVFLDLDSQYGPFTVDACCDDFGINAHVVPFFSPSRSFLSAQVDALKQHNRLHQPPLPPVDENTPTSRSTTLDMVKQWATDVGGGDCNSHDDSKGHLKAQPFWT >gi|Vcar1000007918|ref|jgi|Volca1|58624|e_gw1.11.237.1 GDNSLCTRFASPSNSFLTSDVSGEFVWANPPYTKIELWHKHYLRCKRMRPETTSAVFVVPKWSQIERRMRAAGHQLLKTYAVGTKLFLEKADDGSRQEMPGIPWPIQLWYDPPAKEVQLNSIGSSQSMIFGAFLCQTPCSALIDSGADCKGTADGFISSEAARKMGLAIRPCSTSSVRVANGSSELIEGLIHAKLRIASFHDTVKLLVLKQGNAGVELILGAD >gi|Smar1000007917|ref|SMAR006464-PA pep:novel scaffold:Smar1:JH431701:250462:251748:1 gene:SMAR006464 transcript:SMAR006464-RA MVTLPENRILNIKTSISKILKLDKVTVRDLASVIGKLNATTLAISAAPLHYRGIQLLKAKFLRKGHYESTCSLTSEVKNELIWWMRNLSLCKGRSLTPKPLKLVIASDASLEFGWGAVSNGVSIGFKWEKFMLSKHINILELIAAFWGLKSFALNLQDATVKLKIDNTTAVAAINRLGSPRSPEATAVAQDIWSWAFKRNLTLVAEHIPGSKNCLADAASRGAVKDSGDWKLDSIVFSCVLETWGPFSVDLFTNCRNYQFKLFFSWLPDPLATGFNALNQEWCEGLPYAFPPFALISLCLQRLRKSPSLQELVLITPVWPAQPWFPLLWQLSTQFPLLFPLFDELILDAQNAPHPLIINGALQLAAWRLSSPSSKTQEFQSKLPNWWRRLGSNHQGADMICIGKIGAFGQENNFWTLFNHLQLPSQNS >gi|Smar1000007923|ref|SMAR006470-PA pep:novel scaffold:Smar1:JH431701:276342:276938:1 gene:SMAR006470 transcript:SMAR006470-RA MTSREVDWFATRLNYKLPKFCAWGPDPMAWKVDAFAQNWSNIYGYAFPPFSLIPRIIQKMNRDQADLLIVVPLWPA >gi|Sarc1000007962|ref|SARC_07773T0 | SARC_07773 | Sphaeroforma arctica JP610 hypothetical protein (360 aa) 
MPPKKSTLEHFFKAPPSKRTRNAASPSSPSTTTKSSTDSLVPSSHPTYQFAIPQLPNSITESLEHAIPSLDNGKRVDDQPHLDLVHFHRYLPRGIEASIFDFLRENLFFYRVQYAKKFGNKEMQINTPRYTTVFGVDESSFFNAQPGSDSGPNVCVECMGTGACNHGVDTRTTQVIFDSKSKCPVPKDRYSCRPRPIPDGLMFLKKMTEASTAQTYNFCLVNYYADGNDSISYHSDDERFLGANPAIASFSLGTNRDFLMKHKPDKKKPAKTENVLKDKPLKLTLDSGDMLLMRGAKKAKWLHSIPKRKGGESGNGRINITFRKAMVPGGTENYYKYNVGEGGTFKWSRKDKNMIPWTS >gi|Hrob1000017985|ref|jgi|Helro1|86209 LPLEVYYVPDYVSEICEGELLSQIYNVSKTRWTQLSHRRLQNWGGTPHIKGMVEEKLPNWLAEQCSRLASLGLYGGKTPNHVLINEYLPGQGIMPHTDGPLYYPTVCILSLASNLLIDFYRPHNHHHHLQQESISKRSKMADRRVGSLYLKRRSLLIFRSEAYTNYLHGIRDTTNDHIDDKVLNLGPDFYGKDLKRAEARISMTIRVVPKTLNASKLFGKIR >gi|Vcar1000008009|ref|jgi|Volca1|45734|gw1.11.234.1 RSVFLRLQKASGRVFTFDATCNGGSDALCPKFACSSSPIISHDVSGQHVWCQPPPKCVNDWLDHYSACKQRSPESTSAIFVVPKCTQFEQTFQKRGWTLLKEFLSDAHIFSVPKSGGGRTRLRSNTLVTQAWLDPCQQ >gi|Bcir1000008046|ref|jgi|Bacci1|215245|estExt_Genewise1.C_420006 MRQASLTNILHRSKQVFSKLNHKRHSQQPLDSTQLYKLGNRLLFGRHNETQNQPLAISYFLKAAQLGNARAQGVLGFCYEFGLGVETDFVKSEAYYLKAAKLDDGLSMARLAFLRKYGRPNVKIDRAEAEEWTEKVRHRPNAIQWIVEAASLTGDPAAQYVLGVCYHDGISVAKDEQAAFRWYKASADQGNARGQGILGYCYGEGFGVAKDEVEAMKWYRLAALQGETVAIYNVGYCYEDGIGVDKNVDEAVKWYKLSAEQGNAFAQNSLGYCYEDGIGVQQNFKEAAKWYKLSAEQGYPWAECNLGYCYQNGIGTAKDDSSGAYWYRKAALQGHARAQHNLGFCYQNGIGIEKNEKEAIKWYRRSAERGNIFAYHSLGYCYQNGIGVKVNERESVFWYYLSAEENHAPAQLSLGYCYRNGIGVPKNEGEAVKWFQRSAEQGNALAQNSLGFCYEEGLGITKDMTMAVHWYTKSAKQNNPWAQCNLGFCYANGIGLEQNNVKAVYWYRQAAAQNHARALDKLGTHLLNGVGVERDLKTAFELFQKAAQADHVAAQYHFANCFEKGLGCEVDLTQATHWFERAALAGCRNSHERLRRLIVRECLLSPANSPLLSNDDDFGYGGLTIGYSAPAA >gi|Fcyl1000088053|ref|jgi|Fracy1|162353|gw1.27.225.1 ; gi|Fcyl1000086727|ref|jgi|Fracy1|160956|gw1.27.207.1 FEESLFCLLCRYDMIQGAGLQAGVPGSIMDILLKQFNCKKECFASPFNCRYETFSSAFYDIDFYFGSHGSFFEFEKEEGICYQANPPFCEGLILQLNNKITDILLSSQQHQHQQQQRPIMFVVFVPAWHESICYQALLSNKYLTQHLLLKQGQHWYAEGTQHRRKDSFRVASFDTSILFYQNEAAKRLW >gi|Smar1000008099|ref|SMAR006255-PA pep:novel scaffold:Smar1:JH431682:46249:48654:-1 gene:SMAR006255 transcript:SMAR006255-RA 
MAVCFNLSKSGSPDLSRVIHFNLTDHDAQKYEFEVYPLNYEMKHADIEEIGLLFIVNPFTAAGQRYWIQRCYLDYPNSPNVTNLTGKKHIWSKNDVHTWKYLRWTTLGYHHNWDTKVYSLDHHTPFPADLASLNAFFAKILQFPPYFAEAAIVNYYHLDSTLAGHVDRSEFDLEAPLFSISFGSPAIFLLGGQTKSEEPSAMMLNSGDVVVMSGQSRLSYHAVPKILSSEATPWLELENHEQERPEWIHVKKVICQSRINLNVRQAQTETISKSLLTT >gi|Chet1000008148|ref|jgi|CocheC5_1|32311|estExt_fgenesh1_pg.C_180212 MVFLDLVLIRSSFHSVRQHVLPRNAFVASLPSWNCQRMQLNAFFIQKEHWRYQSMRKADLENDPDIFDLSKRNEFSQDWKDIWRPAGVIPATQIEAACMAYAGGKPLSAPVQDAQIFEHRDFPGLQVISRLLPPETQVLFTSCLMHRDLADPGHKINLQADFDIPYPPKPTSEPSRFDSNFFLRDRAAETDCLIPKSPDKQKPLNNEQFLYSKLRWLTLGEQYDWPTRSYAKHATPFPDDLSRLVTGLFPHIRPESGVVLMYSAKDFMPVHRDVSEQCQRALASFSVGCDGIFIMAKGEDDGQGENAPRSVAIRVHSGDVVHLTGDARWAWHAMARCIPSTCPPHLASWPVGTPGATPAEEKAYKKWKGYMSTKRINVSCRQVWD >gi|Bcir1000008318|ref|jgi|Bacci1|335164|estExt_fgenesh1_pg.C_1200041 MSLSHVVELNESSMPSSPIPDSSTLQGQPKRQRVAISEEQVKQIVDILGSMLIKIKNEIMDIKEKYAPTAAVNFLLRRMESSDAILTNDVYCLMTGFMTIYLVFIELLFKDLSNNSHNTKINPLEDFMQLFSENDTLFGSDNPFLWYYLYARTNTSILEDLDKQLTIHFTSLNHIDSIFSRFYTVYFLGEESNTQKKHQKDHGQYYTPHSVIRFMWDRCLLPSSSSLLRNGNIPRVFDPCLGIGSFLCEFLSRLVNACRFNLDIWNDPRRLYSILTTEIPESVYGIEIDPFAFQLCKINMLIHLFPFYQRLLELNVQLHQTSRIIQRIRLFCNDTLKLDKDSNAFTNNIDSFESDCLSQLRDSSKLKFHYIVTNPPYMIRKTGFITQPDPQLYNEHVLGGRGTQAYLYFMWICLQRCDDSLGQVCLITPSQWTVLEFAEHLRNWIWSNCRLLEMYEFEPYKVWPKVQTDSLIFRVCKRSCVLPHIDKTIYLRHTSPRRIPLKELLACYGNFNINQNNPDIMFKCTSTNDHLSTSLSTILPTTSVLDRLTALTEHLPRICDGEGRKNNNNSNTSPLVWNRGPNTNPVYSLVVRTEWALPTFGLKACARWLRPCFYWNGKSSLSESGGGKEGQFWRERDTLRLEKKEFSAAEAYWPFCNLTKSKVSYYSMIMVNKEDAETLQREYDQWGEQSDSASLYHYLQEARIALQPNKKDPLASCQYNKSGAEHAIKLVHPINCGYFTRSQPRQRFFLDKSQMVVTNQCIYFTIKPESKIQDADFFCGLLNCSLFQFFIKSTCYYDQQGRMRFFGRLMANIPYMPPDDSTITCCVSRFSQGITTCRTWLYAIIRPTSNTKGLMERIRNNEYKLSVAELDLIKNMDHIIEPDSSLPAPLDEGHFPWVHDFVLSKRVSMDKVFVILLKTTCLFQFAIDQLVYCLYKIPIDLQLGIEDELNLKNNRMKELPPILENDANAWSESVIDFAKKLTLISNEHLIL >gi|Fcyl1000018322|ref|jgi|Fracy1|236188|fgenesh2_pg.3_#_923 ; gi|Fcyl1000107847|ref|jgi|Fracy1|182319|e_gw1.3.1101.1 ; gi|Fcyl1000089018|ref|jgi|Fracy1|163370|gw1.3.1101.1 
MVTPNNRRRKITGKKQQKRLTSYFQDPSTTNSSSLMSSSSTTSFRKRQRASSSSTSSSSSNAAKFRAQNGGSSFGVCPICQSTIAWHILESHASECHGRSKETYENDDNNNSKKVIARPGQSTMIATATTTNNYNTASSRCYNNDVIIGSLKGESKSASTSASTIHQLSPSSSSSSSATQQQQQQQQQLPRPHTFEPIPGLFVYEDFITEEEESMILYGIDAVDTLPWKLSKFNGKHIGKRWGVHCNLRDRRVDAPENPLPDIIQQIVLPKLKRLLFQKGKTKNTTIPNEANSIDYRRKQGHWLQDHVDDRKLSKEVIFNLSLIGDCYMTFKNIAKHRNIAVPKQRVLLKRRCLQIITGKSRYDFTHGITNSDLLSDRRISITMRESPLTKSKPKPKPITITKSATTTITTQERIPE >gi|Bden1000008328|ref|BDEG_08304 | BDET_08329 | Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (371 aa) MSNRDKKRQRKQQQQQQPANSQQWSNQTPFRILERTFKRRQTLLKDLQPLLLDLDLESPSTSFKTVELSPPLILSEPQLDVPGTTRVIELTDIPGLFILKRAIPPSLQRQLVQECLEKHCKVPNLSNLDAHYLIPNIGIWKVYQQSKRLNHAELIHPRSIENTETMESTTADGAMKIDPPTDAMTAPKSMSVDAVIHRLRWVTLGHQYDWTRKQYHFDRLHAFPDTIATITHQILAFTQELTGYSSSQWRPEAGVINWYHPGDTLMGHQDRSEVDMTAPLVSLSVGLSCVFLISPCESKDITPTAIRLDSGDVLIMSKSARRVFHGVPLVIPDTCPDYLMHGTDSEWNAFAEWMDHSRLNINVRQVFPTI >gi|Ccor1000008322|ref|jgi|Conco1|123389|CE35304_1103 MKCTKCRMMGATVGCFNAKCPRIYHLTCCDKNPKLFLQGYIFYCPKHEAIENKKQTYEEYYHCDHCKNSLPRANTLGPFPDYSPDEWFTCQACVEENFFSGFDLCTECFTHKFKSIKHNHKANRFIRTTKEKLEVLLDVLANNKLTLRNDKLKGRKSLKNDKLNSIENIKDIAKPEDSAPKKRKIIKKIVQYQPNIHCSYCWSTSSTIWRRGYMGVLLCSKCFMNTSTDKNLASIATQSTSEVNEDDQDSEEIIIDVVNDLPSPPAQQDKFGYHGNYEDYIHQPYHTRNLPQLNCLKYDPSQIAESSVMANKAIHLETYGPTIYQAFSLDYKSTYYDIPGSAPRWASHSGSDYHGTWLPQTVRRAITRYTKEGDMVLSNFLGRGTDAIECFLLKRKCIGIDINPVAVSLSQKNISFALPPSLLANSEFKYHRPTIIQGDARNLFNILTNESISHVLSHPPYKDCVEYSNNIDGDLSKFSTNMEFCKEMQNVVNETWRVLKMGGRCTLGIGDNRDQCFYQPVSFDLLLLYMETGFQVEEIIVKRQRQCRAFGLGTFLCVKYDFLMFTHEFIITLRKVPITSRGSMSRNMKKSKFFQQ >gi|Hrob1000018333|ref|jgi|Helro1|102536 
MRGLMVAKLRQNYQDLCSHREGIDAPMESFNRWLLERATVDKGIDPLLPSQCSVPLSSSMFREVINDVPIRLSRPRRCEDARRMLSKYAEAAKSLVMKRGCNVENRKTVAWHADDTFSFMQKRGNATYDDYMDRLAHLKRECQPHVIEAAKSSVEGICLKVYSLSCVYVKKIYDKHLSVLSAAGVDVKTLPGPTSSSSLLLPHSLTHHHHRYPCSPIALSTPLSPNVTVEHHDSGEVVTLRLKLAAAGVANNDVMKINSMHFRKLEQLYMLNCRDDPKMEHFLHRTWCLLKRYNTFFGTKENEGFGLQGALPVSVFQCLNRSFGVTFECFASPLNCYFKQFCSAFPDTDGYFGSRGSILDFYPISGSFEANPPFNEELMEAMVDHFESLLSETPLPLSFIIFLPDWKDPPTEALIKLESSRYKRQQMTIPAMEHEYRHGFQHICQRKDLNVRSLHGTLVIFLQNDAGANKWSVNNDNMRELLYAYQLNNNNNNNGPTN >gi|Sarc1000008354|ref|SARC_08162T0 | SARC_08162 | Sphaeroforma arctica JP610 hypothetical protein (611 aa) MAAIAVPMSDLTKKGAVFTWKTPQQAALQKLLTLIMKRIKIYEPLPHKQLVIMSDACNVGVSAAVFVQIPPPGTAEWDEEQKCIAACKQYREALGLIPLTLHPQTPRLVISDQERALYDTTDKPETIYNPIDDLRTLPGFQTATPRNAASDLDWAHSVDISHRNTNDQTVKLTREQSGTLVHSVEFQHKGQSGPLAQDADQQGKEQSGTLVRGGVPQEDEQSGSLVHMNQVQSGSFTHTLDMETVSAFTHTLDMETVSANSPRDLRITIGKYRFLYPVAFYSSKLNPAQRNYGATDREMEESSKNSRIIRALNTINEVVMDVRYVQGPLTVFGDYFSRSTNHTDKLLQQKQRGRVEFGKATGAAASDTQPACALSFAPQQRRVTWGHIKVNQEHNGREALIYSQSHFEFPESYSLNLTWVDLIDQLFGPFDVELFASHEDHVMDVYCTADGSGTWGTDAFGVDLQTPSAVWFPLLMRIAQGLPLVIPHRDDTFLHRGRTISGLPPWEYTLVVILQGGCPKPPLLPDTFWHKIGLQRAPAFLHSLGYPLKPCRTMGTGVPGIPAGAGVPVATKTWKWLSTKKAPTPLHYAGDDRTPQQWKFGPGKRSHLEH >gi|Caps1000008447|ref|jgi|Capca1|112863|e_gw1.16.157.1 QVPAAIYYIPNFITEAEEESLIQYVNSAPIPKWTQLSNRRLQNWGGLPHPKGMVPEKIPEWLDSFGQRIGQLGVFDGQMPNHVLVNEYLPGQGIMPHTDGPLYFPTVSTITLGSHTLLDFYTPLNDRSSSFDDRHFASFLLERRSLVLVREEMYSRMLHGIKEVETDTLCEKVLNLDSSEHSLGDTLARNTRISLTIRVVPKVLKAKLFFGKKK >gi|Psoj1000018446|ref|125662 MKEPIPEVLQPIIEKIARCGIMDGDEPDQITVNEYLPGQGIAFHLDTHSAFTTTIASLSICSEVVMDFRHPDGVRNEGVLLPARSLAVMSGASRYMWEHAIVPRTFDVIDGKQVNRQRRVSITFRKIRSGLCECPFPKYCDTPGRDGQEAAGDDEQGSVAEETTSLAPTALEQQYVHEFYETVAAHFSSTRHSPWPRVAQFVSSLPSGSMIADLGCGNGKYMKCVDAAQSFVVGGDRSSRLVKICRDRGLEAMVCDALAVPLRSNSCDAALSIAVLHHLSTLGHQLAAVKELLRVLRVGGRGIIYAWAHEQMKGSRRRFEEGRQDFMVPWNLDKRFAFSTEESSTETDAAETSQEDEQAEESPKDSSEEGDDANAGDRSSSKVQARVVMQRYCHMFKQGELESLVALAGNAEVEESYYDESNWAVILRR >gi|Ttra1000008560|ref|AMSG_10032T0 | AMSG_10032 | 
Thecamonas trahens ATCC 50062 FATSO protein (561 aa) MSNWAALQRELLGSGKSKDEAGKSKDKGKGKNKSKGKGKGKGKGKGKGKETLTGSSGQDGRGAESGKRARKRKRDSNGDGIGDARDKRCGSDGKSNSKKTKKQKRGGATSRSEATLHLHLAMALGLPYPIEALPSPRPLFCTPSSHPKLVEKLMATSYAGAVIEDGASSLAASDHAKFHAAMKVLKARGWFKFDFVQPGKIEELGPTYVRRVVVGEPGMTYKYLNARVFAVPWAGPALEAADAAVAGALGAIRELNEVMIAKGNAHLEAAGTPGSTLFNLTLINDMDVLTGEGEGKPPPFEGLGRAVVSWHQDTSLEDWSTIGVYQLTAENETAPWHVGLRVIYDEVTPKLALPLDHGAMYFLLGDFNHHHQHVVVCGSSHRYSSTHRVTTTEHNHFDRVFPRLTRAVRKAKKLTKSPAALRALLPRPGSTWDLLCESFLVLEFEWIRQYWWHGATHHKLQRGLAYWPERVAQLEKAWAQLLALFRDVATLTASRPQSKLAAAFASLQADVAREREAWRARFAAVDAKVTAGEWTGVDLPIDWRRCPVAHGASSLVGATD >gi|Sarc1000008574|ref|SARC_08375T0 | SARC_08375 | Sphaeroforma arctica JP610 hypothetical protein (91 aa) MLSRLTATVDPIMEQVGGQGLNPRRRGTWRYPVARDSKPRTNKYKVAGFVFNKAEREWGPHTHDAFALPGNQLLPKYFSPSTLGGAVAES >gi|Bnat1000018648|ref|jgi|Bigna1|43015|e_gw1.71.34.1 MAASVGDSIAQYNLGTSYKKGSGVDKDFKQALEWYRKSADQGYSLASYAVALCYEKGEGVEKDWQQAVECYRKAATQGHSGAQYNLGVCYQNGRGIERDVKLAYWWYRKAADQGYVISQCNVALCYQRGVGVGIDLKQAFEWYLKAANQGHSGAQCTLGICYEKGRGVEKSWEQAVEWYRKSAKQGNRGAQNNIAFCYQKGLGVEQNWKQAVEWYRKAANQNDSGAQYYLAHSYQKGVGVEKDMRQAVEWYHKAADLGHSGAQYALGFRYKKGEGVEQDWKQAVEWYRKAALQGHSGAQCCVAACYKKGEGVQKDWQRAVSWYRKAAIKGHSGAQHELGLCYEKGRGVGKDWEQAVQWYRKAANQGHPGAQASLEKADGIALKEGCRTIRPNS >gi|Pbla1000008754|ref|jgi|Phybl1|68567|fgeneshPB_pg.23__178 MSLTTLKKTFKYLTVAYNGNVDYINNLIIPMEPPAHLTSRRQRKIWEQQQKDNAAKRHKPHHDPFRTLERHFKAPHPSMEDVIDLHAPHDKLIAVPLAHPLTSDVFGPQITSQTAYIVKDIPGLIVIPNPFSPEAQRAMIAQCLTDFARPPNTSNHDAFFKIPETGLWPLYVAEQSGAKSIPTVPPKEKSQGAFLANSLLSPTDMLRKQRWVTLGYQYHWGTKEYDLERDIKVPSTVGEMAVDVVTAIQGIGGEGWTNSYQANDFAAEAGVINYYQIKDTLMAHVDKSELNMEAPLVSASFGLSCIYLLGGPTKETPPVALRLSSGDIIVMTGACRKAYHGVPRILEGTLPDYLGSDAFGDQLDGELLGKWMNSTRINLNIRKVFPSSTSLETP >gi|Spun1000008764|ref|SPPG_09218T0 | SPPG_09218 | Spizellomyces punctatus DAOM BR117 hypothetical protein (195 aa) 
MECSKVPGLFLIDDFISKEEEDQLLATLDGRAWGGKGQKPNEELRRRTQQYGYLFSFRTRQVEEHLGPLPAFVDGVVERMRAFGVFAKEPPEYLLVNEYERGQGIMPHVDASTFGSTVTSLSLLTPCVMTFSKRDSGESVDILLRPRSLLVLTGDSRYNFTHSISKNQVDHYCGEPIERGRRVSLTFRRRAESS >gi|Pbla1000008773|ref|jgi|Phybl1|68613|fgeneshPB_pg.23__224 MEDDSLDWESLFGSDNDSVDWNSLFGSNDEKEDDDRYSTDIPGLELVREALDHSQQMKIIQAILDTNTFSDAGRVNQAMCFGKLPPHLDWLSRHIKNQLPHLLPRVLMQREPVFDQAILNLYRKGEGIVSHVDLARFEDGVVILSLMSSCVMTMRPVPKVSPADPSLEVDILLNPGDILALSGLARYEWEHGIKECEYDVVRGERIERGTRISVTLRKLGTTVETPTIETTATRTSI >gi|Caps1000018790|ref|jgi|Capca1|105332|e_gw1.669.6.1 MSSDRRRRARVQGGWAAPVPAAAAAAAGKGKEKAKLASPSPPAWLGKNVDHGPAPQKFLFKEPQEVILKYKLYIKAGVYDISSEPSGVARVRLFPSFIEANQCEWMYEQLFSELPWRQRSDVKSGVSYLQPRLTAWFGDFPYSYSGVRHEGNKNWPPILAMLKEKLEENTGCKFNSVLANLYRNGHDHVPWHSDDESQLGNHPTIASLSFGDLRLFELRKKAPLELRANLPEDYQYTEYVRVPLDAGTLLIMEGACQEDWQHRIKREYHDRGPRINLTFRVIHPETE >gi|Fcyl1000078853|ref|jgi|Fracy1|152506|gw1.4.648.1 MDEQYLIRDIEIPCVYHDSNFISKSDADDYYETLRTTIPWQKTAKINRWVRLYQAVDDVDESTIETTTSDGDEDGSDEGTSGYTYKDAPPVEDDKNKDNNVIGAGYPELIQSIRQQCQDWYAAANPHCKQDDNIPSFNICLLNFYEDGQQRIGWHSDREEIGRTTPIISVSLGASRQFLLRSQTDGRNDRCSLNLTSGSVTVMEPICQIKYLHSVPKESDVVNGRINLTFRCK >gi|Smar1000009144|ref|SMAR005333-PA pep:novel scaffold:Smar1:JH431599:48277:48814:-1 gene:SMAR005333 transcript:SMAR005333-RA MVFLLAERLGTLEVDLFASRLNFKFKKFCSWSPDPLAWKPHSENFIESSLGPSNGVISGASMDHTKLVRPPTRVPYSISLQLPSKRSFEARTSGGNKVSTRRKNDPPSMQNLRASLTDRGFSEDATALYTASWWDTTVASYTEGDAQWQEFCDKKQDS >gi|Bcir1000009260|ref|jgi|Bacci1|261852|estExt_Genewise1Plus.C_2670019 METPPVFKSKRQEKIWLNQKRFNENKKKDQKTYVNQAPFRYCERNFKSKVPPPDFTNVIDFKKQTHNTIENQDRIVSVELKHDLADLSSLFGTSTRQAYVLKNIPGLIVIPNAFAPEQQRYLIKQCLLKYPQAPNTSNLHTHYEIPSQGIWPLFEQQRNGKLNEADPDFYVQKKKIDSQDASTYSDSEEEEEEEEEEVMACSSVTACSDDFSPIIDGPKIDPPPAPGVPLLSPSELIRKLRWVTLGYQYHWPTKTYHLDRRYPFPEDVSELTRAVVTAVEGIGYQDKWRNTYKGEDYKAEAGVVNYYQYRDALMGHVDRSELNMDAPLVSLSLGSSCIYVIGGETRDTEPAALYLRSGDMVVMTGPCRRVFHGVPLIIEDSLPDYLSTSNNNNNNDDDDYKLYSEFMKTARINLNIRQVNTINNTD >gi|Fcyl1000039366|ref|jgi|Fracy1|257232|fgenesh2_pg.88_#_37 ; 
gi|Fcyl1000079933|ref|jgi|Fracy1|153622|gw1.88.28.1 MDEQYLIRDIEIPCVYHDSNFICKSDADDYYETLRTTIPWQKTAKINRWVRLYQAVDDVDESTIETIETIENGNQNENQPQQQKQTTASGYTYKDAPPVEDDKNKDVIGAGYPELIQSIRQQCQDWYAAANPHCKQDDNIPSLNICLLNFYEDGQQRIGWHSDREEIGRTTPIISVSLGASRQFLLRSQTDGRNDRCSIALTSGSVTVMEPICQIKYLHSVPKESDVVNGRINLTFRCKDFSSNNSKEDQTTEGEELHERRDTFIDRITNGIEPSTTPWTATTSKSSTINSTCRATADNTAFGSKPYLFGEEEEEESFDENNKDNILEPSNVQFLIKTNMGAERYCKAEIRERLFTLSCDNYMNIADHWKVITRPFNLDGYIAVVCNKSTTGDEDNDNDNDDNTGFDNDNQTTINDIRDTLLQLKTAHHVLKYHYHFHLKECKKFVHQYIEDGVDTNIDEMKVEQYPKETLYEHIKEELTNNIISFRPIQDLTNNPSKTSFRVTSDRVGGPHAWQTPEVEYEIGGAIAEAYEQYHWKPKMVDYDICIRADIIGPSCIIGTQLNIHDLSKGRHFTRFRNAVTIKTNLAYAMIRLANISEGSKIVDPFCGSGTLLLEALDIYNGRLKNCLGMDVSRRSAIGSRENADAEGYTEDIVRFVCSDARTLRRKVDGDNTVDAIVTNLPWGIMTGQNQSVSSLQTLYEVFLRNSWYVLKPGGRIVMLVLRGLQIMRIVRKLSGRFRLLHINVIRTTNNLPCIIVIEKLNDDIVRDSIKGQLAHLNQYVNVSPEIYKSIHCEEVDEDCEPTNNNYNNNKK >gi|Fcyl1000029493|ref|jgi|Fracy1|247359|fgenesh2_pg.20_#_150 MISSFFLPRRSSSLSSSSSEVNNNNKGDDNNNNNNDDEDRQQQQQQKGQLIIPTLPLVSIEVITQPSSQQQQQQQHDDDNNNNLNENRYSQMYYHSLDNNNDKDDNDNDNDETVTIKSKKNDDGGGGGEKGRTAIIDDDNNSNSKSKSKSKSDPDLNNSTNDNNCYYKYSGTSRLCLPCNRSRIDENFTINLCSAADQIITQLQMAAIVALKEEEQQQQQQQQQQKNDNDNKKFLENIRLRIIHQKDNKNNDSDTIITNKKEDAENDTIKIRKSRRKRIYNSCLVNWYKPDHTIGLHSDDEPEMDTVTYPIVSLSLGGPRRFILKSKQKQQKQKSQSSQSKQHQQQQQQSQSNNRTIQKNHEFILKDGDLFIMGGNCQKEYKHEIPKPQQQQQ >gi|Fcyl1000099591|ref|jgi|Fracy1|174063|estExt_Genewise1.C_270102 ; gi|Fcyl1000099590|ref|jgi|Fracy1|174062|estExt_Genewise1.C_270101 MIQGAGLQAGVPGSIMDILLKQFNCKKECFASPFNCRYETFSSAFYDIDFYFGSHGSFFEFEKEEGICYQANPPFCEGLILQLNNKITDILLSSQQHQHQQQQRPIMFVVFVPAWHESICYQALLSNKYLTQHLLLKQGQHWYAEGTQHRRKDSFRVASFDTSILFYQNEAAKRLWNLQKTDQK >gi|Fcyl1000049607|ref|jgi|Fracy1|223570|fgenesh2_pm.1_#_442 ; gi|Fcyl1000103291|ref|jgi|Fracy1|177763|e_gw1.1.174.1 ; gi|Fcyl1000105119|ref|jgi|Fracy1|179591|e_gw1.1.2002.1 ; gi|Fcyl1000088119|ref|jgi|Fracy1|162422|gw1.1.2002.1 ; gi|Fcyl1000064982|ref|jgi|Fracy1|137554|gw1.1.174.1 
MPKKRRKTQDVSQLSNLSNLTGFGNAGFGASSSLDDCLAEMSNTVSATTTTTTTTTTTTISDCIKGQNEDKKVESSLHNKSRSKTTREKQQPSWIRQAKISTAEGIDNWNRNFQIWAKGGIYHPGLLPNIDQEIARNFKVQELSTLLLSNEFVKGKDIRMTTFERWLLDSKHEEEEEDEEDNAGGGGGIVQGDPVLPLRSSPDSKASQRLLSELITCTSTNLKNKNNNKTSKKIPRQDAEKIIAKLCRTTNLTCQELLCQEDRYRKQSPLNKGDRINVSTKTTESSNVIVVSILYSRKRWKKPFCFKLNQNHYKLLKDRFMEIHAPSSTTLTNAPLFGGRNNNIMSSSDNTMTVLVERSFHVLVLALLLRYSALSGGQLLEDLRGGGMQGAIHSSVFDVLQSHFSKIPHNDKKSSSSTKQFWLEGFASPFNATLPRFASAFPDLDWHFGSVGRFLDCSFDEEYCEANPPFTPGIMLAMADHTTNVLQRADNDNTRLTFVVVVPSADNKKNTNKDEAVVKHEAQKSFRSMVSSVYCTKHIQLKAREHGYVEGAQHLRPTQYKQSSYDTSIIILQSPKAKKYGLDKTNMKQLEKDIRIAFASRHENEIMERKKMAASAM >gi|Fcyl1000019629|ref|jgi|Fracy1|237495|fgenesh2_pg.4_#_1043 MDEQYLIRDIEIPCVYHDSNFISKSDADDYYETLRTTIPWQKTAKINRWVRLYQAVDDVDESTIETTTSDGDEDGSDEGSASNQNQKQQQKQTTASGYTYKDAPPVEDDKNKDNNVIGAGYPELIQSIRQQCQDWYAAANPHCKQDDNIPSFNICLLNFYEDGQQRIGWHSDREEIGRTTPIISVSLGASRQFLLRSQTDGRNDRCSLNLTSGSVTVMEPICQIKYLHSVPKESDVVNGRINLTFRCKDFNNNNSSIEDQTTEGEELHERRDTFIDRITNGIEPSTTPWTAENAMSSTINSTCRAADNTAFGSKPYLFGEEEEESSFEDSDSNDILEPSNVQFLIKTNMGAERYCKAEIRERLFTLACDNYLNIADHWKVITRPFNLDGYIAVVCNKSTTGDNDNNDNNDDNTGFENDNQTTINDIRDTLLQLKTAHHVLKYHYHFHLKECKKFVPQYIEDGVDSNIDEMKVEQYPKETLYEHIKEQLTNNSISFRPSQDLTKNPSKTSFRVTSDRVGGPHAWQTPEVEYEIGGAIAEAYEHYNWKPKMVDYDICIRADIIGPSCIIGTQLNIHDLSKGRHFTRFRNAVTIKTNLAYAMIRLANITEGSKIVDPFCGSGTLLLEALDIYHGKLTNCLGMDVSRRSAIGSRENADAEGYTEDIVRFVCSDARTLRRKVDGDNTVDAIVTNLPWGIMTGQNQSVSSLQTLYEVFLRNSWYVLKPGGRIVMLVLRGLQIMRIVRKLSGRFRLLHINVIRTTNNLPCIIVIEKLNDDIVRDSIKGQLAHLNQYVNVSPEIYKSIHCEEVDEDCEPTNNNYNNNKK >gi|Pram1000009643|ref|73389 MDVEDFRRGPIPGVFYIPNWITQDEEDAVLERVYAVPDDNELWVRLKHRRLQMWGGEVKDPFEPKPLPQWLMQISQTLMDAGFFSEEKTPNHALINEYGAGDCILPHEDGPAYFPFVAIISTGAECRVTFELHRDLASTDNQGVSAATELVPHFDFQLERRSLLMFTGEAYRRYLHSIDNVEVGTRISLTVRHVDLR >gi|Caps1000009657|ref|jgi|Capca1|191192|fgenesh1_pg.C_scaffold_335000028 
MEKFKESFKLYKRKKPPPDYSNVIDFTKIDAEDRQVCSLILEKQSCDVLDGMIPFTSWKAYQHRKIPGFYFISNPFTPDGQRKWIKRCLNDFPLKPNITNLDVNYSDLIRERNVWSMHLDADSTDEEKQLLHKLRWSTLGYHHNWDTKKYTADRYTPFPDDLSCLSRCIAHGIGFPHFKAEAAIVNYYHLDSTLSGHTDHSEFDHISPLISISFGQTAVFLLGGLTKDIDPIALYLHSGDICIMSGECRLAYHAVPKILRTPTSELPYHEGDDDTVNGRKVEDTFEPFESYLQSSRINMNVRQVLCDGQEFPKDES >gi|Sarc1000009775|ref|SARC_09551T0 | SARC_09551 | Sphaeroforma arctica JP610 hypothetical protein (395 aa) MEGATVNYLHTSAQHFLTDIEYGKFPLRLANNKTTIITTRGTLPNLGLAYLDHTLTQGIIAQSHLQDSGCSLSFPGDIMSCHLTYTGTKLELRRRNGGYYIYYTALQNLFHSPTIRNIPQESLDIHTAPNTTSNTIMIDPPAPPTHTTMTYDEFHRSEGHRIRRTTLRMAKQKHITLTHTPSKYVQCEAWMLDRSIFASSMIFLGYTPKMDLFAAKHNVQVPHFVSPEGGGIATDWRQIDFTNEPGLYGNPIWPDIKELIDKNTKAGTTLAIVVPLRPTATWWDSFLAHLQSQPYIIQNTTSIFRRHSATIVGKPSWLFTLVALLGPISSHVTPGDDLKSAIDYVRFNPEFPKNCRVCVEGKMKAKHVSHCDKYASITISQRTPLIRGETLAIY >gi|Ttra1000009841|ref|AMSG_11724T0 | AMSG_11724 | Thecamonas trahens ATCC 50062 hypothetical protein (232 aa) MSAEASTSTSRKRVRDEDGDGFTRDDGGRTGKAARGLDGEAVPGVPGLTVVTDAVSPEDELALLAQVDNGEWMTSLKRRVQHYGYRYDYRSRKVAPESYLGELPEWSKQVMARLGAAGDGEFDQLIVNEYEPGQGISAHIDCKPCFAGVIVSVSLGSGAVMRFAKDDVVCDVWLPARSAVVLTGPARWEWSHSIAGRKSDKVGGVRIKRTRRVSLTFRTVRLAGSEPATGE >gi|Fcyl1000129851|ref|jgi|Fracy1|204323|e_gw1.113.10.1 ; gi|Fcyl1000129857|ref|jgi|Fracy1|204329|e_gw1.113.39.1 ; gi|Fcyl1000088406|ref|jgi|Fracy1|162728|gw1.113.39.1 ; gi|Fcyl1000066919|ref|jgi|Fracy1|139674|gw1.113.10.1 MKAKRYGSDTKLSNVISFLRRFGWLHVTVFAVVTVSAFSGKTGTVGHRRNKKSPSVQVQQQQQQQPSLHTKLQNCFTPTSILENIAVLVTPKVDPSASLSSLALIRLSKQIIALDNENNEYLINDNNKQLWKEGLRNLVSCLASSNWKASPKALETAVEGVKAASVISRLVSSDYLISSNNDNNNGKVWWEPLVEKLHEEADDQLVRMIQPHQLSGIKFSIDCIQLSSSTKQDASDLLSSHRQQYLLPQSLQIAYDNLNLPFSVRPGFLNGNDDDDDDDVDNKHNNNLFTVASFVKQVKFQIETIQTATNRTVAERRQTAWEGDEHVENFEYSEKSMRRLPWSDVVANVRDRLYNETSHYYDGCLLNFYPDGDSAMRYHIDPDQGVLWDYETAVVSIGATRRFSFRESSSGNGSNKPHVFVLMNGDVTEMFNDCQERFQHTVQKSSVKGESASRVSLVFKKTLGYSNYKRENKISN >gi|Fcyl1000029925|ref|jgi|Fracy1|247791|fgenesh2_pg.21_#_237 
MGKSRSKRKSKDKLSEENISQPQQLPSTTMTASLSLTNGDNNNSVHLPQAMPQHLPRPFGDNFLKDESPYRKSFREALTTSYEGFVFDDAATLMSQPTAINNVKEDVVQNSLESMSRGGIFRTDVTQPFGLGTKCAKTYVTRCLVGSPGTTYKYLGLRMFAHPWTTSTTTGEDNNNNSNMKRNDNERRVCHTVVTNDAQTIQELALALTNRTKKHLRDLDESRRQRQPMFGTRGRPGFDICLINRMESSSDLKPYNFSGDSSTSSSNGKSGNKNGVKTTVSWHADSSLEHFSSIAVYQTILGSQKDESNNNSNDKRKRQRTDCQKQKADEEEEGQWLVALRVAHHSEGPQASQQRRRGTNTETATVEETPPIAVNLPSGSCYYLLDDFNHHHQHTVLTTGNTSTDWIDRGKSAIRQFHKKGSRIWRSEQLLLTEIESEWIRQFYIQGMGHHQLLWESYWKDPIQELLSIWSRLEHRTEQTIELLRAAAEGKCGVGMNTEKAADKPTKAERKARDRRKKSLASIRELVSRINETPEEDGATAFTELYQPMAELLEERAEMRSKWEKREKDHVFHELPLDYRPMKVPFKFERTIDENNIGNEYVATSPLPNSPDKLKEIAAQLLQLGRAYRNGDAKQLPPPWKKEKPKQDALAEGDSTVDDHSKPLDWSGWNACAQLFGLELQHPWAAAIIDGEKVIETRSYSLPPSLIGGTKVMIIESSSGKAGVSSLCNHVDFSTSSRKKGTGNSKVIGWCTFISVKTYTTKQEFQAEENLHLVTPDSGYGWKDDGSTEKVYGWIVGERYRFDESSTTDEENFLYDSGVRRFRSLFQLHKKKSKDAYPNTSNKKNINKRNLERENKQNSNGKNKKRGRY >gi|Sarc1000009974|ref|SARC_09749T0 | SARC_09749 | Sphaeroforma arctica JP610 hypothetical protein (333 aa) MWHRDEFATDPRGRHQGRWQEVCRNIDVLLLHVRVVDNGPVDMLSRLTPKVDPIMGQVGGQGANPRRKVNWRYPVAREAQPRTNEYKAAAFAFNTAEKAWGAHTHDAFALPGNQVPKYFSQSLEGGASAESWVGRNMWVNPPWELIPRVLAKVVAEHLEITLVCPYMPKAKRWDPMTRMRASQSQYRCSMVFSCGMGMKLRVPTMGSDAALSDLVQQALPGVVGDCPTAPALVSRPERALLVPTPAESDQLAAAAEVEEREARQQRAARAKRRVAQKLAGKALLARQGQSPYALRGLTLSRPEVRNVVGAAHNVYMHTGAEKFLGKLEELYG >gi|Fcyl1000039992|ref|jgi|Fracy1|257858|fgenesh2_pg.102_#_45 
MGKSRSKRKSKDKLSEENISQPQQLPSTAMTASLSLTNGDNNNSVHLPQAMPQHLPRPCGDNFLKDESPYRKSFRKALTTSYEGFVFDDAATLMSQPTAINNVKEDVVQNSLESMSRGGIFRTDVTQPFGLGTKCAKTYVTRCLVGSPGTTYKYLGLRMFAHPWTTSTTTDEDNNNNIHIKRNDNERRVCHTVVTNDAQTIQELALALTNRTKKHLRDLDESRRQRQPMFGTRGRPGFDICLINRMESSADLKPYNFSGDSSTSSNGKSGNKNGVKTTVSWHADSSLEHFSSIAVYQTILGSHKDESNNNSNDERKRQRTDCQKQKADEEEEGQWLVALRVAHHSEGPQASQQRRRGTNTETATVEETPPIAVNLPSGSCYYLLDDFNHHHQHTVLTTGNTSTVRYSCTFRLLRDSHNIQDWIDRGKSAMRQFHKKGSRIWRSEQLLLTEIESEWIRQFYIQGTGHHQLLWESYWKDPIQELLSIWSRLEHRTEQTIELLRAAAEGKCGVGMNTEKAADKPTKAERKARDRRKKSLASIRELVSRINETPEEDGATAFTELYQPMAELLEERAEMRSKWEKREKDHVFHELPLDYRPMKVPFKFERTIDENNIGNEYVATSPLPNSPDKLKEIAAQLLQLGRAYRNGDAKQLPPPWKKEKPKQNALAESDSTVDDHSKPLNWSGWNACDQLFGLELQHPWAAAIIDGKKVIETRSYSLPPSLIGGTKIMIIESSSGKAGVSSLCNHVDFSTSSRKKGTGNSKVIGWCTFTSVKTYTTKQEFQAEENLHLVTPDSGYGWKDDGSTEKVYGWIVGERYRFDESSTTDEENIPYDSGVRRFRSLFQLHKKKSKDACPHTSNKKNINKRNLERKNKQNSNGKKKKRGRY >gi|Wseb1000004329|ref|jgi|Walse1|61194|estExt_fgenesh1_kg.C_210042 MISSEGDALLRYHVMKRNNEEEYLAKLPEDKKQKRKVYGQRYSLTDEESIRNDYSMAYVNQLVRPQDQIINPHREGRFAEYPRQKQLLKLKDSLVEEYAYPPVYLPFDLVDAIPKSPTTPSSLFDAIDIGTKFDVIMIDPPLPNSRSSKLEGRQWTWDALATLPIRNLSADPSFVFVWVGSGGEDGLEKGRELLAKWGFRRAEDIIWVETTPEGHKEEDTSDVPDNLFKRTKQHCLMGIRGTVRRALDSHFVNCNIDVDTIIAESASRKPSELMALIETFCLGTRRLCLFGEPENARRGWLTVGMKGSNNETYSPPEGVDMFDSKQWSERFPNNKANLVPLTPEIDSLRPKSPNRSNNNSRNGTPKSDPRTPPFATRPVGSPYGTPPNMPYNPQFNAQMRYHMQQQMQQQMQQQVQLQIQMQLQMQAQQAHMQMFGGGNLYQGVQAVPQMYYPPVNTTGTIYPLSPQMPISLGFDGNTNNRRHAKTPSNSNQNNINRNRN >gi|Wseb1000002161|ref|jgi|Walse1|60123|estExt_fgenesh1_kg.C_70035 MESAIMNILDNTSGYNINQLLRRLIFNYPDLGLTFNDLDKVSSKLIELGIDLPTTSYINNKDTLYDSYTRSVVPEDDYVKICSNTTRTECTEDCQKVHFQPIIRPYSDHALGHCSYLNTCYPFYNNAPPTLSNAFQPAKLNSPRLDRTCKYLHFQLESPSESAIEQADYQTKRRKKCRGDGLRQELDTILGSKRYPAQYINCDLRSFDYNTLGKFQIIVADPPWDIHMSLPYGTLTDDEMRKMPMSTLSEEGTLIFLWVTGRAMDLGRECLSIWGFKRVEEIAWVKINQLQRLIRTGRTGHWLNHTKEHCLVGMKVSDPDASDIQWPEWLNRGLDTDVIVSEVRETSRKPDELYGMIERCCPVGRKVELFGRRHNGRDGWLTLGNQIGEDEVYDPELSQRLNERYPEKGKLVVGR 
>gi|Uram1000008906|ref|jgi|Umbra1|259486|fgenesh1_kg.75_#_199_#_combest_scaffold_75_134233 MSSREDTPISVDADDFALDNVTDSSLKALLQEETKLKAQLDALTTEIAKLESQINPAPKEEDDEEIDMEEFEAPQWCVPIKANVMNFDWDALAAETQFDVILADPPWQLATHAPTRGVAIAYQQLPDVCIEEIPIPKLSKNGFIFIWVINNKYAKAFELMEKWGYKYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGEDPPGCRHSVGSDVIFSERRGQSQKPEELYEMIEELVPNGRYLEIFGRKNNLRDYWVTVGNEL >gi|Uram1000004716|ref|jgi|Umbra1|238969|fgenesh1_kg.24_#_143_#_combest_scaffold_24_50604 MADRSRRKRKSRNTVASNIHYVGYVEDEESVEAIMKKFEELDRIQKEFSAISVNSTPNTSSENKENEVESNGTPAEMEEKKSEASSQGLTEAQLQEIFKRTSAFTIRSAMMDSNDMDELNDVEIWQLQYHDGLTDEIYEEDDYIHLDMEDDDFWDMEFGGDAKQKKRMRRAPAPVREKAPSGGRRGIDRESIIAKYKIMQVRVQDRNGNFFMVKKKVSAIDPSLPTYVKIPGEPIPRSWVHTILHQQVPTEDIASTIGQKYLKKNILDMDLKELGTNFQAVYMDPPLLLPGEEPCPGKITIEQLASLDIPSIVPKGFLFIWLEKEWLPDIVRICEKWQFKYVENFCWIKKNLNNQISKKPYTYFNKSKLSLLIFRREGDVELRHQRNPDCVFDFSKPRVPGELTELKPRFLYQIIETLLPSSTFHPENNPDGNRLLELWAKHGQKRLGWTTVVDES >gi|Uram1000004450|ref|jgi|Umbra1|237704|fgenesh1_kg.22_#_168_#_combest_scaffold_22_46365 MGQLDDSMDTLSSHAEDQGGYDFIVIDPPWPNKSAHRSKNYDTLDIYALFDIPMAKLLSSDALVAVWVTNRPKYKQFLIDKLFPSWNLELVGEWYWMKMTTMGQPVMPLDSTHRKPYELLLVARSKAGSSIIHVPEKLVFASVASQHSRKPPLNDLFSLFDSNHPDRRCLELFARCLAYNTTSWGNECLLFQHDSYFTKTSSAPDTP >gi|Uram1000001485|ref|jgi|Umbra1|223403|fgenesh1_kg.5_#_767_#_combest_scaffold_5_103034 
MEFNLTRFEKSTTEEKPPKKDLLSLSRKRRIGSEEDNSTKRQALKPSNGAVQGSKCNVTDHIADKLQDSHISKQPNKPYCDKACAHGEGSSKHHQIHWGFMMKDKPKVKKEENKKVDSDEEFWKKAAEMAPEDDVQSEDEDMIETLPGTPSFDDMPSPVVNTLLEQLEAQPEEEDDDTLMIPDDDTDTEKEVANDRRSPEPSEQDIPANDEEETEKSSVVRTKVPGENTFSHDCVKTWSPARIHAWEIRRLNPEAFYYRFVDPAQGQSNGSFTKTDHDNFMKRMEEWKEKGYRIGASWGIFSMGIPHRAGYQCSAYYRKLIQSKKIKDDAYAIVDGKLKMVDKNRTNAGEAATSALSSAWDLDEVKEIEREVDQWLKEYHNRSGSIVAKKPAAPRPKAAPRVHKSTATKSGDIALLVKQNIGVGNFRLLSDEEAFMLANEPSAITLKRRNWDLEWKESLQEYRKFVEGFQDPDVRRKYHELKKQWRKGLITTELNMSLRRQTQNTPTTTVAPTTSAPVAKKPTQLSLANFFSGVKKVKPKTDTPDDLISRYIIPNDLFSGVKQMRGLYNSKRASATEPKTAYWEMKDMNDCVSVLDEVMSQTEAASSQSPIEGMLIDPPWEFYVADGRNDGRCTWSLTDMMSLMENVLQHMSAGLIFMWTHKSTQADVVRMMYSLGCTYVENLVWFKKTLSNVLQDRPSPYFSSSKEILLIFKRGEGFELRHQRTADVIIDFERPTEQWIHEEYTELKPPGVYDMIETLLPKAGFDEALGRGRFLELWAKRQAPRRDGWLAFHHIKTEVDTDAKQIHQEESMDTANNETEVASMADQN >gi|Uram1000000276|ref|jgi|Umbra1|217644|fgenesh1_kg.1_#_1214_#_combest_scaffold_1_3769 MADPSYRWFDEVPPRKKFKQQHSGIPNHNYPHESRQRVCPVALDELLQTETTSEEIQRLTWKSSLDGFKSHCEHGTKQECRRRSIENRTCDRLHFKHILNSHTDSQLGDCSYLNTCHRMGSCKYMHYCLDATAEELKPILRPTLSNPLMLQRKKLLPPQWIKCDIRKLDFDVIGKFTVILADPPWDIHMSLPYGTMTDDEMKDLKISKLQDRGLLFLWVTSRAMELGRECLMNWGYELVQELVWVKTNQLQRIIRTGRTGHWLNHSKEHCLIGRKGNDPFFNVGLDCDVLVSEVRETSRKPDELYDIINRLSPGTRKLEMFGREHNLQPGWLTLGNQLKDSRIYEPQLVERYNAKYANHPIRLFSLPHEYR >gi|Ttra1000008696|ref|AMSG_10222T0 | AMSG_10222 | Thecamonas trahens ATCC 50062 hypothetical protein (556 aa) MSTPSVASLYVGYAEDFESIESIMRKFEALESVKAKAAAEAAAQAEAAEVSAAEATAIGEVVADGGDEDRGVDTTRASMPAQSGSQELTQAQLEEVFRMTSLQPVTLLQTDFVFESGDVAADKAELDRFLEQNEAFFENSESDDDDAAFVAVLEDDGWWDLEYAAKPKKRKRKRSQSSKMEAKMEARRAAAALKKARRLEAKARKMEEARLRREMRAQKIRERKAARAAAKKAKASVPKPRTSFMRCELLDEFGRKFVFKKREKDVDPRQPQYIRLPEYPQLRTWCHRIRPAGQPPSSLPPDLMEASHLVGLSEHGRSPGTVRVADILRTPWDVFGEGYEGILLSPTLRLDSEPAFPNSISISQLEELRLSDALFPGGLIFVWTDKLTLPHVVRIFEAWGLKYVENLVWIVLNRNHKFHESPSSVFKSSKLTLLIFRKCGRSLDLRHQRNTDVIFDFMRDPELRKKPTQVYETIRTLLPYSRLLNVWAPPTALDQPIPTWEAIVDVSVVPPTHPMDTLDVIDVPPASVPLVASVLPPSRDTPLPMAVAPVATGAL >gi|Ttra1000006356|ref|AMSG_07188T0 | 
AMSG_07188 | Thecamonas trahens ATCC 50062 hypothetical protein (1237 aa) MPTVAEMVEEERGRRRRRGKVVVYNEEALLKSQWRDSEKMKKAMKVEEIEAKKVRRAHARRRAAEKKKKAAERRRKAKAAAAARRKKKAAEAAAAAAAAAAAAAAMDPGDAATLQDSVGSTVGGQKVDAGSGAQASAAGMGAGSSGAASVSASASAASTPAATPSRDSRNARHPRHPQHGYVIYDDYGLPIVVDQLPKRVRKRAPAAVTSAQAAAQNRPMVARKLLDVPSPPTTPGRRAGLARPDRNSPRLSAKASGSAPPSASKGKGKSKGKSKGKGKASVALGANGSGGPPMKDVLRRFKRWTKTYLTNAVDVLPQTAMPSTVIVGENAVVRHGSSVFLDRITAYNEAELKDDPVALALDKALSCFSEDNPLTTLPGFSPRSLWYPSLSEVLDAAFGGSVEQMARSLSAVDASTRAHQAALRKGLIRPRTLTATVSAPRSSAPEQALAPMDVVTAPPQRSQPPPVSTRTRAVQTEITHVQSKTARRWDDVQRVSSELDAGIIPKNVYADQLNPKRTFEHFISRSPTKTATRPTAPLATIAATVATAAPAAGEPKDSSVNTLTSAMNPAPSDALRNPAALSTGKRGRDEADEATGTAAGHGPGPVPKKAKPMEGPQLKLSAPMAAPAPGPVAETPAPSMTVAEMVAAATAAATTAAAVLGTGGPAGRSTVSATPAGAAVPPRACIASTASVSSAPPLAAESRPPPPPPPPPPSLPLPPPPPSVPTGVEAAMANEDGGPKMMLKGEERWTSGTYVNCDVRYFNLKCLGKFDMVYIDPPWRIRGNQVMPEDGHIFSNSQRRRNYDTLSNTDIYDLEIGELVDTGLIFMWTVTSQLKVALQCLEHWGFDFIDKITWVRLTHRDNVAMGLGYYFLHSSEICLVGAKSRPGARLEIIPKISNDLLFARVGKPSEKPVEMYDIMERMMPGGRKIEIFARNNNLRRGWLSVGNLLGPNFEYVKDRVMCDECAAGIPIGTRRFKSRSVPNRDVCAECFAGTGEPAHNYFELEHAIAVMVFHNNYTCDMCGMNPICGIRFSCSRCEYDLCEGCFDFSVTTWAAAAGAASPSAAEPSIFHTPDKKSTRVRSATHDPAHAFLAYEDMDDSGGLPKHHARCTSCFDFPLSATGSSASTANGSRSAKSASSAMRLLDRPRESCICPRCVIDVHATERALVRELPSVRVADRALVLAQRAASAAAADEIIHDAVSDVAQGVAADAVADAVAETMIDDAAAAVAAESMAG >gi|Ttra1000005250|ref|AMSG_05730T0 | AMSG_05730 | Thecamonas trahens ATCC 50062 hypothetical protein (304 aa) MTSRGEGEDGAGPNLEAGTVETMTTGQTMTTDMVTAGPEASMLVADEELKSLDVDTLARLLAQEEKEVEAIEGDIKRLNAQKRIPLATGKNKVEFLKSSFDEIDWESFEAPEHCVPIRADVRLFDWAALGAQVQFDVIVMDPPWQLATSAPTRGVALGYSQLRDEDIMRIPVPKLQENGLLFIWVINARYSFAFQLFKEWGYEYVDDVVWVKRTVNRRMAKGHGYYLQHAKETCLVGRKGADPPTLQSGVVSDVIWSERRGQSQKPEEMYEMVEKLVPNGRYLEIFGRKNNLRDYWVTVGNEI >gi|Spun1000006218|ref|SPPG_06562T0 | SPPG_06562 | Spizellomyces punctatus DAOM BR117 hypothetical protein (358 aa) 
MGPHMGCDLENEPELLELSANNPEKKEKTEDEPEKPAAQIIVRNDYMQNFINTSLRPQNYIREVHPLVRFTEYPKLQHLAQLKDAIVDAYATPPMYIKANLRREGLGTLLGGIRFDVILIDPPLREYCEWSPSTIINCPDPVRPYWTWEEIGALAIEDVAATPSFIFIWVGDCEGLDQGRSLLRKWGYRRCEDICWVKTNKTWPGTPLMAPPSVFQHTKEHCLMGIRGTVRRSTDGHFIHCNVDTDVIIAEEPEDLTTRKPEELYYLIEHFCMGRRRLELFGNDENIRRGWITVGLGLSTTNWDRDRYLSWLRGEGAAQGTPYVRGGLLVPTTPDIEAIRPKSPPLSVRKGPRRGII >gi|Spun1000005457|ref|SPPG_05756T0 | SPPG_05756 | Spizellomyces punctatus DAOM BR117 hypothetical protein (454 aa) MSSRRSTRKRKCNTADISSSWYVGYAEDGESVEAIMQKFQELERMQQELAAQGSSTPVSAPTPEASSVFASGTNSDADADMARAIALQEGQEESTFTQAQLEELFKRTSCFTVKQATLDLDPDDLDELELWRLEMEGGDDDDWEENDNHILDDDMWDDEFGPARSGRRGERIPRARSGGLRSKLDRESLIAKYKVMQIQMQDRNGNFFVMKKRVCTVDPRLPTYIRIPPVPIPRSWVKLITSYAPPSGDIEGCRYFEDDILHFNMKPLGNRFQVVHMNPPFLMPDEEPTSGKISMKQFEKLDIPAIIPFGFLFIWAEKELTPDILRATQSWGFRYVENFAWIKRERSNKIARQSSRYFNKSKTTCLILRKERKEGDVELRHQRSPDCEFDFIKPKLPEDLTEEKPNFVYDVIETLLPQAVYGPANQNGDRMLDLWAKPGRRRKGWTMVVQKRS >gi|Spun1000004502|ref|SPPG_04738T0 | SPPG_04738 | Spizellomyces punctatus DAOM BR117 hypothetical protein (418 aa) MSQIVYQSDFGWVLRPHLYTLGDAWHWPASAFDVLAPYTARTKQHVSKALPVEQGDAGVDAPNVRKRKRSKQVKSDHVNKQNGINELHSSICHWLSEAHASLVLHSGTFPLPTTAAVADTSSGASIDDLDFIKFRELADIADSSKHALESEETDQACGIVPVDDVSRELDISSLYHQFISNDSDECKTLPVLGHSFIVPPHSVFLMSDLSQAQLLRSLNPFDFILMDPPWPNKSVRRAGKYQEIDIYDLFRLPLKHLIKPGGCLAVWVTNKPKYQRFVRDKLFSACGLAYVAEWYWLKVTLKGEWVVDLDSLHRKPYEVLIIGRAQHASNSHPDPVASLPTRRAICSVPSKHHSRKPLLDDAIAPFLPRDAQKLEIFARNLLPGWTSWGNEVLRFNDVQYLTQTTEGYLAPPPTELG >gi|Spun1000003158|ref|SPPG_03319T0 | SPPG_03319 | Spizellomyces punctatus DAOM BR117 hypothetical protein (581 aa) 
MDGHTDVESSANADNHGLLSMSLKRRIAERKAKGVMLDASTSFEGRVRKTSLLRRPSPPRRIQSPGDTYTNKPNEDLLASALLLSQRDLEPLVQNFLNTSNCIEWLPMDSHTLLRKYAGVTVSITPTLIIMIEKFFITSCSLIEFTPTLAMAHYNWKNVQSLERTLRALESRDGEPYLQLDIALVGSGKRVAVTRVLSKSTCPATLYPPSTRHRQLEVDWVHEVFVSLHPEPPGIESIKELLATPSFKGTANGKIWNELHNLIHRPTAKQNLIQEKFKNREDVEFKEFCEWGLRTDCNKHQHSTPCPKLHFRRIIKPQTDVSLGDCSYLNTCHRLDQCKYVHYELDEVNVTIDIDRPLPVLQIGNPLPAQWISCDVRKFDLSILGKFTVIMADPPWDIHMNLPYGTMTDDEMKEMPIQDLQDEGFLFLWVTGRAMELGRDCMAIWGYTRIDELIWVKTMQLQKLIRTGRTGHWLNHSKEHCLVGVKGGCHLQTGGIDGDVLVAEVRETSRKPDEIYSLIDRLCPGTRKLEIFGRPHNTRDGWMTLGNQLDGVRICEKAVLERYNKLYPATPATLFVDRMS >gi|Spun1000000693|ref|SPPG_00725T0 | SPPG_00725 | Spizellomyces punctatus DAOM BR117 hypothetical protein (357 aa) MSKMPPSRGPMAISSDVDVLTSDSEVLNIVESDGDGDFAPSRPGKRSQRVSKIVSKTSVSKSRTKSKSLKSTSRLSRPEIVRASSVVSTGSSASGTASADGKEDAAKIGPESSLEELLKREEHLQLQIDILMEEIKVLRGGEKSNAVGDQAEEEEIDYSNFDAPEWCVPIKANVMTFEWDRLADACQFDVILMDPPWQLASHAPTRGVAIAYQQLPDACIEELPIQKLQKNGFIFIWVINNKYVKAFELMERWGYKYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGRKGNDPPGCRHGIASDVIFSERRGQSQKPEELYEMIEELVPNGKYLEIFGRKNNLRDFWVTVGNEL >gi|Smar1000012137|ref|SMAR002698-PA pep:novel scaffold:Smar1:JH431216:18104:19856:1 gene:SMAR002698 transcript:SMAR002698-RA MGDCLKLLKERSQKRRELLAQQLGASSVEKLSEVLGNENGKERNKEESSDKKESKDTGDRPKKRLKDDEDDEEYYASYELPEAEEDIYKDSSTFLKGTQSANPHNDYCQHFVDTGQRPQNFIRDVGLADRFEEYPKLRELIRLKDDLISTTATPPMYLKTDLETFDIRELGGKFDVILVEPPLEEYQRSAGVTNTQFWSWEQIMKLEIEEVAAQRSFVFLWCGSSEGLDLGRKCLRKWGFRRCEDICWIKTNIANPGHSKNLESRQVFQRTKEHCLMGIKGTVRRSTDSDFIHANVDIDLIISEEPQYGSLEKPVEIFHIIEHFCLGRRRLHVFGRDCTIRPGWLTVGPDLSNSNFNNETYNMYFNNGPTDYLTGCTERIEALRPKSPPPKMKGAGAGGGGGRGASARGNSGGRGRGGFSARGRGSHRGR >gi|Smar1000010256|ref|SMAR004352-PA pep:novel scaffold:Smar1:JH431477:154003:157599:1 gene:SMAR004352 transcript:SMAR004352-RA 
MGERKGVNKYYPPDYRPEKGSLNKYHGTHALRERARKLDQGILIIRFEMPYNIWCDGCGNHIGMGVRYNAEKRKVGMYYSTPIYQFRMKCHLCDSHFEMKTDPQNLDYVVVCGARRQERRWDPTQNSQVVPEDKDTTQKLVTDAMFRLEHGEADKRKGKEAAPSLARLEDMQNRWKDDYTSNKILRQIFRDKKKEKQEKEKNNKLLLIKSSLSIPLLDENPEDVKMAKLLKLTPLQSYEERQREKRIEIDSKPILPQLVKNLNQNQKDLKREELVRIKNCKNSTETRISLVASEYNDSGDTNSYEIKDIFFDIRSEYLMDTQALARVSTTQRKRRRKIERKNEFSIYHKEVCDLIEKCAQSWKPECLPGKPNLEEILLNNIFARTASKIIDDNCRLSQLCSRNMMKEEPLIHTITRNGHVEPNLKFVNKLITHDEDYACVIKWDSKDFLIPANSSFLLSNDLNILTKTNRRYDVIIIDPPWTNKSLKRKKCYNTSSEVNFLSLPIKDLASKNCLIGVWTTNNSYLINYVKSVMFPHWGVEYETDWHWLKVTRYGEFVVPLNNHTKKPYENLILGRMTGSMENITSNLIFVSVPSCFHSHKPLIRDLLKNFINKEVKGLEIFSRSLCSGWTSFGDECEHK >gi|Smar1000006443|ref|SMAR007641-PA pep:novel scaffold:Smar1:JH431796:151221:153694:-1 gene:SMAR007641 transcript:SMAR007641-RA MSDTWSDIQAHKFRQSSLREKIQKRKKEREEIVNSIANDLSPTVTNKANTVESRSNSPTPIITKAKDSSESDSKCDPDVESKLLLCLCDVALNLPTDSRALGSLVSKALNREASNKIVENLLQKFAAQELISLKDGFTPDGKSCLNVTSAEHTKLTAVSNDLIGIQSEETTKVGKKRKHDSLHDDETNVKIVKDNLKKDKKDESIESLLSMPSIREKENKKMGEEILDLLSKPTAKERSLAERFRSQGGAQVQEFCPHGIREECAKVSGNNEPCRRLHFKKIIQKHTDESLGDCSFLNTCFHMDTCKYVHYEVDYYGAQIGEKFRQERELVAPKNLCGEKDLATLHPPQWIQCDLRYFDMTILGKFAVVMADPPWDIHMELPYGTMSDDEMRQLNIPALQDDGLIFLWVTGRAMELGRECLKLWGYERCDEIIWVKTNQLQRIIRTGRTGHWLNHGKEHCLVGVKGNPKGINRGLDCDVIVAEVRATSHKPDEIYGMIERLSPGTRKIELFGRPHNIQPNWCTLGNQLDGVRLMDMELIHDFKKAYPDGNCMVPMAKS >gi|Sarc1000000133|ref|SARC_00130T0 | SARC_00130 | Sphaeroforma arctica JP610 hypothetical protein (361 aa) MLGSVADDTKEMDSYSDDNELSSCDASDSSEEEPELASDVDELIDEEIRTVQEMVLSKRKIKALKKARRVLTEDEVVTEESTTLCELNSGLSKEELLASRSARRVLRRNHEAKRDQTAEDETNLHNSNEPHATNNGAVATAENADVAGSCSESKKTSDSKSSTQNFTPPPFSVPISADVTSYDFKSLAQLTKFDVIHMDPPWRLANAKPTRGVALGYSQLCDNDIADMPVECLSDSGFIFIWVINNRFEVGLELMTKWGYKFVDNVDWVKQTVNRRLAKSHGYYLQHAKETCLVGFKGDLNYVKSTTHTDVIFAPRRGQSQKPEEIYHLIEALVPNGKYLEIFARKNNLRDYWVSIGNEL >gi|Psoj1000016503|ref|144700 
MEPKVSSVNHAALVNGYYMGRLCLREDSLQIPATAFIRPSIAQAPGTLSAAAARRHRKRETAARRRSERLGELTAQGKFVPLSDQVRDALQNAFERFGGPRFVLRDFLPAFQDPSDDIQKVLGSLPVLSDTEALDDGDLHCNDTDQVRVSISVSRAKRYDMFDHTELLKIDVPHLADSDECILAVWVTNRPRYMAYLREQALPAWGFTYHASWDWLKLSKKRLGEHKYF >gi|Pram1000002112|ref|82533 MWLLNRIGLAAAGCSAALLSCGATASEPRCFPADFLFGSATASYQVEGAWSEDGRTPSIWDDFCREQPGFECANVADDFYHRYADDIKLMVETGLQSFRFSVSWSRVMNWDPETRRMQPNAPGLVFYHALLDKLVENGISPILTLYHWDLPIELHNELTPQGWLNPDIIDHFVQYSTLLYNEFGGKVDFWTTFNEPLSFVVYGYNTGLHPPGLHDSPTLVYEVAHKVLLSHAYAVQKFRELKSGGVIQPKARIGIVLNANQFYPLDASNPKDVEATERAMNFEFGWWLLPLTTGDYPPVMRERVGDRLPRFTVEQAAVLKGSYDVFMLNHYYSRAITDCDSETSNTPCSSLHVGFGQDKGVDDTHIVPGSRPGLQDSQGNNYCKSYTAFPPGYLQTIKWMHAKDPSAEILLTENGWCGNDEVENLDQLWFFQSYTEQVYKAVVEEKIPVIGYTAWSFLDNYEWGSYGPRFGLYYVNFTAQTGSVEGYEPKPTDLERIARPSAKWFSKLASTGCLDELSQADETAAIAMRATTRQELSVPVRTKPMEPKVSCVNHAALVNGYYAGQVHLREDSLQAPSSAFVRPTGSAQSLEKMSAAAARRQRKRETAARRRAERLDELTMQGKFVPLSNGARSVLQGAFDRFGGARFVLRDFLPSFGDDRSKTPDVVASVPEMSNATALDGGKVYCNEADHVRVAAVDNARVVLPAWCSFAQCDVRELHQLELARHKLIVMDPPWQNKSVSRGKRYDMFDHTELLKVDVPHIADLDECILAVWVTNRPRYMAYLREQALPSWGFTFHACWYWLKLCKDGELVTPLDSTHRLPVETLVVAYRAKDPHHEKLLRQRLGEQMRVVLSIPLRHSWKPPPECSFDEDIISRTDKKAELFARELRPCSSFNQAKCSTGAALSDAETSVVSPFTSKEKATQSVVDL >gi|Pisp1000009414|ref|jgi|PirE2_1|12713|gm1.11659_g YYEEDIFKFDFNKIGNDFQAIYMDPPLLLPGEEKTPGKITIEQFGSLKISRLVKKGFLFIWCEKEYIPNLIQICDKWGFKYVENFTWIKKYINNRIVRQPYTYFNKSKITCFIFRKEGDVELRHQRNPDCEFDYIKPKIPGELTEQKPEFIYKVIETLLPQARYSPTNNKGTRLLELFGRKNYKRRGWTSIVQKSD >gi|Pisp1000005866|ref|jgi|PirE2_1|53885|estExt_Genewise1Plus.C_1020015 MDPPWQLATHAPTRGVAIAYQQLPDQFIEELPIEKLQKNGFIFIWVINNKYVKAFELMKKWGYTFVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGDDPVGCKHKISSDVIYSVRRGQSQKPEELYEMIEELIPNGKYLEIFGRKNNLRDYWVTIGNEL >gi|Pbla1000012755|ref|jgi|Phybl1|76718|estExt_fgeneshPB_pg.C_40189 
MSDRPRRARKQKPTVASNVHYVGYVEDAESVEAIMKKFEELERIQKEFSTMDVKEPEVDIVEEMEVESQPLTEEQLQEVFKRTSAFTVKTATMDSRAADDMDALELWQVENKDGNTDEIYEEEDYIHVDDDFWDLEFGELPRPKRARKGGPVVERVARRGVDRESILAKYKVMQVQVQDRNGNYLLLHRFQNSTIDPSLPTYVKIPGKPIPRSWAHSILCKSKSQLPKPPPIASRLHTVNNILSTDLSRYGKSFSAVYMDPPLLLPGESPVPGKIHIDDLAKLNVSSVVSAGFLFIWLEKEWLQRIVSMASQWGFKYVENFCWIKKNINNQIHKSPYKYFNKSKLSLLIFRKEGDIELRHQRNPDCVFDFVKPMLPDEISEKKPEFMYNVIETLLPNAVYHPEKNPEGNGLLELWAKRGQRRTGWTTLVEQPIKEGENMERHRDIEAQRHGEE >gi|Pbla1000007785|ref|jgi|Phybl1|66651|fgeneshPB_pg.16__43 MSHNPQDSGDWQSEFLTFSLYRVSHSKMRKYLDIMDHILESDSTNPKSHVNFPLLLKPRVGKMSSRESTPSSILMDRDDFDETTVSDGTLKSLLKQESELHLQIDALQIEIATLEEKLGKQEKGDELDEQDLEEFEAPEWCVPIKANVMNFDWDSLAAEVQFDVIVTDPPWQLATHAPTRGVAIAYQQLPDICIEDIPVPKLQKNGFIFIWVINNKYAKAFELMEKWGYTYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGEDPPGCRHSVGSDVIFSERRGQSQKPEELYELIEELVPDGKYLEIFGRKNNLRDYWVTIGNEL >gi|Pbla1000005949|ref|jgi|Phybl1|63089|fgeneshPB_pg.6__310 MHRESSPSSSKPGRSSSSSQSPLPFRLSLSKKPKKLGSESTAISTSSKQNSPDVSESSLNSTIKAKDEDDEDDIQFDSITPRINTLDTDGESMDTSENSSEDEYIEDSAKKNISNRIKPDISSAKPRPTVRDTNSESKSAQSTDLQPLFLGEDTFSEASVKEWQTPRIKAWESRRSTPETFYFRFVAPGEGQSNGKWSKEEHKCFMDRYEEWIASGRKMGHSWGLFSIKIPHRVGYQCMNYYRRLVRRGEIKDNAYDTTNGVLKHVGRERASVTISSTELGPEWETEHVKNIEKNVNNWIKEFHNSTGRKPSGKSKTEKLVVRRSVPIGDLIKAQPARKRKRVSEINPDEEDFMEMQEDEPERMNVSTHPIDWEEGWKERLENYKDFMKPFLDTETRENYWHAKQRWRQGLVTTEMLLAKKPIQRLVPQETTEPIIKPLANRMQASLSRFFAGVKKLKVDTPEELISNVRIPNDLFSGVRHILPLKMAIEKKTGRTKTIYYEVDDIMDSIKTIDEVLENVPEHLSHQSPLEGVLVDPPWEFYVNDGRNDGRCTWNLKNMAMLMDKILDKMTAGLIFIWTHKLIQADVVKLMSTLGCKYVENLVWFKKSVNNVQLDQPSPYISSAKEILLMFKKGEGFELRHQRSADVIIEFESPREEWIHDEYTEPKPNAVYEMIETLLPKAAYDETLGRGRFLELWSKRVSPKREGWISFHQKKYPLVMTADPQIQDTEMCIKDEFSNMSIKEYIKNEDRIMDEDMLEG >gi|Pbla1000000301|ref|jgi|Phybl1|7247|gw1.70.18.1 EGFDCIVMDPPWPNKSVHRSSHYETQDIYDFFKIPLPSLLSEEHPSLVAVWVTNRPKYRRFVIDKLFKAWGVTWVTDWYWLKLTTKGEPVMPLDSPHRKPYEQLIIGRRIPESTGEIPSNINIPPRILASVPSNRHSRKPPLDSILAPYLPNKPKCLELFARCLTPGWTSWGNECLKFQHQHYF >gi|Mver1000006359|ref|MVEG_06347T0 | MVEG_06347 | Mortierella verticillata NRRL 6337 
hypothetical protein (466 aa) MELAAGRNDTWLLVPQRTPVPGWCVRPGTFRVTHPYDRTLDKSTGANAPSNRALKKRKVGAQALLEGNDVNKAEQGIVAWIKSEWLGLLDQEDVTLLFQSMAPDYFYGSCVYAQADEDVALDFVQLQPMLKMLTSGFHNTGASAEVGEDYEPFGMLQLSPSREQDKLLTLDLGDIYETLVTNNSSDAMVVSMASDGSPLYLIPPRSGFVISDFGQIHRLKDIAQKHSGFDMIVMDPPWQNASVDRMSHYGTMDLYDLFKIPIPHLLSKDGVVAVWITNRAKVQKVVVEKLFPAWGLTWVAHWFWLKVTTHGEPVLSLECGHRKPYEGILIGTRIPSNNTDMSTTTADLPNVDKECSSPHSVKKKLLVSVPSQHSRKPSIAQLLEKEFLASSEGNSNPDKEPRRLELFARNLEEGFVSWGNEPIRYQYCGRGHANGKLDIQDGLDIQDGLDVQDGFLVPAPRPIVD >gi|Mver1000003113|ref|MVEG_03108T0 | MVEG_03108 | Mortierella verticillata NRRL 6337 mRNA (2'-O-methyladenosine-N6-)-methyltransferase (309 aa) MTLPDEVVMTDSGNDSESGSFQDGNNSSTSSASNNTTRRTVLLGNSSLGKNDSPDLDDNTASGRLTKLLRRERELIETLEALARDIEQLEKKPEEEGEGGKDEDEEGDDLEEFEAPEWCVPIKANVMTYDWDSLAAECQFDVILMDPPWQLATHAPTRGVAIAYQQLPDICIEELPVPKLSSNGFIFIWVINNKYARAFDLMRKWGYSYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGVDPPNCRHSIGSDVIFSERRGQSQKPEELYELIEELVPNGRYLEIFGRKNNLRDYWVTVGNEL >gi|Mver1000002542|ref|MVEG_02535T0 | MVEG_02535 | Mortierella verticillata NRRL 6337 hypothetical protein (471 aa) MSTRSRRKPRKAQTINNAHYVGYVEENESIEAIMKKFEELARMEEEFKKSANTNTNTDTNTNTNAIANTNGEGGSNGTINDNSSPLTDGELILGVERVQAHDEHQGFTDEQLQEVFKRTSGFTVRSMMRDTPEDIYEEDVWQANIADEDYLYDFEEEDDYLMAMDDHFWDEEVGGSRKRGRKEKEPRVPREPRVKGERKRHVSDRESILNRYKVMQVRLQDRNGNFFTVKKKISTMDPSLPTYVRIPPVPIPRSWIHPIKPFVDTTVEIPGSRYEETNNVLDMDLKRFGTDYQVIYMDPPLLRAGEEPGPNKITMEQLATLDIGSILPKGFLFVWIEKEFLPDIVRLAERWEFRYVENFCWIKRNVNNLIAREPSPYFNSSKLSCLIFRKEGDVELRHQRSPDCVFDFPKPVNAATLSEEKPKFMYELIETLLPQAVYSESNPNGDKMMELWARPGTRRKGWTSICQTKV >gi|Mver1000001135|ref|MVEG_01134T0 | MVEG_01134 | Mortierella verticillata NRRL 6337 hypothetical protein (786 aa) 
MSPTTPQTDDLTSSSSTLPASSSLTTTTSSTANPLQTLNSSLKRIRLNDSRNDSSTPGSDDSLEREWGGIDPAELDEAFDAIQEELPSPKKPRREIKDEDEDEDEDFVVSSKLSVKAAQAPPSSNFGNISNSTLSATKFTPMIQRPSLPAKSSKGKLAKPTRSNNGMEDVFTHEQVRTWSEVRIKTWEHRRTNTEGFYYRFVDPTEGQQNGAWGKKSQQEFLERLEEWKSRGIRIGTSWGVFSMGVSHKAGYQCSSYYRKLLENKTLTDPAYAWENGKLVMVQKSSGGEMAISGLSTRWETEEVKEIEANITSWIKEYHSNGASRPAAKVKRVETSISSSSGSGSTATGSKIMNGMFRPTTEAPRVKATTAAAVSKALSLRPSKQPIDESEMIPVVDIDDKLAEYSSFMKKSTSTSSGPNKIRQTISLTVPGTVHTPTTTPARGVVSIEVRKSPLATHTVKGQTGLSLFWKNIRPVRVECPDTLISKYIIPKDVEQRPTWAHRVQAVTDPEDDESVVAGSIHKEVHTMVDFDWASLSENFDDPVTDIQGIMADPPWNFIVEDGRNDGACRLTTKAFGDIMEKALELMPSGIVCVWTHKAILPEVVSIMHGLGCRYVENLVWYKIALNNTNLDRASPYFRTSKEILLMFKKGDGFDIRHQRTADVIMDFEKPTSAWIEEDYTEPKPTAVFEMMEILLPDARHMPEAGRGRLLEIWAKKDPEYRRPGWFSIHELKGETPDGVVSKAPLQVIEIDEDSDAKSDDLDLLLRDNPEFEASHRDIDMEMDA >gi|Mcir1000010337|ref|jgi|Mucci2|157306|fgenesh1_kg.09_#_17_#_987_1_CCIA_CCIB_EXTA MSERPKRKRRQAASRSARVSNAHYVGYVEDNESVEAIMKKFEELERIQQEFTSSPQPVNDTLQLEQDDRDQDTLIEDSLTQEQLEEVFRRTSAFTVKSASVDPDFIVDMDALDLLQAEYRHNNTAEFIEEDDYYYVGDDFDLDGQVDDDGKDEQYRDRYYRRSTPSKKKRLDRQSVLNKHKMLAAQTKDETGRTVPVKRQACDIDPSLPTYVRIPPRPIKASWAHSIKPLSSNATPPTNAAYHEVHRLVDQDLTQYGSQFQAIYMDPPLLMAGEPPTPGKISIEDLAKLNVPDVMDTGFLFIWLEKEWLHRIVKIATQWGFRYVENYCWIKKNINNTIYTGESNYFCKSKLNLLIFKREQSKIEIRHQRNADCLFDFVKPMKAGQFTEPKPAHVYHVIETLLPKSVPQSKLLELWSTRNYRRQGWTTVAEVVCND >gi|Mcir1000003212|ref|jgi|Mucci2|138359|e_gw1.02.942.1 MYVFCPCVTPSHQLKTIQRLAETTQFDVILADPPWQLATNTPTRGVAIAYQQLPDVCIEELPIPKLQQNGFLFMWVINNKYAKAFELMEKWGYTYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGQDPPNCKHSLSSDVIYSERRGQSQKPEELYQMIEALVPNGRYLEIFGRKNNLRDYWVTIGNEL >gi|Mcir1000000489|ref|jgi|Mucci2|31991|Mucci1.e_gw1.1.979.1 GADLIVMDPPWPNKSVQRSSHYETQDIYDLYSIPMQSMMCSKDTIVAIWVTNKPKFRNFIINKLFPSWKLQCVSEWVWLKMTTQGECIFPIDSEHKKPYEQLIIGRPIASNNMSNRISMPESHTIVSVPSTRHSRKPPLHDVLSAYLSDKKQDPVCVEMFSRCLYPGWISWGNECLKFQHLSFFDKEDAIKEPE >gi|Lhya1000007505|ref|jgi|Lichy1|132223|e_gw1.100.12.1 
MTAGLVFVWTHKLLQADVVRLMDELDCRYVENLVWFKKYVNNIPVDQPSPYISSSKEILLIFRKVSSNIIGDGFELRHQRSPDVIIDFEQPASQWIQHEFTEPKPAAVYDMIETLLPKAGYDEELGRGRLLDLWTKRDAPRRDGWIAFHQTKPSDAKASTMTAQDDTTHEINGIHHKAMNETNSKSIEDQDDIMDIL >gi|Lhya1000007007|ref|jgi|Lichy1|163792|estExt_Genewise1Plus.C_880061 MSDRTTRSKRSKRNNAVSNVHYVGYVEDEESVEAIMKKFEELERIQSEIAGSSTPETPEITDNDVNMDSPDVNTAAARPLTEEQLEEVFKRTSAFTVKSAMIDTNDDDVDALELWQIEFQDGNTEEIFEEDDYMHVDDDFWDQEFGDAPQRPKRGRRAAIPRERPASTGRRGLDRDSIIAKYRIMQVQVQDQNGNFFMVKKRVSALDPGLPTYVKIPGAPIPRSWVHQIMEVPRIRKFSGSHLHEVDDILSLDLKQYGSDFKAIYMDPPLLLPGEEPTPGKIHVDDLLRLNIPEVIPCGFLFIWLEKEWLRRIVEIGEQWGFKYVENFCWIKKNINNQIHTSPGTYFKKSKLTLLVFRKEGDVELRHQRNPDCVFDFVKPMLPGECTESKPSFIYHVIETMLPGANYHPETNPDGKGLLQLWAKKGQQRTGWTTIIEKHN >gi|Lhya1000004692|ref|jgi|Lichy1|206348|estExt_Genemark1.C_470043 MTILYSTHNVDVIDCQSAFSKELVLHSQALQLRRGEFNVHEPYYRSSTTALAGQKRKRENTADIDTENWHQEHARPFLIKCIDELPNNVFSHLDNTEDTSTATAKDDSGIDFTSLISLAQASSRFTGPMDHLELTEENHVITMEPLDVFYRILSNPSPMNSMQITIEDQNYIMPPGASFYMSDMSTGMKDLKAHARSIGFYDFVVMDPPWPNKSVHRSSHYETQDIYDLFKIPMKQLIATSGLLAVWVTNKPKFRRFVINKLFPTWNIKCVGVWYWLKVTTHGEPVVPLDSPHRKPYEQLVLGIAMQQQQQEDPGTREEIPDKHAIISIPSRRHSRKPPLQDVMAKYLPKDPKCLELFARCLTPHWTSWGNECLKFQHTQYYETVEEEETIKDRVES >gi|Lhya1000001414|ref|jgi|Lichy1|141189|estExt_Genewise1.C_100084 MSREGTPSSDTLIDIDNFDENEVTDLGLKNLLKREIELQLLIDALQTEIAQLEEGINGKDKGDEEEELDDQDLEEYEAPEWCVPIKANVMNFDWDSLAKEVQFDVIVADPPWQLATHAPTRGVAIAYQQLPDICIEEIPIPKLQKNGFIFIWVINNKYAKAFELMERWGYTYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGEDPPNCRHSVGSDVIFSERRGQSQKPEELYELIEELVPDGKYLEIFGRKNNLRDYWVTIGNEL >gi|Lgig1000006160|ref|jgi|Lotgi1|134083|e_gw1.88.185.1 
MSDTWSDIQAVKTKQSSLRAKLAQRKKEREGLAKELNLGVSTSSSTVSTSVVDPEIEKKLPYVLTDINLEIPAESTVIKDFLTKSLERAVNQTVVDELLEKFAAQQLIRFESLNIEPFLTISASYTIITKIFFLYFQLSALTSDDKGRKRKREDGDEEDKRKDDKNSKQSKKSADILESLLTSQSAKEKENKKVNEEILQILSKPTAKEQYLVERFKSRGGVQLKEFCQVGTREECRKMNNTTEPCSKLHFRKIIHKHTDESLGDCSFLNTCFHMESCKYVHYEIDYPDKKPEPVKDIVKYKVPDVDSDVYMFPPQWIQCDLRIFDMTTIGKCAIVMADPPWDIHMELPYGTMSDDEMRRLDVPGLQDEGFIFLWVTGRAMELGRECLELWGYKRIDELIWVKTNQLQRIIRTGRTGHWLNHGKEHCLVGVKGNPKGANKGLDCDILVAEVRATSHKPDEIYGIIERLSPGTRKVELFGRPHNVQPNWITLGNQVDGVRLKDPDVVKLFKERYPDGNCMEPPKPR >gi|Lgig1000004573|ref|jgi|Lotgi1|121106|e_gw1.35.28.1 VKIFGEKYVIPPYCKFLLSDWKQLLQLHTVDANKYDLIVIDPPWKNKSVKRKKSYDTLWEDTLLDLPVTKLVNPGCLIVIWVTNKTRLHAFIENTLVEKWQLQQCVQWHWLKITRGGELICDINSNKKPFETLFIGRYDNSLTNQYSTVPNHRTIISIPCSLHSKKPSLAEILKPYLPEKPECLELFARNLQSNWTSWGNEVLKHQHLEFFDVT >gi|Hrob1000012694|ref|jgi|Helro1|153237 CHRIRTQIYYTSRKPDKIYGIIERLSTGSRKIELFGRLHNVRPNWVTITHQLPNIMIVDPKMKEAFSNSFPNGN >gi|Hrob1000012559|ref|jgi|Helro1|79167 MFIYFQKESSKEDQLIESLLSTQSAKERETNRLTEEILILLAKPTAKELLLSERFRSQGGKQVKEFCGYGTREECQKNNIICDRLHFKKIIHKHTDESLGDCSFLNTCFHMDTCKYVHYQIDYLAVNNRVHIVNDQQIISTSNNNVEELALSKIDDATTTLFPPQWVQCDLRQFDMSVLGKFAIVMADPPWDIHMELPYGTMSDDEMRKLNVPVLQDDGYIFLWVTGRAMELGRECLTLWGYERVDELIWVKTNQLQRIIRTGRTGHWLNHGKEHCLVGVKGNLPVNRGLDCDVIVSEVRATSHKPDEIYGIIERLSPGTRKIELFGRPHNVQPNWVTLGNQVDGVKLLDPEMVKAFRKVYPDGNCIK >gi|Hrob1000012135|ref|jgi|Helro1|121421 CNERHFNKIIHSNTDESLGDCQYQNMCFNMNFCKYIHYEKDEKDEVTFKMPSREKSYPPQWIQCDLRYFDFKILGKFSIVMADPPWDIHMDLPYGTMPDDVMKKLDVSSLQDDGYIFLWVTGRAMELGRECLTLWGYEKVDELIWVKTNQLQKVIRTGRTGHWLNHSKEHCLVGVKGNLTVNKGIDCDVIVSEIRDTSRKPDEIYGIIERFSPGTRKIELFGRLHNVRPNWLTIGNQLPSVMLIDSKMKEAFYNTYPNGIC >gi|Hrob1000005904|ref|jgi|Helro1|69400 YIIPPRSSFLLSDFSEIDLLAKVETKFDFVVVDPPWQNKSVKRKKRYYETFSMLNLVKIPMPEWCNENCLVAVWVTNKIKYQDYVREELFPSWNLQFVATWYWLKVTKKLTPVYPLLLSTSSTKHPKKPYELLMLGRFRLEKYHLMSNSNIDCKIQDRMVIISVPSAIHSHKPYLREVFKDHLPEDAKCLELFARNLQSGWTSWGNEVRVCVWIYQQYIFISLSLLGMQSTLACSQDLINII >gi|Hrob1000005423|ref|jgi|Helro1|68987 
CYERHFKKIIHSNTDESLGDCQYLNMCFNMNFCKYVHYEKDEKDEETFKMPSREKSFPPQWIQCDLRYFDFKILGKFSIVMADPPWDIHMDLPYGTMPDDVMKKLDVSSLQDDGYIFLWVTGRAMELGRECLTLWGYEKVDELIWVKTNQLQKVIRTCRTGHWLNHSKEHCLVGVKGNLTVNKGIDCDVIVSEIRDTSRKHDEIYGIIERFSPGTRKTELFGRLHNVRPNWLTIGNQLSSVMLIDSKMKEAFYNTYPNGICPK >gi|Hrob1000005324|ref|jgi|Helro1|164465 MYDVAWPRHQPIALNYWNWDEIEKLELEHVAALRSIIWLWCGGSNGLEASRKDGSEEKPVEIFHIIERFCFGRRRLHLFGRDTSVRPGWLTIGPKLTSTNYDRESYNANFVKNPVVTRLVALKRYQRLRPKSPPRKIGHQM >gi|Crev1000002253|ref|jgi|Coere1|80436|fgenesh1_kg.9_#_19_#_isotig04348 MWQDEIYYEDDDAAMAAYHEATRASSNDDDDDDFAPEPPTGHRGRTKSQRVSVSKKKTESARRATTNSHKPRIADRFMSSKLDTTLPSSSGEDQSNIGDEDIDIESLGDSMGVRDSCASAVPTPDIGQLKVCSSVGGSPSPPSPQQLETASATAMDTDEGSNDPSTALLALRQREQTIRARMEQLESEIAELEKKCGVEDSSDKKGRSEQLDLSEFRAPEWSVPIRANVMNFEWEKLAASCQFDVILMDPPWQLASQAPTRGVAIAYQQLPDVCIESLPIQLLQTNGFIFIWVINNKYTKAFQLMKQWGYTYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGADPQTLQRSVASDVIFSERRGQSQKPEEMYEIIEQLIPGGNYLEIFGRKNNLRDYWVTIGNEL >gi|Crev1000000518|ref|jgi|Coere1|37216|e_gw1.2.97.1 ISNQYYVGYVEDDESVDAIMKKFEELERIEKEFSAKKKSAADDSSTEHKSKEQPIESGMSLELLEEVFKRTSAFTVRGAMMDDFDLEVMDDIELWEAEAADNVLAEWEEEEDYVALHLDDDALDDEFGVVLPRRRYQRHDTAKKSGPKLSSRDQIIQRYKYMQVRVQDRHGHTFFVSRKVNAVDPSLPTYVRIPPNPISRSWVKHIRPLSYETESTPTPDWAQCIETDNTLEFDYSKFDRTFQCVYMDPPLLLPNETSKPGFFAIAKLATIPVNKLLVPGAFLFTWCEKELIPDMCEIAEKTWGLKYVENFCWVRQQTNNQIARLESPFFCRSKVTLLIFRGEGNLEMRHQRSPDSVFDFTKPRLPGDLNDRKPEFAYTVIETLLPRAICTKEEPDPNRLLQLWMPADSHRKNWTTIVQNTQ >gi|Cmer1000000996|ref|gnl|CMER|CMH026C similar to (N6-adenosine)-methyltransferase 
MKGERKRPQRHDRDADRELALRLHNEEKQELRGLRRQTRTSADKAVVEDATVMGRPCETVTGLDSFLSSCLNETGRHSRPRASRKRTVENAFEHFSVAKYTEKHAKLEPRKQGTSALPIDEIQTVPIKDLDQFPSCVGVAENQKTSSSLRRQIREQIRRLNDGYYINCDLRYFNLAYLRECVGNFDVVLIDPPWRIAGGQRASTPNGPMFTNNHWAVNYNTLSNEEILDLDIGCLSNSGLCFLWVVSSQLPTGMACLSRWGYEYIDKITWIKKRQGKLHVSHGYHFMHSSELCLIGVKRPCEFIGKVSNDLIFAEVREKSRKPDELYHVVETMLPGTAKIELFARNHNIRRGWLSLGNELGEQFCDWFNDFECDMCGARIHFGERRYKAKNRPNSDFCRDCYLEAISAGVTTETEWFELANDAADPVYHEYYECNHCNIYPLWGVRFHRDPDMDLCEQCYDELVANDAETEDASNNARTSSPDDWTAIESPICGGSLPVHRNIRCSSCLQCPIIGYRFSCTCCENLSLCQKCFFQQKCPKGHNADHDIVIIVDSEAALNALVRCDGCGIRPILGTRYRCNTCYAFDLCETCYKKVEQGEYDLQQVSQKKLAEHNRSHAFSAIPVSA >gi|Cmer1000000605|ref|gnl|CMER|CME116C similar to (N6-adenosine)-methyltransferase MNVLQTHLLQTVKAHLITRSRKWNTIEVNEDDTDSGWRQLDASCNRTHARHGAKTYKRHCLSGAAPRLVAMVESAPLGLLESTEELENRLLQSYVSLIKNLTQQNASSNPEDAHGGSENPAGAAAAPGATAAAEAAVARHEHPGGNETAKADALTDSVFSDLCYVPEDFMVPPHCIPVRADVRFADWDQIAAAANGNYDVILMDPPWQLATANPTRGVALGYNQLSDESILAIPLEKLQRCGLLLIWVINAKYRVALQMFERWGYRLVDEIVWVKLTVNRRLAKNHGFYLQHAKETCLVGVKGNDLSALSTAPGMPRPDVILSERRGQSQKPDELYEWIEALVPNGKYIEIFARKNNLRNFWVSIGNEVTGESFEQALPPELCKQLREELST >gi|Cmer1000000469|ref|gnl|CMER|CMD131C hypothetical protein MEALRITFWYLRYWNTIITDKGHQLCFGEKHRPRRLGIAQSLVQGIGAHCRTDLCHYFWRETVSIASVGISILHWVYLLFGACTGAAETLLCTDRVLAPGVDALGTMPRGTDATASDEREQEEQARATRPRRRAAARYGSQKRSVAVDPAYYLGYADDDETPEMIMRKFEALERLQAQKAANTQQSSGDANDSQTAKEEVTLTAAEQAELFRQTSFFSVDMLAGSTTQSLPGGVGESTDLIAVGALDDSSFERVLAEYGLDAVLFGAELSDSEPGSEYDDGEDLLWEEALESDRSARFPGRLTRSRPDRSESSRVALELWRARAALLMRLSAGGDALAAAQAAAMIPAALRARRRRLAAEGVLRETIPTEYPLPVSWARTIRPLREAVQEGQKLERNYECGRLLRFANVFQAAKTLHERIKGRHFRCVVLDPPCWNQPDFRVEQLRSLRIAEIVPFGFVFIWVPKTRIAETLRWFGEEMHQGGGGGFHFVESFCVVLKWTNNRVATLLSEPSDLLCLDQPVAVAEHAPSGVYACSHETCLVLRRPGPHIELAHQRNPDLCFDYVRLHQKRYQRRRPELFYHMIETMLPEVARDGDMLCVWADQSVCKRTCWVSVVEDQSLEAGESSS >gi|Chet1000009456|ref|jgi|CocheC5_1|33790|estExt_fgenesh1_pg.C_390006 
MFSTLPPPPPRASPTAQSAPHVPDPIIFQNADADITLVDIPASIVAAQGDRSDVLLSTAPLEEPIQLRQDYEPKTQKTRAQAAKVHHGDSTQQNEHDLQSLHLDDAGYKLLVEHALAQIRAHVSGPWCMRRQLMTQTSRSAQDGAMDLDSPSERNLELCMREWASRSQAKQDDMAFNLQQMMASLGAASEPADSAAAADKCILSYRHAPVIESNTQAADSVTQETQTVPWTCTFHNPNQHSLEATITDRTLQSASSAQDYRFAIPPRATLFLSDSTASDAFRWSFRRLTDEYSLARHFDLVLLDPPWPNRSAKRKGTYEQVGGMPYLKRMLLNMHLDMYLEHNALVGVWVTNKPSLKQHVLGPGGLFEAWNVGLMEEWIWIKTTTKGEPMFDIDSPMRKPYEMLLLGRAAPNSWSRMAHAPVVKRRVIAAVPDVHSRKPCLKSLLEPYLVDPTDYTALEVFSRYLVSGWTSWGNEVLKYNWDGYWVKAGSAEN >gi|Ccor1000008813|ref|jgi|Conco1|87489|estExt_fgenesh1_pg.C_3190005 MSSRRSKRKSTTRGNYVSPNSSHYVGYVEDEESVEAIMKKFEELEKIQNQPPSIPQNTSTNNDDMLTGNEVDELSLLTSEQATISGNNNQALTESQLEELFKNTSVFTVESALKHNQTYFPLNPNDFDDDDLEYYGDDLFILNEDEEFEEYDDIDLDDSDWELEMGLVSRKRVKRSSASGPREKRQSSTQKREHYRNTHLRVLDDLGKSFVLVKKQLPQDPTQPSYVRIPQNPVPVSWAHRIITLKQANQLRGGKRLGIDFINSFDQLIGFSGIGSIECILMNPPLVSDNFMQTKEEYLAHREPITVSQLLALPIKKLLKSGFLMIFVPKPLISKVVQRISNEWKLRYVENVAWLQLTPNQEILKADLDESYIRTSKLTLLMFRTEGDIDIRHQRSPDCVFDFVQPDVDLSKKLPNHHREFPNFIYKMIETLLPNGKSGLDNGFKLIEMWGHPKSRRDGWLSIYHKNQ >gi|Ccor1000007140|ref|jgi|Conco1|109297|CE21212_12273 MDPPWQLATHAPTRGVAIGYQQLPDLFIEQLPIPSLQQNGFIFIWVINNKYVKAFELMERWGYEYVDDICWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGKDPPNCRKGIGSDVIFSERRGQSQKPIELYELIEELVPNGKYLEIFGRKNNLRDYWVTIGNEL >gi|Ccor1000001300|ref|jgi|Conco1|3188|gm1.1428_g MEGLKSLQPLNLTNSQYLDQQPLNKFNHQVNKGVNFAQFDDLIKLIYKFNDECDIDRIELSDEEVNTLDIEHLFHHLISNSTDKVCKLSIANYTQTYAIPPNSKFIMSDLESFNDLLIYKQPASCIYDLILMDPPWFNHSIKRGNHYKGQDKYTLLNIEIPKLLHSRGILAIWITNQPSILKFATLKLFPKWKLKLVSTWYWVKITTQGDLVISLDGERERKPYEILLLATFESNQQFTEIPSNFIYSVPFQYHSRKPNLFPYLSELIDYKNGNDITKLELFGRYLTQGCTTWGNEVLKFQESYFLD >gi|Caps1000024491|ref|jgi|Capca1|192776|fgenesh1_pg.C_scaffold_975000002 
MADSQDTWKEIQAHKSRQLSLREKLAMRKKAREEVVAQVAEIIGEPAALGPPTAKAGKVESRAVEKELLIVLDEATLNLPVNLEALMVEMSKVQTNISPKLIENLLQKFSAQMLIRIVSAVSASLAGVPGQKAPRKRKHEGNESEEVGRKEAKKSHSEPSIEVGEVDVLTSLLLMQSAKERESKKLNEEIVQLLSQPTAKEQSLVEKFKSQGGAQVKEFCQFGTIGECQKINGVNQHCGKLHFKKIIHKHTDESLGDCSFLNTCFHMDTCKYVHYEIAYPEISPKAQPSTISKKTEGTILYPPQWVHCDLRNFDVSVLGKFSVIMADPPWDIHMELPYGTMQDNEMRNLQVPLLQDDGFIFLWVTGRAMELGRECLTLWGYDRVDELIWVKANQLQRIIRTGRTGHWLNHGKEHCLIGMKGNPTNINRGLDCDVIVAEVRATSHKPDEIYGIIERLAPGSRKLELFGRPHNVQPNWITLGNQLDGVKLIDPEVVEAFKKKYPNGYNIVKGKLPPPT >gi|Caps1000008038|ref|jgi|Capca1|218111|fgenesh1_pg.C_scaffold_355000005 MKRMKTRKKSTQTPALSSKVPKAPILTTITVRTLLTPESALRITSGMLVRYMHCDLDTYDMRDLDSKFDVILIEPPLEEYQRYLGVSREKFWSWQDIENLQIESIAAQRSFIWIWCGFGEGLDAARRCLRKWGFRRCEDICWIKTNIKNPGHNKNLGPKAIFQRTKEHCLMGIKGTVRRSTDGDFIHANVDIDLIIEEEFPPGSDEKPVEIFHIIEHFCLGRRRLHVFGRDLSIRPGWMTIGPDVTNTNYNAKTYNAYFNKTPDGFLTGCTEEIERLRPKSPPAKMKDGKGSGRGGRGGGGNKGGGGARGGGGRGNFSGRGGFQGRGRGGQRSGPPR >gi|Caps1000003296|ref|jgi|Capca1|94532|e_gw1.295.23.1 MPGRTNIKLADLVGVFAVNSADTSSIIPFSGINYIVPPRCSFLLSDVSNPHLLPPNVQYDLIVMDPPWENKSVKRKKNYQMVRDFELEDIPIGQLATDGCLVVTWVTNKQQQQQLVKETLFPKWGITPLATWYWLKVTTEGEPVYPMRSQHSKKPYEALILGCKSLSPPLKIPDHKVILSIPSCIHSHKPPLHDILQDFLPSSTPRCLEIFARSLHPRWTSWGNEVSSDNISKIE >gi|Bnat1000020351|ref|jgi|Bigna1|81922|fgenesh1_pg.85_#_83 MRHNKTGNDEVYRMCGHQFVIPRGAKFLLADITGLGALLKAKPNPGFRLIVLDPPWPSMSVSRSHKYKTLNPRDLINLPIRKLLYYDDDYMEEEEEEEDVTKGKKKERKTAEFSPSWVMIWVTNDPDLHRLVRHKMLKKWRCE >gi|Bden1000006054|ref|BDEG_06038 | BDET_06055 | Batrachochytrium dendrobatidis conserved hypothetical protein (translation) (183 aa) MGELASMAVSSRFADPTTLSTVQSITLSANSHSIGLADLFNIQISNRDGSLPVVLDICGHKYIIPEHSWFVMSDFSFLSTLCKCLLDQPLRFSLVLMDPPWENKSVKRAKQYASMDCYRLSNIPLGQLVLPNGIVAVWVTNKSKVREMVLNRLFPAWGVEFVTEWVWLKVTHHGDLVIDLER >gi|Bcir1000011661|ref|jgi|Bacci1|265856|fgenesh1_pg.4_#_30 
MSVSIIDTEKNFSTELYRLKPGDFDIKEPYFRPSKSAPTESNTKKRRKRNEPKQADIDTQKRHEELKPFLTACLAMLKWETKEILLHNKEEEDTIKEAIDFPTIQAMVQSAHLKFDQQDEEEEAPYHFETDCCKELDIFQIFNRVYINPTKKITLLEFNKDATYLLPPRSTFLMGSMQDSLKQLGSYVHSIGGADLIVMDPPWPNKSVYRSSQYGCQDIYDLFSIPIPDMLTPDSVVAIWVTNKPKFRHFILDKLLPAWKLKCVAEWIWLKVTVKGECVFPLDSSHKKPYEQLIIAKPVESESTIPKQHMMVSIPSKRHSRKPPLHDLLTKYVKKDPVCVELFARCLLPGWISWGNECLKFQQLDYFEKG >gi|Bcir1000007777|ref|jgi|Bacci1|290217|fgenesh1_pm.3_#_58 MSSGIIFVWVHKLIQGDVIRLMYELDCRYVENLVWFKKSCNNNVLDRPSPYIASTKEILLLFKKGDSIDLRHQRSPDVVIDFEIAPEHWINEEYTEPKPAMVHNMIQTMLPGGIYNETLKRGKLLELWAKKSATRREGWISFHQVKHSLQMETE >gi|Bcir1000006779|ref|jgi|Bacci1|219299|estExt_Genewise1.C_720022 MSDRPKRKRKERARVSNAHYVGYVEDEESVEAIMKKFEELERIQQEFSTKVDIKEDDVVEEEMEEEIAVDEVENKFTQEQLEEVFKRTSSFTVKSATIDTSFVDDLDALDLWQVEYHDNDTNEFYEEDEYTYMDDFFWDEEFGGDNNKKRTGGRQPRIPKEPRGTRKLDRESIIAKYRIMQVQVQDKHGNYFTVKKRVSNVDPSLPTYVKIPARPIPMSWAHKIKPIIRKQHISVPGSRYEVVDSILSTDLSSFGNSFSAIYMDPPLLLPGEKPTRGKISIDDFANLKISNIMKAGFLFIWLEKEWLRQIVAIAAKWGFKYVENFCWIKKDLNNQIHKSKYTYFNKSKLTLLIFRKEGDIELRHQRNPDCVFDYIRPMLPDEVTESKPPFMYHVIETLLPNAIYHPEKNPNNDKLLELWSKKDQRRQGWTTLCE >gi|Bcir1000003454|ref|jgi|Bacci1|330834|estExt_fgenesh1_pg.C_270053 MSTVSSRESSPFSNVDDDVLDSSLKELLEEEKELQLEIEAVRVEISKLEERLGVSANDNLGDKELEEFEAPQWCIPIKANVMTYDWEALAAQEQFDVILTDPPWQLATHAPTRGVAIAYQQLPDICIEDLPIPKLSKNGFLFIWVINNKYAKAFELMEKWGYTYVDDITWVKQTVNRRMAKGHGYYLQHAKETCLVGKKGDDPPNCRHSIASDIIFSERRGQSQKPEELYELIEELVPNGKYLEIFGRKNNLRDYWVTIGNEL >gi|Aque1000027926|ref|Aqu1.228194 MVREYQAEVKANGISDRYDSSRTWRSKTKHSARHKSFMKKVRNKRRHRQERIQRRTEEYEDEKVLLPSEIDPKLEQRVLYALLNPTLEIPTQGTKLLEMSGTDFENLPNLLQKLYGQDLICLQSDGYTITSINLESISRLIEMKNLSLRQSNIPRFKTDPEEIEYLLNAPSVRDKETKRVGSEIQELLTAKSYQEQLIKQKFQSAGGSQLREFCPQKTREDCRRVSRSGRACPRLHFRRIIQSHTDESLGDCSFLNTCFHMESCKFVHYEIDQTQETESRKGGIKPRPSLQSLGSKLVPPQWLNCDLRNFDTSVLGKFAVVMADPPWDIHMELPYGTMSDDEMRQLDIPSLQDDGFIFLWVTGRAMELGRECLTLWGYERIDELVWVKTNQLQRLIRTGRTGHWINHGKEHCLVGAKGNLQGVNRGIDTDVIVAEVRATSRKPDEIYGVIERLSPGTRKIELFGRQHNCQPNWLTLGNQLEGDNLHDPELRERFYSRYPEKLHAYPAVT 
>gi|Aque1000012323|ref|Aqu1.212591 MAGVGPSRGRSGTPSAIPHLQSSSDRGDETGSSCISSLRQCRKLLRSLKKRSVLKRKIASRKSGMSHLRYILKLSSQREFEMNDNHKFLINPKRHRVTQSKPKIKESAATKENGEDQTFKGSDVFLKGTQSANPHNDYSQHFVDTGQRPQNFIRDTGMNQRFEEYPKLKELIRLKDKQIKDGAIPPVYYKVDLSSFDLTSLDAKFDVILIDPPLEEYQRRTTGITYPWQPWDFEEIMNLKIEDVSAPRSFVFLWCGSCEGLDLGRECLKKWGFRRCEDICWVKTNMNDPGNTTHLEQKSIFQHTKEHCLMGIKGTVRRNQDGHFIHANIDLDIIISEEPEMGNNDKPEEIFHIIEHFCLGRKRLHLFGNDATVRPGWLTLGPNLSSSNYHKESYLANFTDESGGPLLKFDETIETLRPKTPPPKGRGIGRGGLNVNPGMGGVGRGRGSYT >gi|Aque1000011545|ref|Aqu1.211813 MRLVSLDSSTSFFFDDRTAANRRGRRGPHIPSIEDARKSVAYHRSISRYLRFSNRSLGPNPKRAKPKQSKGRMQMPGRAKSGDDSIFPPMPSKRYEVIYADPPWDYKGQLQHCGAGGGDSGGAMRHYPTVTLADLKTLPVDRIAADDSLLFLWATSPHLDQAIDLGKAWGFAWATVAFVWDKCKTNPGYYTLSQCELCLAFKKGKIPSPRGARNIRQLVSEPRQGHSRKPDEVRRRIESMFPDQSRIELFAREPAAGWKAWGLEAQWDAKRPRLTRPRIDR >gi|Adig1000021966|ref|adi_v1.16957 MSLWDIKSLPVPQLVAPGALVGVWVTNKQKYLRFTRSELFPHWSVELVAEWFWAKVTRRGELVTELDSPHKKPYEPLLIGRFQPMMKLLRSESLNSGKDLKDIDSCPGMDLLLNYQKRRKISLSHNFENNGTISLDGTHRTVSSSTNESEMTNRESDVRCTECTKGKTDTSNALLEVRTAQSSNLHGLDGKELVDIVTGDVPANSRSKHVDKQSTRNQTKGKIVGQSAREISETGLRLNLEEGDRDNKPSKSQSEILQTGFDGCINLPYHQVICSVPCRIHSRKPPLNDILEKYVPPQPLCLEMFARNLTPNWTSWGNE >gi|Adig1000009633|ref|adi_v1.06363 VLISEEKKEGKHTSQPDKEDADKAKDYEEEDVEEVVYKDSSHFLKILKLEIEKVAAQRSFLFLWCGSHEGLNEGRKPPLGSTEKPTEIFHIMEHFCLGRRRLHLFAGDDTLRPGWLSVGPSLSSSNFSADTYNSYFSDAPEQLLVGSTQEIETLRPKSPTGKHKGDGGGRGGAIAPSRGGSGPRARGGVRGGMMRGAGFQGKGGRGFISFSVR >gi|Adig1000003645|ref|adi_v1.21218 MSLWDIKSLPVPHLVAPGALVGVWVTNKQKYLRFTKSELFPHWSVELVAEWFWAKVTRRGELVTELDSPHKKPYEPLLIGRFQPMMKLLKSESLNSGKDLKNIYSCPGMDSHLNSDKRRKISLSHNFENSGTLSLGGAHITVSSSTNESTMTNCASEVRCTECTKGKADRSNALLGVKTPKSSNLYGLDCEELVDIVRGDVPLKSTSKHVDKQRTTNQIQGKIVGLSAREISETGLRLNLEEGDRDNKPSESQSEILQTVFDGCIHLPYHQVICSVPCRIHSRKPPLNNILEKYVPPQSLCLEMFARNLTPNWTSWGNE >gi|Adig1000001851|ref|adi_v1.03360 
MDTCKYVHYEVDYSDVEMNKEDKGKDKKKDEKTSLANTVEGDNRVTLYPPQWIQCDMRYFDMTVLGKFSVVMADPPWDIHMELPYGTMSDDEMRKLDVPSLQDEGYIFLWVTGRAMELGRECLKLWGYERCDELIWVKTNQLQRLIRTGRTGHWLNHGKEHCLVGVKGDCKNFNRGLDCDVLVAEVRQTSHKPDEVYGVIERLAPGTRKIELFGRQHNVQPNWITLGNQLDGVRLLDPEVTARFKKRYPNGIVLNTSSTKI >gi|Abis1000008599|ref|jgi|Agabi_varbisH97_2|121782|Genemark.8250_g MISTAELLCEANDVLAAHASLLDHVRASQHQFRRHLHTLQSPPEELLKLPTVPSSPLLTPDDASPSPSPPLMQEDQERSDLPAPKKARLARYRNYVPEEETIRNDYSQRYVDGGEWPQDWVIGAEPEHRFEEYPKQQRLLTLKKNSVNSHATPPYYLPYHELSSLHPNKFDIILLDPPFSSSFSWEQLLELPIPNLAADPSFVFLWVGSGAGEGLERGREVLAKWGYRRCEDVVWVKTNKTSNQGPGTDPPTTSLFTRTKQHCLIGIRGTVRRSTDSWFVHCNVDTDVIIWEGDPADPTRKPPEMYTLIENFCLGIRRLEIFGRATSLRRGWVTVLTRGNDRQLAVSEDGSVHVEGEEGGLATTWRQETWDEQVKSLLTNGRAVVPMTPEIDALRPKSPVRHNQNISGGGGSAMSGGVAVGIPGGSNNNSMNTNSGGARFNSGNRPNAFINHGPAAMLPPNQMVNPNQMMVQQQNMMGMGVGVGGVNQFGMGMGVPVPMEEMMSAGGWNHHHMMNAGPMGGMGPPGIPGAPGHVGMNASNVNMGMSGVGPMGGVNMPLHHHHHHHHQQMMNQMGMGGGGFQGHAGVGFGANGMPVFNPAAMNNAGMGGWGDQGPMVNGMNMGGMNMNMNMNNMPHNMGMGGQWGNGGF >gi|Abis1000003455|ref|jgi|Agabi_varbisH97_2|67638|e_gw1.4.1246.1 CDRVHFRPLIRPHTDPSLGHCSYLNTCYSEPTYAQSPSIPAYPGRGKEKAPCRYLHYEVDWDPTDAENEKTKERVAVKGKPHRLEIGLGPPGREATPLPPQWINCDLRKFDYSVLGKFHVIMADPPWDIHMSLPYGTMTDDEMRAMPIPALQDEGLLFLWVTGRAMEVGRECLRVWGYTRVDEVIWVKTNQLQRVIRTGRTGHWLNHTKEHMLVGVKTPSSPSDGPETELKFPKWVNRGVDTDVIVSEVRETSRKPDEVYGLIERMCPGGRKVEIFGRKHNARPGWLTLGNQLGPADQIWEEDLLERVRAK